A Comparison of
Data Warehousing
Methodologies
Using a common set of attributes to determine which methodology to use in a particular data warehousing project.
DATA INTEGRATION TECHNOLOGIES have experienced explosive growth in the last few years, and data warehousing has played a major role in the integration process. A data warehouse is a subjectoriented, integrated, time-variant, and nonvolatile collection of data that supports managerial decision making [4]. Data warehousing has been cited as the highest-priority post-millennium project of more than half of IT executives. A large number of data warehousing methodologies and tools are available to support the growing market. However, with so many methodologies to choose from, a major concern for many firms is which one to employ in a given data warehousing project. In this article, we review and compare several prominent data warehousing methodologies based on a common set of attributes.
Online transaction processing (OLTP) systems are useful for addressing the operational data needs of a firm. However, they are not well suited for supporting decision-support queries or business questions that managers typically need to address. Such questions involve analytics including aggregation, drilldown, and slicing/dicing of data, which are best supported by online analytical processing (OLAP) systems.
Data warehouses support OLAP applications by storing and maintaining data in multidimensional format. Data in an OLAP warehouse is extracted and loaded from multiple OLTP data sources (including
DB2, Oracle, IMS databases, and flat files) using
Extract, Transfer, and Load (ETL) tools.
The warehouse is located in a presentation server.
It can span enterprisewide data needs or can be a collection of “conforming” data marts [8]. Data marts
(subsets of data warehouses) are conformed by following a standard set of attribute declarations called a
data
References: 2. DCI Seminar Workbook—Strategies and Tools for Successful Data Warehouses. DCI, Andover, MA, 1999; www.dciexpo.com. Addison-Wesley, Reading, MA, 1997. 4. Inmon, W.H. Building the Data Warehouse, 3rd edition. Wiley, New York, 2002. 5. Inmon, W. Metadata in the data warehouse: A statement of vision. White Paper, Tech Topic 10, Pine Cone Systems, Colorado, 1997; www.inmoncif.com/library/whiteprs/techtopic/tt10.pdf. 7. Inmon, W. Metadata in the data warehouse, White Paper, 2000; www.inmoncif.com/library/whiteprs/earlywp/ttmeta.pdf. 8. Kimball, R. and Ross, M. The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling, 2nd edition, Wiley, New York, 2002. 9. Kimball, R., Reeves, L., Ross, M., and Thronthwaite, W. The Data Warehouse Lifecycle Toolkit. Wiley, New York, 1998.