Views |
|
InformationTechnology:Database:DataWarehouse
[edit] Navigation[edit] Related categories[edit] About this pageWe apologize for the little information we provide, this page is still under construction. Please stay tuned.
[edit] Data Warehousing overviewThe term "data warehouse" has become one of the most used and abused terms in the IT vocabulary. A data warehouse can be any collection of summarised data from various sources, structured and optimised for query access using OLAP (On-Line Analytical Processing) query tools. A data warehouse is virtually any database containing data from more than one source, collected for the purpose of providing management information. The term "data warehousing" was coined by the mid-1980s and, in essence, it was referring to an architectural model for the flow of data from operational systems to decision support environments. Based on analogies with real warehouses, data warehouses were initially intended as large-scale collection/storage/staging areas for bulk-supplied data, in provenance from the various operational systems. Once stored in the data warehouse, data could be classified and distributed to "data marts" (equivalent to "retail stores") to be accessed by Decision Support Systems (DSS) and users. Along the way this architectural vision changed and the term data warehouse started to be used to refer to OLAP-enabled decision-support databases. The costs of data warehousing projects are usually very high, driven up primarily by the requirement to collect, "clean" and integrate data from different sources, often legacy systems. The cost of extracting, cleaning and integrating data can eat-up to 60-80% of the total cost of a data warehousing project or decision support project. On the tooling side, the costs associated with OLAP tools, data integration technology, data extraction tools, graphical user query tools, etc, represent a small proportion of a project's total cost.A data warehouse is populated through a series of steps represented by the acronym ETL (Extract, Transform and Load)
Once the data is made available in the Data Warehouse, it has to be processed.It is important to distinguish the capabilities of a Data Warehouse from those of an OLAP (On-Line Analytical Processing) system. In contrast to a Data Warehouse, which is usually based on relational technology, OLAP uses a multidimensional view of aggregate data to provide quick access to strategic information for further analysis. A Data Warehouse Architecture (DWA) represents the structure of data, flows, processing and presentation that exists and is made up of a number of interconnected parts.
Two of the pioneers in the Data Warehouse field were Ralph Kimball [1] and Bill Inmon [2] and many of the Data Warehouse terms and concepts were coined defined by them. [edit] Data Warehousing resourcesHere are some general resources on Data Warehousing
[edit] OLAP productsChoosing an OLAP product among the choice of OLAP tools from different vendors is no easy task because the metadata formats tend to be proprietary. Recent efforts by the Object Modeling Group (OMG) [14] resulted in a Common Warehouse Metamodel specification. The Common Warehouse Metamodel (CWM) describes metadata interchange among data warehousing, business intelligence, knowledge management and portal technologies.There are 2 types of OLAP
OLAP applications are generally characterized by the following common features
[edit] OLAP resourcesHere are some resources on the subject
[edit] OLAP products on the marketHere are some popular OLAP products
[edit] Reporting tools on the marketTwo of the most popular reporting toolsets are |
|
| This page was last modified 01:43, 15 March 2008. |