Topics Covered
What is a data warehouse?
Definition of a data warehouse
Why organizations use data warehousing?
OLTP vs. OLAP
Dimensional Modeling
Dimensions and Measures
Types of data warehouses
Data warehouse schemas and other basics
6 June 2011
A data warehouse is a store of data obtained from a variety of sources and made available to end users in a form they can understand and use in a business context.
OLAP
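Users: knowledge worker
Function: decision support
DB design: subject-oriented
Data: historical, summarized, multidimensional, integrated, consolidated
Usage: ad-hoc
Access: lots of scans
Unit of work: complex query
Records accessed: millions
Number of users: hundreds
DB size: 100GB-TB
Metric: query throughput, response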
Property                      OLTP              OLAP
Indexes                       Few               Many
Joins                         Many              Some
Duplicated data               Normalized DBMS   Denormalized DBMS
Derived data and aggregates   Rare              Common
Dimensional Modeling
Dimensional Modeling is a different approach to database design. Features of Dimensional Modeling are:
Highly denormalized schema
Data is contained in two types of tables: dimension tables and fact tables
Dimension tables usually have a large number of columns and relatively few rows
Fact tables usually have few columns and a large number of rows
For Example
Location Dimension - Table Schema
Field Name     Type
Dim_Id         Integer(4)
Loc_Code       Varchar(4)
Name
State_Name
Country_Name
For Example
Location Dimension - Table Data
Dim_Id   Loc_Code   Name           State_Name         Country_Name
1001     IL01       Chicago Loop   Illinois           USA
1002     IL02       Brooklyn       New York           USA
1003     MX01       Mexico City    Distrito Federal   Mexico
1004     TO01       Toronto        Ontario            Canada
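The Location dimension above can be sketched in code. This is a minimal illustration using Python's built-in sqlite3 module, not something the slides prescribe: the table name `location_dim` is hypothetical, and the TEXT types for Name, State_Name, and Country_Name are assumptions, since the schema slide lists types only for the first two fields.

```python
import sqlite3

# In-memory database, used only for illustration.
conn = sqlite3.connect(":memory:")

# Dim_Id is the surrogate key of the dimension.
# Column types after Loc_Code are assumed (the slide does not give them).
conn.execute("""
    CREATE TABLE location_dim (
        Dim_Id       INTEGER PRIMARY KEY,
        Loc_Code     VARCHAR(4) NOT NULL,
        Name         TEXT,
        State_Name   TEXT,
        Country_Name TEXT
    )
""")

# Rows copied from the Location Dimension table data.
rows = [
    (1001, "IL01", "Chicago Loop", "Illinois", "USA"),
    (1002, "IL02", "Brooklyn", "New York", "USA"),
    (1003, "MX01", "Mexico City", "Distrito Federal", "Mexico"),
    (1004, "TO01", "Toronto", "Ontario", "Canada"),
]
conn.executemany("INSERT INTO location_dim VALUES (?, ?, ?, ?, ?)", rows)

# Descriptive attributes of a dimension are what users filter and group by.
usa = [
    name
    for (name,) in conn.execute(
        "SELECT Name FROM location_dim "
        "WHERE Country_Name = 'USA' ORDER BY Dim_Id"
    )
]
print(usa)  # ['Chicago Loop', 'Brooklyn']
```

Each row describes one location once, and fact rows then refer to it by Dim_Id rather than repeating the descriptive text.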
For Example
Monthly Sales Fact - Table Schema
Field Name     Type
TM_Dim_Id      Integer(4)
PR_Dim_Id      Integer(4)
LOC_Dim_Id     Integer(4)
Sales          Integer(4)
Tax            Integer(4)
For Example
Monthly Sales Fact - Table Data
TM_Dim_Id   PR_Dim_Id   LOC_Dim_Id   Sales       Tax
1001        1001        1003         89513383    8900
1002        1002        1001         25468926    2512
1003        1001        1003         777215631   7796
1001        1004        1001         65894001    6574
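To show how a fact table is used together with a dimension, the sketch below loads the Monthly Sales Fact rows and rolls Sales and Tax up by country through LOC_Dim_Id. It assumes SQLite via Python's sqlite3 module; the table names `monthly_sales_fact` and `location_dim` are hypothetical, and only the Location dimension is joined because the slides give no data for the time and product dimensions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Only the location columns needed for this roll-up are recreated here.
    CREATE TABLE location_dim (
        Dim_Id       INTEGER PRIMARY KEY,
        Name         TEXT,
        Country_Name TEXT
    );
    -- The fact table holds dimension keys plus numeric measures.
    CREATE TABLE monthly_sales_fact (
        TM_Dim_Id  INTEGER,
        PR_Dim_Id  INTEGER,
        LOC_Dim_Id INTEGER REFERENCES location_dim(Dim_Id),
        Sales      INTEGER,
        Tax        INTEGER
    );
""")

conn.executemany(
    "INSERT INTO location_dim VALUES (?, ?, ?)",
    [
        (1001, "Chicago Loop", "USA"),
        (1002, "Brooklyn", "USA"),
        (1003, "Mexico City", "Mexico"),
        (1004, "Toronto", "Canada"),
    ],
)
conn.executemany(
    "INSERT INTO monthly_sales_fact VALUES (?, ?, ?, ?, ?)",
    [
        (1001, 1001, 1003, 89513383, 8900),
        (1002, 1002, 1001, 25468926, 2512),
        (1003, 1001, 1003, 777215631, 7796),
        (1001, 1004, 1001, 65894001, 6574),
    ],
)

# Join facts to the dimension and aggregate the measures per country.
sales_by_country = conn.execute("""
    SELECT d.Country_Name, SUM(f.Sales), SUM(f.Tax)
    FROM monthly_sales_fact AS f
    JOIN location_dim AS d ON d.Dim_Id = f.LOC_Dim_Id
    GROUP BY d.Country_Name
    ORDER BY d.Country_Name
""").fetchall()
print(sales_by_country)
# [('Mexico', 866729014, 16696), ('USA', 91362927, 9086)]
```

Countries with no fact rows (Canada here) do not appear in the result because the inner join keeps only locations that have sales.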
[Diagram; surviving labels: Flat files, Raw data, Summary data, Analysis, Reporting, Data mining]
[Diagram; surviving labels: Flat files, Raw data, Summary data, Purchasing, Sales, Inventory, Analysis, Reporting, Data mining]
Time Dimension
Customer Dimension
Store Dimension
Product: Product ID, Prod Name, Prod Desc, Category, QOH
Store: Store ID, City, State, Country, Region
Customer: Customer ID, Cust Name, Cust City, Cust Country
Measurements
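The layout above, with Product, Store, and Customer tables around a central Measurements table, matches what is commonly called a star schema. The following sketch is one hedged way to express it, assuming SQLite through Python's sqlite3 module. The lowercase identifiers, the `amount` measure, and all sample rows are hypothetical and chosen only to make the query runnable.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Denormalized dimensions: each holds its whole hierarchy in one table.
    CREATE TABLE product  (product_id INTEGER PRIMARY KEY, prod_name TEXT, category TEXT);
    CREATE TABLE store    (store_id INTEGER PRIMARY KEY, city TEXT, country TEXT);
    CREATE TABLE customer (customer_id INTEGER PRIMARY KEY, cust_name TEXT);
    -- Central fact table: one foreign key per dimension plus a measure.
    CREATE TABLE measurements (
        product_id  INTEGER REFERENCES product(product_id),
        store_id    INTEGER REFERENCES store(store_id),
        customer_id INTEGER REFERENCES customer(customer_id),
        amount      INTEGER  -- hypothetical measure
    );
    -- Hypothetical sample rows.
    INSERT INTO product  VALUES (1, 'Pen', 'Stationery'), (2, 'Desk', 'Furniture');
    INSERT INTO store    VALUES (1, 'Chicago', 'USA'), (2, 'Toronto', 'Canada');
    INSERT INTO customer VALUES (1, 'Ann');
    INSERT INTO measurements VALUES (1, 1, 1, 10), (2, 1, 1, 200), (1, 2, 1, 5), (1, 1, 1, 7);
""")

# Each dimension is reached with exactly one join from the fact table.
totals = conn.execute("""
    SELECT p.category, s.country, SUM(m.amount)
    FROM measurements AS m
    JOIN product AS p ON p.product_id = m.product_id
    JOIN store   AS s ON s.store_id   = m.store_id
    GROUP BY p.category, s.country
    ORDER BY p.category, s.country
""").fetchall()
print(totals)
# [('Furniture', 'USA', 200), ('Stationery', 'Canada', 5), ('Stationery', 'USA', 17)]
```

Because category and country live directly in the dimension tables, grouping by them needs no further joins, at the cost of repeating those values across dimension rows.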
[Diagram; surviving labels: Product, Sub Cat, Category; Store, City, State]
Date: Date ID, Date, Month ID
Month: Month ID, Month, Year ID
Sub Cat: Sub cat ID, Cat ID
Category: Cat ID, Category
Store: Store ID, Store, City ID
City: City ID, City, State ID
State: State ID, State, Country ID
Country: Country ID, Country
Customer: Customer ID, Cust Country
Measurements
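Here each hierarchy level sits in its own table and points to its parent by ID, an arrangement usually called a snowflake schema. The sketch below follows the Store, City, State, Country chain, assuming SQLite through Python's sqlite3 module. The lowercase identifiers, the `amount` measure, and the sample rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Normalized hierarchy: every level references its parent level.
    CREATE TABLE country (country_id INTEGER PRIMARY KEY, country TEXT);
    CREATE TABLE state   (state_id INTEGER PRIMARY KEY, state TEXT,
                          country_id INTEGER REFERENCES country(country_id));
    CREATE TABLE city    (city_id INTEGER PRIMARY KEY, city TEXT,
                          state_id INTEGER REFERENCES state(state_id));
    CREATE TABLE store   (store_id INTEGER PRIMARY KEY, store TEXT,
                          city_id INTEGER REFERENCES city(city_id));
    CREATE TABLE measurements (
        store_id INTEGER REFERENCES store(store_id),
        amount   INTEGER  -- hypothetical measure
    );
    -- Hypothetical sample rows.
    INSERT INTO country VALUES (1, 'USA'), (2, 'Canada');
    INSERT INTO state   VALUES (1, 'Illinois', 1), (2, 'Ontario', 2);
    INSERT INTO city    VALUES (1, 'Chicago', 1), (2, 'Toronto', 2);
    INSERT INTO store   VALUES (1, 'Loop Store', 1), (2, 'Queen St Store', 2);
    INSERT INTO measurements VALUES (1, 100), (1, 50), (2, 30);
""")

# Rolling up to country walks the whole chain: store -> city -> state -> country.
by_country = conn.execute("""
    SELECT co.country, SUM(m.amount)
    FROM measurements AS m
    JOIN store   AS s  ON s.store_id    = m.store_id
    JOIN city    AS ci ON ci.city_id    = s.city_id
    JOIN state   AS st ON st.state_id   = ci.state_id
    JOIN country AS co ON co.country_id = st.country_id
    GROUP BY co.country
    ORDER BY co.country
""").fetchall()
print(by_country)  # [('Canada', 30), ('USA', 150)]
```

The trade-off is visible in the query: each country name is stored once, but reaching it from a fact row takes four joins instead of one.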
Store Dimension
Customer: Customer ID, Cust Name, Cust City, Cust Country
Measurements: Fcst_Weight_net, Fcst_Turnover
Thank You