Professional Documents
Culture Documents
Surajit Chaudhuri
Microsoft Reserch, Redmond
Umeshwar Dayal
HP Labs, Palo Alto
Outline
Introduction Need of Data Warehousing and OLAP Architecture of Data Warehousing Front-Back End Tools Database Design Methodology Conclusion
Decision support requires historical data which operational Databases do not typically maintain
OLAP
Tiered Architecture
External Sources Operational Databases
Extract Transform Load Refresh
Tier3: Clients
Data Warehouse
Serve
Data Marts
Data Sources
Data Storage
Conceptual Model
Date
TV PC PVR sum 1
sum
ALL
Country
Star Schema
Snowflake Schema
A refinement of star
schema where
hierarchy is normalized into a set
of smaller dimension
tables, forming a shape similar to snowflake
Star Schema
Time
T_key T_day T_day_week T_month T_quarter T_year
Branch
B_key B_name B_type
location
location_key street city province country
Dollars_sold
Avg_sales
Star Schema
Snowflake Schema
Time
T_key T_day T_day_week T_month T_quarter T_year
Branch
B_key B_name B_type
Location
location_key street city City C_key C_city C_province C_country
dollars_sold
avg_sales
Snowflake Schema
Summary
Data warehouse
A subject-oriented, integrated, time-variant, and nonvolatile collection of data in support of managements decision-making process
OLAP operations: drilling, rolling, slicing, dicing and pivoting Multi dimensional model of Data warehouse
Data cube Star Schema Snowflake Schema
Thank You