Data Warehouse
Architecture
[Figure: data warehouse architecture. Components shown: operational data stores (ODS 1, ODS 2, ODS 3); Load Manager; Warehouse Manager; Query Manager; DBMS holding meta-data, detailed data, lightly summarized data, and highly summarized data; archive/backup data; end-user tools (reporting, query, application development, and EIS tools; OLAP tools; data mining).]
Operational data store (ODS)
An ODS is a repository of current, integrated operational data
used for analysis.
Data Flows
[Figure: data warehouse data flows. Flows labeled: inflow, upflow, downflow, outflow, and meta-flow. Components shown: operational data sources 1 to n; operational data store (ODS); Load Manager; Warehouse Manager; DBMS holding meta-data, detailed data, lightly summarized data, and highly summarized data; archive/backup data; reporting, query, application development, and EIS (executive information system) tools.]
Issues to Be Addressed in Building a Data Warehouse
Warehouse Schema
Fact Table:
Stores the business data. The data in a fact table are
called facts. Fact tables contain multidimensional data.
Dimension Table:
To minimize storage requirements, the dimension
attributes in a fact table are usually short identifiers that are
foreign keys into other tables, called dimension
tables.
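The relationship between a fact table and a dimension table can be sketched with Python's built-in sqlite3 module. This is a minimal illustration, not the schema from the slides: the table names region_dim and sales_fact and the column region_desc are assumptions made for the example. The fact table stores only the short region identifier, and the database rejects a fact row whose identifier has no matching dimension row.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite enforces foreign keys only when this pragma is switched on.
conn.execute("PRAGMA foreign_keys = ON")

# Dimension table (hypothetical name): one row per region with its description.
conn.execute("""
    CREATE TABLE region_dim (
        region_id   TEXT PRIMARY KEY,
        region_desc TEXT NOT NULL
    )
""")

# Fact table (hypothetical name): a measure plus a short key into the dimension.
conn.execute("""
    CREATE TABLE sales_fact (
        month     TEXT NOT NULL,
        region_id TEXT NOT NULL REFERENCES region_dim(region_id),
        net_sales INTEGER NOT NULL
    )
""")

conn.executemany(
    "INSERT INTO region_dim VALUES (?, ?)",
    [("NE", "Northeast"), ("SW", "Southwest")],
)
conn.execute("INSERT INTO sales_fact VALUES ('01-1996', 'SW', 30000)")

# A fact row pointing at a region that is not in the dimension table fails.
try:
    conn.execute("INSERT INTO sales_fact VALUES ('01-1996', 'XX', 1)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

fact_rows = conn.execute("SELECT COUNT(*) FROM sales_fact").fetchone()[0]
print(rejected, fact_rows)
```

Keeping the description text in the dimension table means each fact row repeats only the two-character key, which is the storage saving described above.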
[Figure: multidimensional view of the data. Labels shown: PRODUCT; AREA (Area 1, Area 2, Area 3); DURATION; Year; Beginning Date; Completion Date.]
Star Schema
[Figure: star schema. The fact table SALES sits at the center, surrounded by dimension tables; the legible dimension labels are AREA, TIME, and CUSTOMER.]
Dimension Tables

Region_Dimension_Table
region_id  region_doc
NE         Northeast
NW         Northwest
SE         Southeast
SW         Southwest

Product_Dimension_Table
prod_grp_id  prod_id  prod_grp_desc   prod_desc
10           100      Fewer devices   Power supply
20           140      Circuit boards  Motherboard
30           220      Components      Co-processor

Account_Dimension_Table
account_id
100000      ABC Electronics
110000      Midway Electric
120000      Victor Components
130000      Washburn, Inc.
140000      Zerox Inc.

Vendor_Dimension_Table
vend_id  vendor_desc
100      PowerAge, Inc.
200      Advanced Micro Devices
300      Farad Incorporated

Time_Dimension_Table
month    mo_in_fiscal_yr  month_name
01-1996  4                January
02-1996  5                February
03-1996  6                March

Fact Table
Monthly_Sales_Summary_Table
month    prod_id  region_id  account_id  vend_id  net-sales  gross_sales
01-1996  100      SW         100000      100      30,000     50,000
02-1996  140      NE         110000      200      23,000     42,000
03-1996  220      SW         100000      300      32,000     49,000
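As a rough sketch of how a star schema is queried, the sample rows from the slides can be loaded into an in-memory SQLite database and joined from the fact table out to its dimensions. The table names region_dim, time_dim, and monthly_sales, and the column names region_desc and net_sales, are shortened stand-ins chosen for this example; only the row values come from the tables above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE region_dim (region_id TEXT PRIMARY KEY, region_desc TEXT);
    CREATE TABLE time_dim (
        month TEXT PRIMARY KEY, mo_in_fiscal_yr INTEGER, month_name TEXT
    );
    CREATE TABLE monthly_sales (
        month TEXT, prod_id INTEGER, region_id TEXT,
        account_id INTEGER, vend_id INTEGER,
        net_sales INTEGER, gross_sales INTEGER
    );
""")

conn.executemany("INSERT INTO region_dim VALUES (?, ?)", [
    ("NE", "Northeast"), ("NW", "Northwest"),
    ("SE", "Southeast"), ("SW", "Southwest"),
])
conn.executemany("INSERT INTO time_dim VALUES (?, ?, ?)", [
    ("01-1996", 4, "January"), ("02-1996", 5, "February"), ("03-1996", 6, "March"),
])
conn.executemany("INSERT INTO monthly_sales VALUES (?, ?, ?, ?, ?, ?, ?)", [
    ("01-1996", 100, "SW", 100000, 100, 30000, 50000),
    ("02-1996", 140, "NE", 110000, 200, 23000, 42000),
    ("03-1996", 220, "SW", 100000, 300, 32000, 49000),
])

# One join per dimension: total net sales for each region description.
totals = conn.execute("""
    SELECT r.region_desc, SUM(f.net_sales)
    FROM monthly_sales AS f
    JOIN region_dim AS r ON r.region_id = f.region_id
    GROUP BY r.region_desc
    ORDER BY r.region_desc
""").fetchall()

# Two dimensions at once: gross sales labeled by month name and region.
by_month = conn.execute("""
    SELECT t.month_name, r.region_desc, f.gross_sales
    FROM monthly_sales AS f
    JOIN time_dim AS t ON t.month = f.month
    JOIN region_dim AS r ON r.region_id = f.region_id
    ORDER BY t.mo_in_fiscal_yr
""").fetchall()

print(totals)
print(by_month)
```

Every dimension is exactly one join away from the fact table, which is what gives the star schema its shape.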
Snowflake Schema
Unmanageable Data
Difficult to Retrieve Data
Metadata Becomes Complex
[Figure: snowflake schema. The fact table SALES is surrounded by the dimension tables PRODUCT, AREA, TIME, and CUSTOMER; the labels Product Category, Product, and Manufacturer appear alongside the PRODUCT dimension.]
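A snowflake schema splits a dimension into further tables, so reaching a descriptive attribute can take more than one join. The sketch below assumes that the product group from the sample product data plays the role of a product category; the table names product_group, product, and sales are hypothetical, and the rows reuse the sample values shown earlier in the slides.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Outer table of the snowflaked dimension (hypothetical name).
    CREATE TABLE product_group (
        prod_grp_id INTEGER PRIMARY KEY, prod_grp_desc TEXT
    );
    -- Inner dimension table, which points outward to its group.
    CREATE TABLE product (
        prod_id INTEGER PRIMARY KEY,
        prod_desc TEXT,
        prod_grp_id INTEGER REFERENCES product_group(prod_grp_id)
    );
    -- Fact table, reduced to the columns needed here.
    CREATE TABLE sales (
        month TEXT,
        prod_id INTEGER REFERENCES product(prod_id),
        net_sales INTEGER
    );
""")

conn.executemany("INSERT INTO product_group VALUES (?, ?)", [
    (10, "Fewer devices"), (20, "Circuit boards"), (30, "Components"),
])
conn.executemany("INSERT INTO product VALUES (?, ?, ?)", [
    (100, "Power supply", 10), (140, "Motherboard", 20), (220, "Co-processor", 30),
])
conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", [
    ("01-1996", 100, 30000), ("02-1996", 140, 23000), ("03-1996", 220, 32000),
])

# Fact -> product -> product_group: two joins to reach the group description.
group_totals = conn.execute("""
    SELECT g.prod_grp_desc, SUM(s.net_sales)
    FROM sales AS s
    JOIN product AS p ON p.prod_id = s.prod_id
    JOIN product_group AS g ON g.prod_grp_id = p.prod_grp_id
    GROUP BY g.prod_grp_id, g.prod_grp_desc
    ORDER BY g.prod_grp_id
""").fetchall()

print(group_totals)
```

In a star schema the group description would sit directly in the product dimension table and need one join; the extra hop here illustrates why retrieval is harder in a snowflake design.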
Starflake Schema
[Figure: starflake schema. Labels shown: Fact Table SALES; Star Dimension (Product, Location); Snowflake Dimension; Price; Weight; Location 1; Location 2.]
Problems
Data mart
[Figure: data warehouse and data mart architecture. Warehouse components shown: operational data source 1; ODS 1, ODS 2, ODS 3; Load Manager; Warehouse Manager; Query Manager; DBMS holding meta-data, detailed data, lightly summarized data, and highly summarized data; archive/backup data; reporting, query, application development, and EIS tools; OLAP tools; data mining. Data marts are shown holding summarized data in a relational database or a multi-dimension database. The tiers are labeled First Tier and Second Tier.]
Data Mining