You are on page 1of 15

KONSEP DATA WAREHOUSE

Fakultas Teknologi Informasi, UNTAR 2017


The Evolution of Data Warehousing

Diperolehnya keunggulan kompetitif melalui otomatisasi sistem proses


bisnis pada perusahaan, sejak 1970an.
Akumulasi jumlah data dalam basis data operasional tumbuh lebih
banyak.
Organisasi sekarang ini berfokus pada cara menggunakan data
operasional untuk mendukung pengambilan keputusan, sebagai sarana
untuk mendapatkan keunggulan kompetitif.
Organisasi perlu untuk mengaktifkan data arsipnya menjadi sumber
pengetahuan, sehingga terintegrasi pada tempat tunggal dan
terkonsolidasnya tampilan data organisasi yang disajikan kepada
pengguna
Sebuah data warehouse dianggap sebagai solusi untuk memenuhi
persyaratan sistem yang mampu mendukung pengambilan keputusan,
dan menerima data dari berbagai sumber data operasional
What is a Data Warehouse?

A data warehouse is simply a single, complete, and


consistent store of data obtained from a variety of sources
and made available to end users in a way they can
understand and use it in a business context.
-- Barry Devlin, IBM Consultant

A Data Warehouse is a subject-oriented, integrated, time-


varying, non-volatile collection of data that is used primarily in
organizational decision making.
-- W.H. Inmon, Building the Data Warehouse, 1992
Subject-Oriented Data
Warehouse is organized around major subjects of the
enterprise (e.g. customers, products, sales) rather
than major application areas (e.g. customer
invoicing, stock control, product sales).
This is reflected in the need to store decision-support
data rather than application-oriented data.
Integrated Data
The data warehouse integrates corporate
application-oriented data from different source
systems, which often includes data that is
inconsistent.
The integrated data source must be made consistent
to present a unified view of the data to the users.
Non volatile data
Data in the warehouse is not updated in real-time
but is refreshed from operational systems on a regular
basis.
New data is always added as a supplement to the
database, rather than a replacement.
Terminoloy
Data warehouses have the distinguishing characteristic that
they are mainly intended for decision support applications.
Traditional databases are transactional.
Applications that data warehouse supports are:
OLAP (Online Analytical Processing) is a term used to
describe the analysis of complex data from the data
warehouse.
DSS (Decision Support Systems) also known as EIS (Executive
Information Systems) supports organizations leading
decision makers for making complex and important
decisions.
Data Mining is used for knowledge discovery, the process of
searching data for unanticipated new knowledge.
Conceptual Sturucture of Data Warehouse
Data Warehouse processing involves
Cleaning and reformatting of data
OLAP
Data Mining Back Flushing
Data Warehouse

OLAP
Data
Cleaning Reformatting DSSI
Databases EIS
Metadata
Data
Mining

Other Data Inputs Updates/New Data


Comparison of OLTP Systems and Data Warehousing
Data Warehouse Queries
Types of queries that a data warehouse is expected
to answer ranges from the relatively simple to the
highly complex and is dependent on the type of
end-user access tools used.
End-user access tools include:
Reporting, query, and application development tools
Executive information systems (EIS)
OLAP tools
Data mining tools
Typical Architecture of a Data Warehouse
Architecture Components of Data Warehouse

Operational Data Store (ODS)


(a repository of integrated operational data to be moved into
the warehouse)
Load manager
(extracts data from data sources or ODS and loads data into the
warehouse)
Warehouse manager
(ensures data consistency, transformation, and merging, sets
indexes and views, generation of de-normalizations and
aggregations, backing-up and archiving data)
Architecture Components of Data Warehouse

Query manager
(manages user queries using vendor end-user data access tools,
data warehouse monitoring tools, database facilities, and
custom-built programs)
End-user access tools
(tools for reporting, query, OLAP and data mining tools)
Data mart
(a warehouse part for one business obj. of a particular dept. or
business function
Problems of Data Warehousing (cont..)
Data ownership
(sensitive data may be made accessible after loading into
a warehouse)
High maintenance
(modifications in the source systems may affect the data
warehouse)
Long-duration projects
(building of a large warehouse can take up to three years)
Complexity of integration
(integration of the input data as well as tools for ETL is quite
difficult
QUESTIONS ?

You might also like