You are on page 1of 23

DATA WAREHOUSING & DATA MINING

Subject : MIS Prof. : Mr. Awesh A. Bhornya

Compiled by:NAME ROLL NO. -----------------------------------------------------------------------------------------ABBAS KAGDI 01 ALFIYA SAYED 02 ANSARI MOHD. SHOEB 03 ANSARI TAUSEEF 04 ANSARI YOUSUF 05 ARBAZ AHMED 06

DATA WAREHOUSING
Definition
A process of transforming data into information and making it available to users in a timely enough manner to make a difference

Data Warehousing

FUNCTIONS OF DATA WAREHOUSING

Data Warehousing
OLTP

Most database operations involve On-Line Transaction Processing (OLTP).


Short, simple, frequent queries and/or modifications, each involving a small number of tuples. Examples: Answering queries from a Web interface, sales at cash registers, selling airline tickets.

Data Warehousing OLTP vs Data Warehouse


OLTP
1. 2. 3. 4. 5. 6. 7. Application Oriented Used To Run Business Detailed Data Current Up To Date Isolated Data Repetitive Access Clerical User

Warehouse (DSS)
1. 2. 3. 4. 5. 6. 7. Subject Oriented Used To Analyze Business Summarized And Refined Snapshot Data Integrated Data Ad-hoc Access Knowledge User (Manager)

Data Warehousing To summarize ...


OLTP Systems are used to run a business

The Data Warehouse helps to optimize the business

Data Warehousing
OLAP Of increasing importance are On-Line Application Processing (OLAP) queries.

Few, but complex queries may run for hours. Queries do not depend on having an absolutely up-to-date database.

Data Warehousing
OLAP Examples 1. Amazon analyzes purchases by its customers to come up with an individual screen with products of likely interest to the customer. Analysts at Wal-Mart look for items with increasing sales in some region.

2.

Data Warehousing

Who in the World Needs a Data Warehouse? Why Do We Need Data Warehousing?

Data Warehousing
Typical Data Warehouse Questions

Data Warehousing
Fundamental Business Questions

Data Warehousing
ARTICLE - 1 Getting the most out of data warehousing Prabhakar Deshpande, TNN Jun 10, 2004, 12.37am IST Information Technology is the technology to extract, generate and distribute information. And information is very clearly defined as meaningful data. Information is data that has been processed into a form that is meaningful to the recipient. It should be very evident that data management should be the core of any information technology thinking. Data warehousing and data mining are avenues in data management to get better information. Data warehouse stores information from various databases into a single location - cleaned and processed into the right formats for analysis. The process of transforming raw data into data warehouse involves steps such as extraction getting data out of original database and transferring it to database infrastructure. Consolidation is process of combining data from several sources into one database. Cleansing is the process of correcting data.

Data Warehousing
ARTICLE - 2 After core banking, PSBs try data warehousing Aniruddha Ghosh, TNN May 8, 2007, 04.10am IST A couple of years after embarking on their core-banking strategies, most public sector banks are well on track to complete networking of their branches and provide 'anywhere anytime' banking. However, competition from new-age banks and MNCs is now forcing them to look beyond core-banking solutions. PSU banks are now getting into data warehousing and dynamic customer profiling to analyze customer behavior and improve marketing strategy. Globally, banks use data warehousing solutions for various functions, including measurement of performance, analyzing profitability, managing risks and compliance requirements, reporting on regulatory norms and customer-relationship management. Deployment of a data warehouse and business intelligence capability is the next logical step for public-sector banks, feel most bankers. "Core banking will give us a common platform for providing various other initiatives," feels SK Sehgal, SBI's general manager-IT. "The network will facilitate international remittances as well as payments of insurance premium etc," he added. Implementing CBS has also enabled banks release some of their staff to market products and increase cross-selling, said an official from Bank of India.

Data Warehousing
STEPS in Building a Data Warehouse
Extracting the transactional data from the data sources into a staging area Transforming the transactional data Loading the transformed data into a dimensional database Building pre-calculated summary values to speed up report generation Building (or purchasing) a front-end reporting tool

DATA MINING
Definition
Data mining (sometimes called data or knowledge discovery) is the process of analyzing data from different perspectives and summarizing it into useful information, information that can be used to increase revenue, cuts costs, or both.

Data Mining
Functions
Data Collection Data Scrubbing Pre-testing Analysis/Training Model Building Application

Techniques And Tools Include, For Example


Decision Tree Learning Bayesian Classification Neural Networks

Data Mining Need for Data Mining Data Mining Challenges


Increasing data dimensionality and data size. Various forms of data. New types of data like streaming data and multimedia data. Efficiency in data access and information search methods. Intelligent upgrade and integration methods.

Data Mining
ARTICLE - 1 Technology Data mining student performance February 29, 2000 IDG A lot of companies use data mining to comb through databases to figure out which of their products sells best and where. Now a public school district is using equally sophisticated technology to analyze student performance. Floridas Broward County School District is outfitting each of its 207 schools with an IBM AS400 running IBMs DB2 so school administrators can record student test scores and absenteeism, and analyze trends using IBMs desktop data mining tools. ; The goal is to give administrators an easy way to get a historical view of each childs academic performance, says Nancy Terrell, director of strategic planning and accounting for the district.

Data Mining
ARTICLE - 2 iGate-Patni ties up with Rio Tinto to provide R&D and engineering services Agencies Mar 27, 2012, 02.49PM IST BANGALORE: Patni Computer Systems Ltd, the software services provider controlled by iGate Corp, won an outsourcing order from Rio Tinto PLC, the world's thirdlargest mining company, the two companies said in a statement. Rio Tinto expects to spend around $60 million to $80 million over the next five years of the partnership, the statement said on Tuesday.

Data Mining

Steps in Data Mining


Business Understanding Data Understanding Data Preparation Modeling Evaluation Deployment

CONCLUSION

You might also like