You are on page 1of 2

Proceedings of the 26th Academic Council held on 18.5.

2012

  CSC522 DATA WAREHOUSING AND DATA MINING LTPC 3 0 0 3

Course
Prerequisite: Database Management Systems

Objective
To help students: understand the fundamental processes, concepts and techniques of data
mining and develop an appreciation for the inherent complexity of the data-mining task;
advance relevant programming skills; and advance research skills through the investigation of
data-mining literature.
 
Expected Outcome 
On completion of this course student should have gained a good understanding of the basic
concepts, principles and techniques of data mining. Specifically, student should be able to:
1. Define knowledge discovery and data mining
2. Recognize the key areas and issues in data mining
3. Apply the techniques of clustering, classification, association finding, feature selection and
visualisation to real world data
4. Determine whether a real world problem has a data mining solution
5. Apply evaluation metrics to select data mining techniques  

Unit No. I: FUNDAMENTALS OF DATA MINING  


Introduction to Data Mining – Data Mining Functionalities, Steps in Data Mining Process –
Architecture of a Typical Data Mining Systems – Classification of Data Mining systems, Data
Mining Task primitives, Major issues in Data mining.

Unit No. II: DATA PREPROCESSING AND ASSOCIATION RULES


Data Preprocessing – Data Cleaning – Integration – Transformation – Reduction –– Concept
Description Data Generalization and Summarization Based Characterization – Mining
Association Rules in Large Databases.
Mining Frequent Patterns - basic concepts - Efficient and scalable frequent item set mining
methods, Apriori algorithm, FP-Growth algorithm, Associations - mining various kinds of
association rules.

Unit No. III: PREDICTIVE MODELING 


Classification and Prediction Issues Regarding Classification and Prediction – Classification by
Decision Tree Induction – Bayesian Classification – Other Classification Methods – Prediction –
Clusters Analysis – Basics of cluster analysis -Types of Data in Cluster Analysis – Categorization
of Major Clustering Methods – Partitioning Methods – Hierarchical Methods.

Unit No. IV: DATA WAREHOUSING 


Data Warehousing Components – Multi Dimensional Data Model – Data Warehouse Architecture
– Data Warehouse Implementation – Mapping the Data Warehouse to Multiprocessor
Architecture – OLAP – Need – Categorization of OLAP Tools. Uses of data warehouse.

Unit No. V: APPLICATIONS 


Applications of Data Mining – Social Impacts of Data Mining – Tools – An Introduction to DB
Miner – Case studies – Mining WWW – Mining Text Databases – Mining Spatial Databases.

Text / Reference Books  
1. Jiawei Han and Micheline Kambers, “Data Mining –Concepts and Techniques”, 2nd edition,
Morgan Kaufman Publications, 2011.
2. David Hand, Heikki Mannila and Prdhraic Smyth, “Principles of Data Mining”, 3rd edition,
Morgan Kaufman Publications, 2009.

401
Proceedings of the 26th Academic Council held on 18.5.2012

3. M. Kantardzic, “Data Mining: Concepts, Models, Methods, and Algorithms”, 2nd edition,
Wiley-IEEE Press, 2011.

Mode of Evaluation : By assignments, and Continuous Assessment


Tests(CAT) 
Recommended by the Board of
Studies on

Date of Approval by the Academic


Council
 

402

You might also like