
NARAYANA ENGINEERING COLLEGE :: NELLORE
SUBJECT : DATA MINING & DATA WAREHOUSING
ACADEMIC YEAR : 2008-2009
YEAR : IV B.TECH.
BRANCH : CSE
FACULTY : R.RAJANI

UNIT 1
1. (a) Draw and explain the architecture for on-line analytical mining.
   (b) Briefly discuss data warehouse applications.
2. (a) Explain data mining as a step in the process of knowledge discovery.
   (b) Differentiate between operational database systems and data warehousing.

UNIT 2
1. (a) Briefly discuss the role of data cube aggregation and dimension reduction in the data reduction process.
2. (a) Briefly explain data integration.
   (b) Briefly discuss data transformation.

UNIT 3
1. Write the syntax for the following data mining primitives :
   (a) Task-relevant data
   (b) Concept hierarchies.
2. (a) Describe why it is important to have a data mining query language.
   (b) The four major types of concept hierarchies are : schema hierarchies, set-grouping hierarchies, operation-derived hierarchies, and rule-based hierarchies. Briefly define each type of hierarchy.

UNIT 4
1. Write short notes on the following in detail :
   (a) Measuring the central tendency
   (b) Measuring the dispersion of data.
2. (a) Write the algorithm for attribute-oriented induction. Explain the steps involved in it.
   (b) How can concept description mining be performed incrementally and in a distributed manner?

UNIT 5
1. (a) Write the FP-growth algorithm. Explain.
   (b) What is an iceberg query? Explain with an example.
2. Explain the Apriori algorithm with an example.

UNIT 6
1. (a) What is classification? What is prediction?
   (b) What is Bayes' theorem? Explain naive Bayesian classification.
   (c) Discuss K-nearest neighbor classifiers and case-based reasoning.
2. Discuss backpropagation classification.

UNIT 7
1. (a) Write algorithms for K-Means and K-Medoids. Explain.
   (b) Discuss density-based methods.
2. (a) Given two objects represented by the tuples (22,1,42,10) and (20,0,36,8) :
       (i) Compute the Euclidean distance between the two objects.
       (ii) Compute the Manhattan distance between the two objects.
       (iii) Compute the Minkowski distance between the two objects, using q=3.
   (b) Explain statistical-based outlier detection and deviation-based outlier detection.
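
For checking answers to the distance question in Unit 7, the three measures can be sketched as a single Minkowski distance with q = 1 (Manhattan), q = 2 (Euclidean) and q = 3. This is only a minimal sketch; the function name `minkowski` and the variable names are illustrative assumptions, not part of the question paper.

```python
def minkowski(x, y, q):
    """Minkowski distance of order q between two equal-length tuples."""
    # Sum |x_i - y_i|^q over all attributes, then take the q-th root.
    return sum(abs(a - b) ** q for a, b in zip(x, y)) ** (1.0 / q)

# The two objects given in the question.
x = (22, 1, 42, 10)
y = (20, 0, 36, 8)

manhattan = minkowski(x, y, 1)   # |2| + |1| + |6| + |2| = 11
euclidean = minkowski(x, y, 2)   # sqrt(4 + 1 + 36 + 4) = sqrt(45)
mink3 = minkowski(x, y, 3)       # cube root of (8 + 1 + 216 + 8) = 233

print(f"Manhattan: {manhattan}")
print(f"Euclidean: {euclidean:.4f}")
print(f"Minkowski (q=3): {mink3:.4f}")
```

The attribute-wise differences are 2, 1, 6 and 2, so the Manhattan distance is 11, the Euclidean distance is the square root of 45 (about 6.71), and the Minkowski distance with q = 3 is the cube root of 233 (about 6.15).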

UNIT 8
1. Explain the following :
   (a) Construction and mining of object cubes.
   (b) Mining associations in multimedia data.
   (c) Periodicity analysis.
   (d) Latent semantic indexing.
2. (a) Give an example of generalization-based mining of plan databases by divide-and-conquer.
   (b) What is sequential pattern matching? Explain.
   (c) Explain the construction of a multilayered web information base.
