
Code No: RR410508 RR

IV B.Tech I Semester (RR) Supplementary Examinations, November 2010


DATA MINING AND DATA WAREHOUSING
(Common to Computer Science & Engineering and Information Technology)
Time: 3 hours Max Marks: 80
Answer any FIVE Questions
All Questions carry equal marks
*****

1. (a) What is a Data Warehouse? Discuss in detail.
(b) Describe with the help of a figure the typical process flow within a Data Warehouse. [8+8]
2. (a) When is a summary table too big to be useful?
(b) Relate and discuss the various degrees of aggregation within summary tables. [8+8]
3. Write short notes on the following:
(a) Rule based optimizer
(b) Data shipping
(c) Shared disk systems
(d) Distributed lock manager. [4+4+4+4]
4. (a) Write the basic terminology involved in backup strategies.
(b) Describe the effect on database design while implementing a backup strategy. [8+8]
5. How much CPU bandwidth is required? Explain why. [16]
6. What is a splitting criterion? [2] With an example, explain the following:
(a) Class Histogram, and [7]
(b) Count Matrix. [7]
7. (a) What is text clustering? Discuss the principles underlying text clustering. [2+6]
(b) What are the different types of web mining? How is web usage mining different from web structure mining and web content mining? [3+5]
8. (a) What is time series analysis? What is n-series? Write in detail about the similarity function. [3+2+3]
(b) What is feature extraction from time series? Discuss the major problems with these feature extraction techniques. [3+5]

*****
