(b) Describe with the help of a figure the typical process flow within a Data Warehouse. [8+8]
3. (a) Describe the distinct capabilities of a parallel technology of a data warehouse system.
i. degree of parallelism
ii. parallel index build
4. (a) Describe the role of security restrictions once the data warehouse has gone live.
(b) What are the audit requirements for imposing security restrictions at the outset of a data
warehouse? [8+8]
5. (a) Discuss, with a neat sketch, the data flow through a data warehouse with reference to tuning
the data load.
6. (a) Describe the class histogram, count matrix and AVC sets. Are they similar in some respect?
[6+2]
(b) Compare ID3 and C4.5 DECISION TREE construction algorithms. [8]
7. (a) What is text clustering? Discuss the principles underlying text clustering.
[2+6]
(b) Discuss the relationship between text mining and information retrieval and information extraction.
[8]
8. What is the event prediction problem? Explain the PLANMINE and TIMEWEAVER algorithms. Compare the
two algorithms. [4+6+6]
*****
Code No: R5410503 2
IV B.Tech I Semester(R05) Supplementary Examinations, May/June 2009
DATA WAREHOUSING AND DATA MINING
(Computer Science & Engineering)
Time: 3 hours Max Marks: 80
Answer any FIVE Questions
All Questions carry equal marks
*****
3. “Design and management of a data warehouse on an MPP system is considerably more difficult than
on an SMP or cluster system.” Do you support the above statement or not? Justify your stand. [16]
4. (a) Explain the need for security and its effect on the performance of the data warehouse.
(b) Describe the impact of security on the design of the data warehouse. [8+8]
5. (a) Is daily processing different from overnight processing in the load estimation process?
(b) What are the system administration requirements of database siting? [10+6]
6. What is a decision tree? Explain the CART and ID3 algorithms with an example. Compare the
CART and ID3 algorithms. [3+9+4]
7. (a) What is text clustering? Discuss the principles underlying text clustering.
[2+6]
(b) Discuss the relationship between text mining and information retrieval and information extraction.
[8]
8. (a) What is the “Constrained Sequence Mining Problem”? In which situations would we use constrained
sequence mining? [8]
(b) Discuss the SPIRIT algorithm. In what way is it different from WUM?
[5+3]
*****
Code No: R5410503 3
*****
1. (a) Explain ad hoc query and automation in the data warehouse delivery process.
(b) Explain the idea: “Can we do without an enterprise data warehouse?” [8+8]
3. (a) Discuss the issues involved in the design of server environments in a data warehouse system.
(b) Describe the design issues involved in the selection of user front-end hardware for a data warehouse
system. [10+6]
4. (a) Describe the role and importance of backup strategy of a data warehouse.
(b) Explain the role of hardware to implement backup strategy of a data warehouse. [8+8]
6. (a) What is a Decision Tree? What are the advantages and disadvantages of DECISION TREE
classifications? [3+5]
(b) Create a decision tree for the given data set and explain the knowledge obtained from
it. [4+4]
OUTLOOK    TEMP(F)  HUMIDITY(%)  WINDY  CLASS
sunny      79       90           True   play
sunny      56       70           False  play
sunny      79       75           True   no play
sunny      60       90           True   no play
overcast   88       88           False  no play
overcast   63       75           True   play
overcast   88       95           False  play
Rain       78       60           False  play
Rain       66       70           False  no play
Rain       68       60           True   play
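As a starting point for question 6(b), the following is a minimal sketch of the entropy and information-gain calculation that an ID3-style tree construction would apply to the table above. It is an illustration under stated assumptions, not a model answer: the names ROWS, entropy and information_gain are hypothetical, the WINDY value printed as "Flase" is read as False, and only the two categorical attributes (OUTLOOK and WINDY) are scored, since the numeric attributes would first need a split threshold.

```python
from collections import Counter
from math import log2

# The ten records from the question, as (outlook, temp_f, humidity, windy, class).
# Assumption: the WINDY value printed as "Flase" in the paper is read as False.
ROWS = [
    ("sunny", 79, 90, True, "play"),
    ("sunny", 56, 70, False, "play"),
    ("sunny", 79, 75, True, "no play"),
    ("sunny", 60, 90, True, "no play"),
    ("overcast", 88, 88, False, "no play"),
    ("overcast", 63, 75, True, "play"),
    ("overcast", 88, 95, False, "play"),
    ("Rain", 78, 60, False, "play"),
    ("Rain", 66, 70, False, "no play"),
    ("Rain", 68, 60, True, "play"),
]


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(rows, column):
    """Entropy reduction obtained by splitting rows on a categorical column index."""
    labels = [row[-1] for row in rows]
    groups = {}
    for row in rows:
        # Collect the class labels that fall under each distinct attribute value.
        groups.setdefault(row[column], []).append(row[-1])
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


if __name__ == "__main__":
    print("class entropy:", entropy([row[-1] for row in ROWS]))
    print("gain(OUTLOOK):", information_gain(ROWS, 0))
    print("gain(WINDY):  ", information_gain(ROWS, 3))
```

An ID3-style construction would place the attribute with the largest gain at the root and then repeat the calculation on each resulting subset of records.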
7. (a) What are the different types of web mining? How is web usage mining different from web structure
mining and web content mining? [3+5]
(b) What is concept hierarchy? How is it related to web mining? [3+5]
8. (a) What is a spatial trend? Explain the spatial trend detection algorithm.
[3+5]
(b) What is spatial clustering? Write about spatial characterization. [3+5]
*****
Code No: R5410503 4
*****
1. (a) Explain ad hoc query and automation in the data warehouse delivery process.
2. (a) Explain the difference between designing a data warehouse and an OLTP system.
3. What are the different architectural options available for designing server hardware for a data warehouse
system? [16]
4. (a) Why is it important to get all the security and audit requirements clearly documented?
(b) Explain the GUILLOTINE CUT phenomenon. What is the advantage of this method
compared with others? [4+4]
7. (a) Which frequent itemset mining method is suitable for text mining, and why? Explain.
(b) Discuss the relationship between text mining and information retrieval and information extraction.
[8+8]
8. (a) What is the “Constrained Sequence Mining Problem”? In which situations would we use constrained
sequence mining?
*****