You are on page 1of 7

VIVEKANANDA INSTITUTE OF TECHNOLOGY & SCIENCE

(COLLEGE CODE: N6)


Opp. to Housing Board Colony, Bye Pass Road Karimnagar-505 001

I MIDTERM OBJECTIVE QUESTION PAPER

CLASS: IV B. Tech I – Semester SUBJECT: DWDM

Branch: CSE TIME : 10.00 AM TO 11.30 AM

1. Which of the following is the most important when deciding on the data structure of a data mart? [ ]
(a) XML data exchange standards (b) Data access tools to be used
(c) Metadata naming conventions (d) Extract, Transform, and Load (ETL) tool to be used
(e) All (a), (b), (c) and (d) above.

2. The process of removing the deficiencies and loopholes in the data is called as [ ]
(a) Aggregation of data (b) Extracting of data
(c) Cleaning up of data. (d) Loading of data (e) Compression of data.

3. Which one manages both current and historic transactions? [ ]


(a) OLTP (b) OLAP (c) Spread sheet (d) XML (e) All

4. Which of the following is the collection of data objects that are similar to one another within the same group?[ ]
(a) Partitioning (b) Grid (c) Cluster d) Table (e) Data source.

5. Which of the following employees data mining techniques to analyze the intent of a user query, provided
additional generalized or associated information relevant to the query? [ ]
(a) Iceberg query method (b) Data analyzer (c) Intelligent query answering (d) DBA
6. Which of the following process includes data cleaning, data integration, data selection, data transformation, data
mining, pattern evolution and knowledge presentation? [ ]
(a) KDD process (b) ETL process (c) KTL process (d) MDX process (e) None

7. At which level we can create dimensional models? [ ]


(a) Business requirements level (b) Architecture models level (c) Detailed models level
(d) Implementation level (e) Testing level.

8. Which of the following is not related to dimension table attributes? []


(a) Verbose (b) Descriptive (c) Equally unavailable (d) Complete

9. Data warehouse bus matrix is a combination of [ ]


(a) Dimensions and data marts (b) Dimensions and facts
(c) Facts and data marts (d) Dimensions and detailed facts

10. Which of the following is not the managing issue in the modeling process? [ ]
(a) Content of primary units column (b) Document each candidate data source
(c) Do regions report to zones (e) Ensure that the transaction edit flat is used for analysis.

11. The full form of OLAP is [ ]


A) Online Analytical Processing B) Online Advanced Processing
C) Online Advanced Preparation D) Online Analytical Performance

12. ……………………. is a subject-oriented, integrated,time-variant, nonvolatile collection or data in support of


management decisions.
A) Data Mining B) Data Warehousing C) Document Mining D) Text Mining

13. The data is stored, retrieved and updated in ………………..


A) OLAP B) OLTP C) SMTP D) FTP

14. An ……………… system is market-oriented and is used for data analysis by knowledge workers, including
managers, executives, and analysts. [ ]
A) OLAP B) OLTP C) Both of the above D) None of the above

15. …………………… is a good alternative to the star schema. [ ]


A) Star schema B) Snowflake schema C) Fact constellation D) Star-snowflake schema

16. The …………. exposes the information being captured, stored, and managed by operational systems.[ ]
A) top-down view B) data warehouse view C) data source view D) business query view
17. The type of relationship in star schema is ……………
A) many to many B) one to one C) one to many D) many to one
18. The ……………… allows the selection of the relevant information necessary for the data warehouse.[ ]
A) top-down view B) data warehouse view C) data source view D) business query view
19. Which of the following is not a component of a data warehouse? [ ]
A) Metadata B) Current detail data C) Lightly summarized data D) Component Key
20. Which of the following is not a kind of data warehouse application? [ ]
A) Information processing B) Analytical processing C) Data mining D) Transaction processing
21.Data scrubbing is which of the following? [ ]
A.A process to reject data from the data warehouse and to create the necessary indexes
B.A process to load the data in the data warehouse and to create the necessary indexes
C.A process to upgrade the quality of data after it is moved into a data warehouse
D.A process to upgrade the quality of data before it is moved into a data warehouse
22.The active data warehouse architecture includes which of the following? [ ]
A.At least one data mart B.Data that can extracted from numerous internal & external sources
C.Near real-time updates D.All of the above.
23.A goal of data mining includes which of the following? [ ]
A.To explain some observed event or condition B.To confirm that data exists
C.To analyze data for expected relationships D.To create a new data warehouse
24.An operational system is which of the following? [ ]
A.A system that is used to run the business in real time and is based on historical data.
B.A system that is used to run the business in real time and is based on current data.
C.A system that is used to support decision making and is based on current data.
D.A system that is used to support decision making and is based on historical data.
25.A data warehouse is which of the following? [ ]
A.Can be updated by end users. B.Contains numerous naming conventions and formats.
C.Organized around important subject areas. D.Contains only current data.
26.A snowflake schema is which of the following types of tables? [ ]
A.Fact B.Dimension C.Helper D.All of the above
27.The generic two-level data warehouse architecture includes which of the following? [ ]
A.At least one data mart B.Data that can extracted from numerous internal and external sources
C.Near real-time updates D.All of the above.

28.Fact tables are which of the following? [ ]


A.Completely denormalized B.Partially denormalized
C.Completely normalized D.Partially normalized
29.Data transformation includes which of the following?
A.A process to change data from a detailed level to a summary level
B.A process to change data from a summary level to a detailed level
C.Joining data from one source into various sources of data
D.Separating data from one source into various sources of data
30.Reconciled data is which of the following?
A.Data stored in the various operational systems throughout the organization.
B.Current data intended to be the single source for all decision support systems.
C.Data stored in one operational system in the organization.
D.Data that has been selected and formatted for end-user support applications
31.The load and index is which of the following?
A.A process to reject data from the data warehouse and to create the necessary indexes
B.A process to load the data in the data warehouse and to create the necessary indexes
C.A process to upgrade the quality of data after it is moved into a data warehouse
D.A process to upgrade the quality of data before it is moved into a data warehouse
32.The extract process is which of the following? [ ]
A.Capturing all of the data contained in various operational systems
B.Capturing a subset of the data contained in various operational systems
C.Capturing all of the data contained in various decision support systems
D.Capturing a subset of the data contained in various decision support systems
33.A star schema has what type of relationship between a dimension and fact table? [ ]
A.Many-to-many B.One-to-one C.One-to-many D.All of the above.
34.Transient data is which of the following? [ ]
A.Data in which changes to existing records cause the previous version of the records to be eliminated
B.Data in which changes to existing records do not cause the previous version of the records to be eliminated
C.Data that are never altered or deleted once they have been added
D.Data that are never deleted once they have been added

35.A multifield transformation does which of the following? [ ]


A.Converts data from one field into multiple fields
B.Converts data from multiple fields into one field
C.Converts data from multiple fields into multiple fields
D.All of the above

36 What is ETL Stand for? [ ]


A. Execute tramit and load B. Extract transform and load
C. Excute Transform and load D. All the above

37. The extract process is which of the following? [ ]


A.Capturing all of the data contained in various operational systems
B.Capturing a subset of the data contained in various operational systems
C.Capturing all of the data contained in various decision support systems
D.Capturing a subset of the data contained in various decision support systems

38.A star schema has what type of relationship between a dimension and fact table? [ ]
A.Many-to-many B.One-to-one C.One-to-many D.All of the above.

39.Transient data is which of the following? [ ]


A.Data in which changes to existing records cause the previous version of the records to be eliminated
B.Data in which changes to existing records do not cause the previous version of the records to be eliminated
C.Data that are never altered or deleted once they have been added
D.Data that are never deleted once they have been added

40.A multifield transformation does which of the following? [ ]


A.Converts data from one field into multiple fields
B.Converts data from multiple fields into one field
C.Converts data from multiple fields into multiple fields
D.All of the above

41.A snowflake schema is which of the following types of tables? [ ]


A.Fact B.Dimension C.Helper D.All of the above

42.The generic two-level data warehouse architecture includes which of the following? [ ]
A.At least one data mart B.Data that can extracted from numerous internal and external sources
C.Near real-time updates D.All of the above.

43.Fact tables are which of the following? [ ]


A.Completely demoralized B.Partially denoralized
C.Completely normalized D.Partially normalized

44.Data transformation includes which of the following? [ ]


A.A process to change data from a detailed level to a summary level
B.A process to change data from a summary level to a detailed level
C.Joining data from one source into various sources of data
D.Separating data from one source into various sources of data

45.Reconciled data is which of the following? [ ]


A.Data stored in the various operational systems throughout the organization.
B.Current data intended to be the single source for all decision support systems.
C.Data stored in one operational system in the organization.
D.Data that has been selected and formatted for end-user support applications.

46.Which two come nearest to each other: [ ]


a. Association Rules & Classification B.Classification & Prediction
C. Classification & Clustering D. Association Rules & Clustering

47. Association rules XÞY & YÞX both exist for a given min_sup and min_conf. Pick the correct
statement(s): [ ]
a. Both ARs have same support & confidence
b. Both ARs have different support & confidence
c. Support is same but not confidence
d. Confidence is same but not support
48. The AR: Bread Butter ÞJam is an example of [ ]
a.Boolean Quantitative AR b.Boolean Multilevel AR
c. Multidimensional Multilevel AR d.Boolean Single-dimensional AR

49.In market-basket analysis for an association rule to have business value it should have: [ ]
A.Confidence B.Support C.Both D.None

50.In Apriori algorithm if large 1-itemsets are 50 then the number of candidate 2-itemsets
will be: [ ]
A.50 B.25 C.1230 D.50! (50 factorial)

FILL IN THE BLANKS


1) Successful data warehousing requires that a formal program in total quality management (TQM)
be implemented.(TRUE /FALSE)
2) Joining is the process of partitioning data according to predefined criteria. (TRUE /FALSE)

3) The role of the ETL process is to identify erroneous data and to fix them. (TRUE /FALSE)
4) Star schema is suited to online transaction processing, and therefore is generally used in
operational systems, operational data stores, or an EDW. (TRUE /FALSE)
5) A star schema has _________ type of relationship between a dimension and fact table?

6) A snowflake schema is ___________ types of tables?


7) The generic two-level data warehouse architecture includes __________.
8) Fact tables are ___________
9) Data transformation includes ____________
10) Reconciled data is _________________
11) Data scrubbing is _________________.
12) A goal of data mining includes ______________________
13) An operational system is_____________
14) A data warehouse is __________________.
15) Extracting knowledge from large amount of data is called ____________________.
16) Data warehouses and OLAP tools are based on a __________ dimensional data model.
17) The querying of multidimensional databases can be based on a ______________ model.
18) A HOLAP server combines _____________ .
19) Star schema consists of ____________& _______________ tables.
20) ___________Operation performs a selection on one dimension of the given cube.
21) The data warehouse can be built by using __________ approach.
22) Users of data mining systems can be classified into____________ categories.
23) Data base size is 100 GB to TB in
24) Smoothing techniques are ________________

25) _____________ refers to the computation of all the cuboids in the lattice.
26) IQR stands for___________________________.
27) The measures of pattern interestingness asses the ________,_________,_________,
__________ of discovered patterns.
28) A ______________ allows data to be modeled and viewed in multiple dimensions.
29) ______________ is task of discovering interesting patterns from large amounts of data.

30) _______________ data marts are sources directly from enterprise data warehouse.

31) _________contains a subset of corporate –wide data that is of value to a specific group of users.

32) ________________ converts data from legacy or host format to warehouse format.

33) _________________ is the estimate of the strength of the implication of the rule.

34) _____________ forms the logical subset of the complete data warehouse?

35) _________ is a dimension that means the same thing with every possible fact table to which it
can be joined?

36) ______________criteria is not used for selecting the data sources?

37) ____________ is true on building a Matrix for Data warehouse bus architecture?

38) ___________should not be considered for each dimension attribute?


39) __________form the set of data created to support a specific short lived business situation?

40) _____ is the special kind of clustering that identifies events or transactions that occur
simultaneously?

41) The precalculated summary values are called as __________


42) OLAP stands for _______________________
43) _____ is an efficient association rule mining algorithm that explores the level- wise mining?
44) _________allows users to focus the search for rules by providing metarules and additional
mining constraints?
45) __ ____is the collection of data objects that are similar to one another within the same group?
46) _______binning strategy, each bin has approximately the same number of tuples assigned to it?
47) ________binning strategy has the interval size of each bin the same?
48) ___________ association shows relationships between discrete objects?
49) _____algorithms attempt to improve accuracy by removing tree branches reflecting noise in the
data?
50) ______________ process includes data cleaning, data integration, data selection, data
transformation, data mining, pattern evolution, and knowledge presentation?

SECTION : A ( ANSWERS)

1. B 2. C 3. B 4. C 5. C 6. A 7. B 8. C 9. A 10. E 11. A) 12. B) 13. B) 14. A) 15. C) 16. C 17. C 18. A 19. D 20.D
21.D 22.D 23.A 24.B 25.C 26.D 27.B 28.C 29.A 30.B 31.B 32.B 33.C 34.A 35.D 36.B 37. B 38. C 39. A 40. D
41. D 42.B 43.C 44. A 45. B 46.B 47.B 48.C 49.C 50.B

SECTION B: ANSWERS

1.True 2.False 3. False 4. False 5) One-to-many 6) Fact,


Dimension, Helper 7) Data that can extracted from numerous internal and external sources
8) Completely normalized 9) A process to change data from a detailed level to a summary level
10) Current data intended to be the single source for all decision support systems 11) A process
to upgrade the quality of data before it is moved into a data warehouse 12) To explain
some observed event or condition 13) A system that is used to run the business in real time and is
based on current data 14) Organized around important subject areas. 15. data mining
16) multi 17) Star net 18) ROLAP & MOLAP 19) fact, dimension
20) slice 21. Top-down & bottom-up 22) 4 23) OLAP
24. Binning 25.Full Materialization 26. Inter Quartive Range 27.Simplicity, Certainty,
Utility, Novelty 28) Data cube 29) Data Mining 30) Dependent
31. Data mart 32.Data Transmission 33. Confidence 34) Data Mart.
35) Confirmed Dimensions 36)Platform 37)Data marts as rows and dimensions
as columns 38)Rapid changing dimension policy 39) Disposable Data Marts
40)Affinity grouping 41)Aggregates 42)Online Analytical Processing
43)Apriori Algorithm 44) Constraint based rule mining 45) Cluster
46)Equidepth binning 47)Equiwidth binning 48) Boolean 49. FP tree
50)KDD Process

You might also like