You are on page 1of 6

CS2032 DATAWAREHOUSING AND DATA MINING

Technical Questions and Answers



1.Data scrubbing is which of the following?
A. A process to reject data from the data warehouse and to create the necessary indexes
B. A process to load the data in the data warehouse and to create the necessary indexes
C. A process to upgrade the quality of data after it is moved into a data warehouse
D. A process to upgrade the quality of data before it is moved into a data warehouse

2. The @active data warehouse architecture includes which of the following?
A.At least one data mart
B.Data that can extracted from numerous internal and external sources
C. Near real-time updates
D. All of the above.
3. A goal of data mining includes which of the following?
A. To explain some observed event or condition
B.To confirm that data exists
C. To analyze data for expected relationships
D. To create a new data warehouse
4. An operational system is which of the following?
A.A system that is used to run the business in real time and is based on historical data.
B.
A system that is used to run the business in real time and is based on current
data.
C. A system that is used to support decision making and is based on current data.
D. A system that is used to support decision making and is based on historical data.

5. A data warehouse is which of the following?
A.Can be updated by end users.
B.Contains numerous naming conventions and formats.
C. Organized around important subject areas.
D. Contains only current data.

6. A snowflake schema is which of the following types of tables?
A.Fact
B.Dimension
C. Helper
D. All of the above

7. The generic two-level data warehouse architecture includes which of the following?
A.At least one data mart
B. Data that can extracted from numerous internal and external sources
C. Near real-time updates
D. All of the above.


8. Fact tables are which of the following?
A.Completely denormalized
B.Partially denormalized
C. Completely normalized
D. Partially normalized

9. Data transformation includes which of the following?
A. A process to change data from a detailed level to a summary level
B.A process to change data from a summary level to a detailed level
C. Joining data from one source into various sources of data
D. Separating data from one source into various sources of data

10. Reconciled data is which of the following?
A.Data stored in the various operational systems throughout the organization.
B.
Current data intended to be the single source for all decision support
systems.
C. Data stored in one operational system in the organization.
D. Data that has been selected and formatted for end-user support applications

11. The load and index is which of the following?
A.
A process to reject data from the data warehouse and to create the necessary
indexes
B.
A process to load the data in the data warehouse and to create the
necessary indexes
C. A process to upgrade the quality of data after it is moved into a data warehouse
D. A process to upgrade the quality of data before it is moved into a data warehouse

12. The extract process is which of the following?
A.Capturing all of the data contained in various operational systems
B. Capturing a subset of the data contained in various operational systems
C. Capturing all of the data contained in various decision support systems
D. Capturing a subset of the data contained in various decision support systems

13. A star schema has what type of relationship between a dimension and fact table?
A.Many-to-many
B.One-to-one
C. One-to-many
D. All of the above.

14. Transient data is which of the following?
A.
Data in which changes to existing records cause the previous version of the
records to be eliminated
B.
Data in which changes to existing records do not cause the previous version of the
records to be eliminated
C. Data that are never altered or deleted once they have been added
D. Data that are never deleted once they have been added

15. A multifield transformation does which of the following?
A.Converts data from one field into multiple fields
B.Converts data from multiple fields into one field
C. Converts data from multiple fields into multiple fields
D. All of the above

16 What is ETL Stand for?
A. Execute tramit and load
B. Extract transform and load
C. Excute Transform and load
D. All the above

17. Successful data warehousing requires that a formal program in total quality
management (TQM) be implemented.
A. True B.False

18.Joining is the process of partitioning data according to predefined criteria.
A.True B. False

19.The role of the ETL process is to identify erroneous data and to fix them.
A.True B. False

20.Star schema is suited to online transaction processing, and therefore is generally
used in operational systems, operational data stores, or an EDW.
A True B. False

21. Which of the following features usually applies to data in a data warehouse?
A.Data are often deleted
B.Most applications consist of transactions
C.Data are rarely deleted
D.Relatively few records are processed by applications
22. Which of the following statement is true?
A.The data warehouse consists of data marts and operational data
B.The data warehouse is used as a source for the operational data
C.The operational data are used as a source for the data warehouse
D.All of the above
23. The following is true of three-tier data warehouses:
A.Once created, the data marts will keep on being updated from the data
warehouse at periodic times
B.Once created, the data marts will directly receive their new data from the operational
databases
C.The data marts are different groups of tables in the data warehouse
D.A data mart becomes a data warehouse when it reaches a critical size
24. The following technology is not well-suited for data mining:
A.Expert system technology
B.Data visualization
C.Technology limited to specific data types such as numeric data types
D.Parallel architecture
25. What is true of the multidimensional model?
A.It typically requires less disk storage
B.It typically requires more disk storage
C.Typical business queries requiring aggregate functions take more time
D.Increasing the size of a dimension is difficult
26. We want to add the following capabilities to Table2: show the data
for 3 age groups (20-39, 40-60, over 60), 3 revenue groups (less than
$10,000, $10,000-$30,000, over $30,000) and add a new type of account:
Money market. The total number of measures will be:
A.4
B.More than 100
C.Between 10 and 30 (boundaries includeD.
D.Between 40 and 60 (boundaries includeD.

27. We want to add the following capability to Table2: for each type
of account in each region, also show the dollar amount besides the
number of customers. This adds to Table2:
A.Another dimension
B.Other column(s)
C.Other row(s)
D.Another measure for each cell
28. The most common source of change data in refreshing a data warehouse is:
A.Queryable change data
B.Cooperative change data
C.Logged change data
D.Snapshot change data
29. Which of the following statements is not true about refreshing a data warehouse:
A.It is a process of managing timing differences between the updating of data sources
and the related data warehouse objects
B.Updates to dimension tables may occur at different times than the fact table
C.The data warehouse administrator has more control over the load time lag than the
valid time lag
D.None of the above
30. A data warehouse is which of the following?
A. Can be updated by end users.
B. Contains numerous naming conventions and formats.
C. Organized around important subject areas.
D. Contains only current data.
31. An operational system is which of the following?
A. A system that is used to run the business in real time and is based on historical data.
B. A system that is used to run the business in real time and is based on current
data.
C. A system that is used to support decision making and is based on current data.
D. A system that is used to support decision making and is based on historical data.
32. The generic two-level data warehouse architecture includes which of the following?
A. At least one data mart
B. Data that can extracted from numerous internal and external sources
C. Near real-time updates
D. All of the above.
33. The @active data warehouse architecture includes which of the following?
A. At least one data mart
B. Data that can extracted from numerous internal and external sources
C. Near real-time updates
D. All of the above.
34. Reconciled data is which of the following?
A. Data stored in the various operational systems throughout the organization.
B. Current data intended to be the single source for all decision support systems.
C. Data stored in one operational system in the organization.
D. Data that has been selected and formatted for end-user support applications.
35. Transient data is which of the following?
A. Data in which changes to existing records cause the previous version of the
records to be eliminated
B. Data in which changes to existing records do not cause the previous version of the
records to be eliminated
C. Data that are never altered or deleted once they have been added
D. Data that are never deleted once they have been added
36. The extract process is which of the following?
A. Capturing all of the data contained in various operational systems
B. Capturing a subset of the data contained in various operational systems
C. Capturing all of the data contained in various decision support systems
D. Capturing a subset of the data contained in various decision support systems
37. Data scrubbing is which of the following?
A. A process to reject data from the data warehouse and to create the necessary
indexes
B. A process to load the data in the data warehouse and to create the necessary
indexes
C. A process to upgrade the quality of data after it is moved into a data warehouse
D. A process to upgrade the quality of data before it is moved into a data
warehouse
38. The load and index is which of the following?
A. A process to reject data from the data warehouse and to create the necessary
indexes
B. A process to load the data in the data warehouse and to create the necessary
indexes
C. A process to upgrade the quality of data after it is moved into a data warehouse
D. A process to upgrade the quality of data before it is moved into a data warehouse
39. Data transformation includes which of the following?
A. A process to change data from a detailed level to a summary level
B. A process to change data from a summary level to a detailed level
C. Joining data from one source into various sources of data
D. Separating data from one source into various sources of data

You might also like