Professional Documents
Culture Documents
1. What are the essential differences between defining requirements for operational
systems and for data warehouses?
In providing information about the requirements for an operational system, the users
are able to give you precise details of the required functions, information content, and
usage patterns.
In striking contrast, for a data warehousing system, the users are generally unable to
define their requirements clearly. They cannot define precisely what information they
really want from the data warehouse, nor can they express how they would like to use
the information or process it.
For most of the users, this could be the very first data warehouse they are being
exposed to. The users are familiar with operational systems because they use these in
their dailywork, so they are able to visualize the requirements for other new
operational systems. They cannot relate a data warehouse system to anything they
have used before. If, therefore, the whole process of defining requirements for a data
warehouse is so nebulous.
2. Explain business dimensions. Why and how can business dimensions be useful for
defining requirements for the data warehouse?
Even though the users cannot fully describe what they want in a data warehouse, they
can provide you with very important insights into how they think about the business.
They can tell you what measurement units are important for them. Each user
department can let you know how they measure success in that particular department.
The users can give you insights into how they combine the various pieces of
information for strategic decision making.
Managers think of the business in terms of business dimensions. Figure 5-1 shows the
kinds of questions managers are likely to ask for decision making. The figure shows
what questions a typical marketing vice president, a marketing manager, and a
financial controller may ask.
If the users of the data warehouse think in terms of business dimensions for decision
making, you should also think of business dimensions while collecting requirements.
Although the actual proposed usage of a data warehouse could be unclear, the
business dimensions used by the managers for decision making are not nebulous at
all. The users will be able to describe these business dimensions to you. You are not
totally lost in the process of requirements definition. You can find out about the
business dimensions.
sales, forecast sales, and budget sales. The business dimensions along which these
measurements are to be analyzed are shown at the top of the diagram as column
headings. In our example, these dimensions are time, location, product, and
demographic age group. Each of these business dimensions contains a hierarchy or
levels. For example, the time dimension has the hierarchy going from year down to
the level of individual day. The other intermediary levels in the time dimension could
be quarter, month, and week. These levels or hierarchical components are shown in
the information package diagram.
6. List the types of users who must be interviewed for collecting requirements. What
information can you expect to get from them?
The types of users of the data warehouse as follows:
Business analysts
Executives will give you a sense of direction and scope for your data warehouse.
They are the ones closely involved in the focused area. The key departmental
managers are the ones who report to the executives in the area of focus. Business
analysts are the ones who prepare reports and analyses for the executives and
managers. The operational system DBAs and IT applications staff will give you
information about the data sources for the warehouse.
8. Why are reviews of existing documents important? What can you expect to get out of
such reviews?
Although most of the requirements gathering will be done through interviews, group
sessions, and questionnaires, you will be able to gather useful information from the
review of existing documentation. Review of existing documentation can be done by
the project team without too much involvement from the users of the business units.
Scheduling of the review of existing documentation involves only the members of the
project team.
Documentation from User Departments:
What can you get out of the existing documentation? First, let us look at the reports
and screens used by the users in the business areas that will be using the data
warehouse. You need to find out everything about the functions of the business units,
the operational information gathered and used by these users, what is important to
them, and whether they use any of the existing reports for analysis.
You need to look at the user documentation for all the operational systems used. You
need to grasp what is important to the users. The business units usually have
documentation on the processes and procedures in those units. How do the users
perform their functions? Review in detail all the processes and procedures. You are
trying to find out what types of analyses the users in these business units are likely to
be interested in. Review the documentation and then augment what you have learned
from the documentation prepared from the interview sessions.
Documentation from IT:
The documentation from the users and the interviews with the users will give you
information on the metrics used for analysis and the business dimensions along which
the analysis gets done. But from where do you get the data for the metrics and
business dimensions? These will have to come from internal operational systems.
You need to know what is available in the source systems.
Where do you turn for information available in the source systems? This is where the
operational system DBAs and application experts from IT become very important for
gathering data. The DBAs will provide you with all the data structures, individual
data elements, attributes, value domains, and relationships among fields and data
structures. From the information you have gathered from the users, you will then be
able to relate the user information to the source systems as ascertained from the IT
personnel.
Work with your DBAs to obtain copies of the data dictionary or data catalog entries
for the relevant source systems. Study the data structures, data fields, and
relationships.
Eventually, you will be populating the data warehouse from these source systems, so
you need to understand completely the source data, the source platforms, and the
operating systems.
Now let us turn to the IT application experts. These professionals will give you the
business rules and help you to understand and appreciate the various data elements
from the source systems. You will learn about data ownership, about people
responsible for data quality, and how data is gathered and processed in the source
systems. Review the programs and modules that make up the source systems. Look at
the copy books inside the programs to understand how the data structures are used in
the programs.
9. Various data sources feed the data warehouse. What are the pieces of information
you need to get about data sources?
This piece of information is essential in the requirements definition document.
Include all the details you have gathered about the source systems. You will be using
the source system data in the data warehouse. You will collect the data from these
source systems, merge and integrate it, transform the data appropriately, and populate
the data warehouse.
Typically, the requirements definition document should include the following
information:
10. Name any five major components of the formal requirements definition document.
Describe what goes into each of these components.
1) Introduction. State the purpose and scope of the project. Include broad project
justification. Provide an executive summary of each subsequent section.
EXERCISES
1. Indicate if true or false:
A) Requirements definitions for a sales processing operational system and a sales
analysis data warehouse are very similar.
(False)
B) Managers think in terms of business dimensions for analysis.
(True)
()
(False)
()
(False)
I) Departmental managers are very good sources for information on data structures
of operational systems.
()
J) Information package diagrams are essential parts of the formal requirements
definition document.
(True)
2. You are the vice president of marketing for a nation-wide appliance manufacturer
with three production plants. Describe any three different ways you will tend to
analyze your sales. What are the business dimensions for your analysis?
3. BigBook, Inc. is a large book distributor with domestic and international distribution
channels. The company orders from publishers and distributes publications to all the
leading booksellers. Initially, you want to build a data warehouse to analyze
shipments that are made from the companys many warehouses. Determine the
metrics or facts and the business dimensions. Prepare an information package
diagram.
4. You are on the data warehouse project of AuctionsPlus.com, an Internet auction
company selling upscale works of art. Your responsibility is to gather requirements
for sales analysis. Find out the key metrics, business dimensions, hierarchies, and
categories. Draw the information package diagram.
5. Create a detailed outline for the formal requirements definition document for a data
warehouse to analyze product profitability of a large department store chain.