Hoffer Edm PP Ch05

CHAPTER 5:
PHYSICAL DATABASE DESIGN AND

PERFORMANCE
Essentials of Database Management
Jeffrey A. Hoffer, Heikki Topi, V. Ramesh
Copyright 2014 Pearson Education, Inc.
OBJECTIVES
Define terms
Describe the physical database design
process
Choose storage formats for attributes
Select appropriate file organizations
Describe three types of file organization
Describe indexes and their appropriate use
Translate a database model into efficient
structures, and know when/how to
denormalize
Chapter 5
PHYSICAL DATABASE
DESIGN
Purposetranslate the logical

description of data into the technical
specifications for storing and
retrieving data
Goalcreate a design for storing
data that will provide adequate
performance and insure database
integrity, security, and recoverability
Chapter 5
PHYSICAL DESIGN PROCESS

Inputs
Normalized
Volume
Decisions
relations
Attribute
estimates
Attribute
Physical
record
descriptions (doesnt always
match logical design)
definitions
Response
time
expectations
Data
Leads to
security needs
Backup/recovery
Integrity
DBMS
data types
needs
expectations
File
organizations
Indexes
and database
architectures
Query
optimization
technology used
Chapter 5
PHYSICAL DESIGN FOR

REGULATORY
Sarbanes- Oxley Act (SOX) protect
COMPLIANCE
investors by improving accuracy and

reliability
Committee of Sponsoring Organizations
(COSO) of the Treadway Commission
IT Infrastructure Library (ITIL)
Control Objectives for Information and
Related
(COBIT)
Regulations
andTechnology
standards that impact
physical design decisions
Chapter 5
DESIGNING FIELDS
Field: smallest unit of application

data recognized by system
software
Field design
Choosing data type

Coding, compression, encryption
Controlling data integrity
Chapter 5
CHOOSING DATA TYPES
Chapter 5
Figure 5-1 Example of a code look-up table

(Pine Valley Furniture Company)
Code saves space, but

costs an additional lookup
to obtain actual value
Chapter 5
FIELD DATA INTEGRITY
Default valueassumed value if no

explicit value
Range controlallowable value limitations
(constraints or validation rules)
Null value controlallowing or prohibiting
empty fields
Referential integrityrange control (and
null value allowances) for foreign-key to
primary-key match-ups
Sarbanes-Oxley Act (SOX) legislates importance of financial data integrit
Chapter 5
HANDLING MISSING DATA
Substitute an estimate of the missing

value (e.g., using a formula)
Construct a report listing missing values
In programs, ignore missing data unless
the value is significant (sensitivity testing)
Triggers can be used to perform these operations

Chapter 5
10
DENORMALIZATION
Transforming normalized relations into nonnormalized physical record specifications

Benefits:
Costs (due to data duplication)
Can improve performance (speed) by reducing number of

table lookups (i.e. reduce number of necessary join queries )
Wasted storage space
Data integrity/consistency threats
Common denormalization opportunities
One-to-one relationship (Fig. 5-2)

Many-to-many relationship with non-key attributes
(associative entity) (Fig. 5-3)
Reference data (1:N relationship where 1-side has data not
used in any other relationship) (Fig. 5-4)
Chapter 5
11
Figure 5-2 A possible denormalization situation: two entities with oneto-one relationship
Chapter 5
12
Figure 5-3 A possible denormalization situation: a many-to-many

relationship with nonkey attributes
Extra table
access
required
Null description possible
Chapter 5
13
Figure 5-4
A possible
denormalization
situation:
reference data
Extra table
access
required
Data duplication
Chapter 5
14
DENORMALIZE WITH
CAUTION
Denormalization can
Increase chance of errors and inconsistencies

Reintroduce anomalies
Force reprogramming when business rules
change
Perhaps other methods could be used to

improve performance of joins
Organization of tables in the database (file

organization and clustering)
Proper query design and optimization
Chapter 5
15
DESIGNING PHYSICAL DATABASE

FILES
Physical File:
A named portion of secondary memory allocated

for the purpose of storing physical records
Tablespacenamed logical storage unit in which
data from multiple tables/views/objects can be
stored
Tablespace components
Segment a table, index, or partition

Extentcontiguous section of disk space
Data block smallest unit of storage
Chapter 5
16
Figure 5-5 DBMS terminology in an Oracle 11g environment
Chapter 5
17
FILE ORGANIZATIONS
Technique for physically

arranging records of a file on
secondary storage
Types of file organizations
Sequential
Indexed
Hashed
Chapter 5
18
FILE ORGANIZATIONS
Factors for selecting file

organization:
Fast data retrieval and throughput

Efficient storage space utilization
Protection from failure and data loss
Minimizing need for reorganization
Accommodating growth
Security from unauthorized use
Chapter 5
19
Figure 5-6a
Sequential file
organization
Records of the
file are stored
in sequence by
the primary key
field values
If sorted every
insert or delete
requires re-sort
If not sorted
Average time to
find desired record
= n/2
Chapter 5
20
INDEXED FILE
ORGANIZATIONS
Storage of records sequentially or

nonsequentially with an index that allows
software to locate individual records
Index: a table or other data structure used to
determine in a file the location of records that
satisfy some condition
Primary keys are automatically indexed
Other fields or combinations of fields can also
be indexed; these are called secondary keys
(or nonunique keys)
Chapter 5
21
Figure 5-6b Indexed file organization
uses a tree search

Average time to find desired
record = depth of the tree
Chapter 5
22
Figure 5-6c
Hashed file
organization
Hash algorithm
Usually uses divisionremainder to determine
record position. Records
with same position are
grouped in lists.
Chapter 5
23
Figure 5-7 Join Indexesspeeds up join operations

b) Join index for matching
foreign key (FK) and primary
key (PK)
a) Join
index for
common
non-key
columns
Chapter 5
24
Chapter 5
25
USING AND SELECTING KEYS
Creating a unique key index
Example: CustomerID (primary key) of

Customer
Example: Composite primary key for OrderLine
Creating a secondary key index
Example: Description field for Product (not

unique)
Chapter 5
26
RULES FOR USING INDEXES

Use on larger tables
Index the primary key of each table
Index search fields (fields frequently
in WHERE clause)
4. Fields in SQL ORDER BY and GROUP
BY commands
5. When there are >100 values but not
when there are <30 values
1.
2.
3.
Chapter 5
27
RULES FOR USING INDEXES

(CONT.)
6. Avoid use of indexes for fields with long
7.
8.
9.
values; perhaps compress values first

If key to index is used to determine location
of record, use surrogate (like sequence nbr)
to allow even spread in storage area
DBMS may have limit on number of indexes
per table and number of bytes per indexed
field(s)
Be careful of indexing attributes with null
values; many DBMSs will not recognize null
values in an index search
Chapter 5
28
QUERY OPTIMIZATION
Parallel query processingpossible when

working in multiprocessor systems
Overriding automatic query

optimizationallows for query writers to
preempt the automated optimization
Data warehouses are already configured

for optimized query performance
Chapter 5
29
All rights reserved. No part of this publication may be reproduced, stored in a

retrieval system, or transmitted, in any form or by any means, electronic,
mechanical, photocopying, recording, or otherwise, without the prior written
permission of the publisher. Printed in the United States of America.

30

Hoffer Edm PP Ch05

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Hoffer Edm PP Ch05

Uploaded by

Copyright:

Available Formats

CHAPTER 5:

PHYSICAL DATABASE DESIGN AND

Copyright 2014 Pearson Education, Inc.

Copyright 2014 Pearson Education, Inc.

Purposetranslate the logical

Copyright 2014 Pearson Education, Inc.

PHYSICAL DESIGN PROCESS

Copyright 2014 Pearson Education, Inc.

PHYSICAL DESIGN FOR

investors by improving accuracy and

Copyright 2014 Pearson Education, Inc.

Field: smallest unit of application

Choosing data type

Copyright 2014 Pearson Education, Inc.

CHOOSING DATA TYPES

Copyright 2014 Pearson Education, Inc.

Figure 5-1 Example of a code look-up table

Code saves space, but

Copyright 2014 Pearson Education, Inc.

FIELD DATA INTEGRITY

Default valueassumed value if no

Sarbanes-Oxley Act (SOX) legislates importance of financial data integrit

Copyright 2014 Pearson Education, Inc.

HANDLING MISSING DATA

Substitute an estimate of the missing

Triggers can be used to perform these operations

Copyright 2014 Pearson Education, Inc.

Transforming normalized relations into nonnormalized physical record specifications

Costs (due to data duplication)

Can improve performance (speed) by reducing number of

Common denormalization opportunities

One-to-one relationship (Fig. 5-2)

Copyright 2014 Pearson Education, Inc.

Copyright 2014 Pearson Education, Inc.

Figure 5-3 A possible denormalization situation: a many-to-many

Null description possible

Copyright 2014 Pearson Education, Inc.

Copyright 2014 Pearson Education, Inc.

Increase chance of errors and inconsistencies

Perhaps other methods could be used to

Organization of tables in the database (file

Copyright 2014 Pearson Education, Inc.

DESIGNING PHYSICAL DATABASE

A named portion of secondary memory allocated

Segment a table, index, or partition

Copyright 2014 Pearson Education, Inc.

Figure 5-5 DBMS terminology in an Oracle 11g environment

Copyright 2014 Pearson Education, Inc.

Technique for physically

Copyright 2014 Pearson Education, Inc.

Factors for selecting file

Fast data retrieval and throughput

Copyright 2014 Pearson Education, Inc.

Copyright 2014 Pearson Education, Inc.

Storage of records sequentially or

Copyright 2014 Pearson Education, Inc.

Figure 5-6b Indexed file organization

uses a tree search

Copyright 2014 Pearson Education, Inc.

Copyright 2014 Pearson Education, Inc.

Figure 5-7 Join Indexesspeeds up join operations

Copyright 2014 Pearson Education, Inc.

Copyright 2014 Pearson Education, Inc.