Professional Documents
Culture Documents
Chapter 11
Chapter Objective
After completing this chapter, you will be able to: o Define Business Continuity and Information Availability
o Accessibility
o Information should be accessible at the right place and to the right user
o Timeliness
o Information must be available whenever required
Impact of Downtime
Lost Productivity Number of employees impacted (x hours out * hourly rate)
Know the downtime costs (per hour, day, two days...)
Lost Revenue Direct loss Compensatory payments Lost future revenue Billing losses Investment losses
Financial Performance Revenue recognition Cash flow Lost discounts (A/P) Payment guarantees Credit rating Stock price
Other Expenses Temporary employees, equipment rental, overtime costs, extra shipping costs, travel expenses...
2009 EMC Corporation. All rights reserved.
Impact of Downtime
o Average cost of downtime per hour = average productivity loss per hour + average revenue loss per hour o Where: o Productivity loss per hour = (total salaries and benefits of all employees per week) / (average number of working hours per week) o Average revenue loss per hour = (total revenue of an organization per week) / (average number of hours per week that an organizations is open for business)
Response Time
Recovery Time
Detection
Repair
Restoration
Incident
Diagnosis
Recovery
Time Incident
Repair time
o MTBF: Average time available for a system or component to perform its normal operations between failures o MTTR: Average time required to repair a failed component
2% 1% 0.2%
99.9%
99.99%
0.1%
0.01%
8 hrs 45 min
52.5 min
10 min 5 sec
1 min
99.999%
99.9999%
2009 EMC Corporation. All rights reserved.
0.001%
0.0001%
5.25 min
31.5 sec
6 sec
0.6 sec
BC Terminologies
o Disaster recovery
o Coordinated process of restoring systems, data, and infrastructure required to support ongoing business operations in the event of a disaster o Restoring previous copy of data and applying logs to that copy to bring it to a known point of consistency o Generally implies use of backup technology
o Disaster restart
o Process of restarting from disaster using mirrored consistent copies of data and applications o Generally implies use of replication technologies
BC Terminologies (Cont.)
Recovery Point Objective (RPO) o Point in time to which systems and data must be recovered after an outage o Amount of data loss that a business can endure Recovery Time Objective (RTO) o Time within which systems, applications, or functions must be recovered after an outage o Amount of downtime that a business can endure and survive
Weeks
Days
Hours Minutes
Minutes Seconds
Synchronous Replication
Seconds
Global Cluster
Recovery-point objective
2009 EMC Corporation. All rights reserved.
Recovery-time objective
o Designing and developing contingency plans and disaster recovery plan (DR Plan)
Establishing objectives
o Determine BC requirements.
Analyzing
o Collect information on data profiles, business processes, infrastructure support, dependencies, and frequency of using business infrastructure. o Identify critical business needs and assign recovery priorities.
Implementing
o Implement risk management and mitigation procedures that include backup, replication, and management of resources. o Prepare the disaster recovery sites that can be utilized if a disaster affects the primary data center.
o Implement redundancy for every resource in a data center to avoid single points of failure.
o Test the BC plan regularly to evaluate its performance and identify its limitations.
o Assess the performance reports and identify limitations. o Update the BC plans and recovery/restart procedures to reflect regular changes within the data center.
2009 EMC Corporation. All rights reserved.
BC Technology Solutions
o The following are the solutions and supporting technologies that enable business continuity and uninterrupted data availability:
o Fault tolerant configuration
o To avoid single-point of failure
Client
FC Switches
IP
Storage Array Storage Array Remote Site Redundant Network
Multi-pathing Software
o Configuration of multiple paths increases data availability o Even with multiple paths, if a path fails I/O will not reroute unless system recognizes that it has an alternate path o Multi-pathing software helps to recognize and utilizes alternate I/O path to data
o Remote Replication
o Data from the production devices is copied to replica devices on a remote array o In the event of a failure, applications can continue to run from the target device
o Backup/Restore
o Backup to tape has been a predominant method to ensure business continuity o Frequency of backup is depend on RPO/RTO requirements
2009 EMC Corporation. All rights reserved.
Chapter Summary
Key points covered in this chapter: o Importance of Business Continuity
#1 IT company
http://education.EMC.com