You are on page 1of 27

EMC DATA

DOMAIN
OVERVIEW

Copyright 2011 EMC Corporation. All rights reserved.

EMC Data Domain:


Leadership and Innovation
A history of industry firsts

2003

2004

2005

First deduplication
NAS

2006

2007

First deduplication
virtual tape library

First deduplication
volume replication

2008

Largest
deduplication
array

Fastest backup
controller

First deduplication
directory replication
First deduplication
nearline storage

Copyright 2011 EMC Corporation. All rights reserved.

2009

Cascaded
replication

2010

2011
First longterm
retention
system for
backup and
archive

First
distributed
processing

Deduplication Dramatically Reduces


Storage Capacity Requirements
Deduplication
1030 times less data stored versus fulls + incrementals with typical retention policies

Data Stored

30

20

10

0
1

10

15

20

Weeks in Use
Deduplication storage
Traditional storage

Copyright 2011 EMC Corporation. All rights reserved.

Backup Data
Reduction/Deduplication
Time Series of Large Enterprise Implementation
2H '07
2H '08
1H '09
2H '09
1H '10
1H '11
Not in Plan

15%

15%

24%

14%
12%

27%

8%
40%
46%

31%
16%

28%

15%
4%

14%
6%

48%
Past Long-term Plan

25%

In Long-term Plan

In last three years, in-use


rates for backup
25%
26% with
deduplication have risen
from22%
15% to 48%
20%
14%

7%

21%

16%

In Near-term Plan

17%
7%

18%

10%

In Pilot/Evaluation

13%
In Use Now

Source: Wave 15 Storage Study Q2 2011, published 5/16/11, large-enterprise sample; H 07, n=151; 2H 08,
n=127; 1H 09, n=147; 2H 09, n=182; 1H 10, n=146; 1H 11, n=31;TheInfoPro (www.theinfopro.com)

Copyright 2011 EMC Corporation. All rights reserved.

Backup Data
Reduction/Deduplication
Large Enterprise
EMC
Competitor 1

The in-use rating for


EMC is now over 3x that
of its nearest
competitor

Competitor 2
Competitor 3
Competitor 4
Competitor 5
Competitor 6
Competitor 7
0%
Pas t Long-te rm Plan (> 18 M onths Out)

10%
Long-term Plan

20%
Ne ar-te rm Plan

30%

40%

50%

In Pilot/Evaluation (Budget Has Alre ady Be en Allocate d)

60%

70%

In Use Now (Not Including Pilots)

Source: Wave 15 Storage Study Q2 2011, published 5/16/11, large-enterprise sample, n=31,TheInfoPro (www.theinfopro.com)

Copyright 2011 EMC Corporation. All rights reserved.

Purpose-Built Backup Appliances


Open Systems + Mainframe

EMC:
64.2%
2010 Total
Market

EMC
IBM
HP
Oracle
Quantum
Sepaton
FalconStor
Dell
Others

$1.69B

ource: Worldwide Purpose-Built Backup Appliance 20112015 Forecast and 2010 Vendor Shares, May 2011, IDC.
Chart: Worldwide Supplier Revenue, Total PBBA Market

Copyright 2011 EMC Corporation. All rights reserved.

With Data Domain Deduplication


Storage Systems, You Can
Retain longer
Keep backups onsite longer with less
disk for fast, reliable restores, and
eliminate the use of tape for
operational recovery

Replicate smarter
WAN

Move only deduplicated data over


existing networks with up to 99%
bandwidth efficiency for cost-effective
disaster recovery

Recover reliably
Continuous fault detection and selfhealing ensure data recoverability to
meet service level agreements
Copyright 2011 EMC Corporation. All rights reserved.

Deduplication
Fundamentals

Copyright 2011 EMC Corporation. All rights reserved.

Data Domain Basics


Easy integration with existing environment
Control Tier
Backup and
Archive
EMC
Applications
Symantec
CommVault
IBM

Target Tier

Disaster Recovery Tier

CIFS, NFS,
NDMP, DD
Boost
Replication

Ethernet
Virtual Tape
Library (VTL)
over
Fibre Channel

HP
Veeam
Quest

Copyright 2011 EMC Corporation. All rights reserved.

DD890 appliance

DD890 appliance

2U
2 to 14 ports
10 and 1 Gigabit Ethernet; 8 Gb/s Fibre Channel
RAID 6
Up to 285 TB usable capacity with shelves
2 TB or 1 TB 7.2K rpm SATA HDD in shelf
File system
NVRAM
N+1 fans and redundant, hot-plug power supplies

Data Deduplication: Technology


Overview
Store more backups in a smaller footprint
Friday Full Backup

A B C D A E
Mon Incremental

Tues Incremental

Weds Incremental

Thurs Incremental

F G
H

Backup
Data

Estimated
Logical Reduction

FRIDAY FULL

1 TB

Physical

24x

250 GB

Monday Incremental

100 GB

710x

10 GB

Tuesday Incremental

100 GB

710x

10 GB

Wednesday Incremental
10 GB

100 GB 710x

Thursday Incremental 100 GB

710x

Second FRIDAY FULL

1 TB

5060x 18 GB

2.4 TB

7.8x

10 GB

Second Friday Full Backup

B C D E

L G H

TOTAL

308 GB

ABCDE FGH I J K L

Copyright 2011 EMC Corporation. All rights reserved.

10

Retain: Store More for Longer with


Less
Over one year of retention in 3U of Data Domain
deduplication storage
Backup
Data

Cumulative
Logical

First Full

1 TB

4x

250 GB

Week 1

April 7

2.4 TB

8x

308 GB

Week 2

April 14

3.8 TB

10x

366 GB

Week 3

April 21

5.2 TB

12x

424 GB

Month 1

April 28

6.6 TB

14x

482 GB

Month 2

May 31

12.2 TB

17x

714 GB

Month 3

June 30

17.8 TB

19x

946 GB

Month 4

July 31

23.4 TB

20x

1,178 GB

TOTAL

23.4 TB

20x

1,178 GB

Copyright 2011 EMC Corporation. All rights reserved.

Estimated
Reduction

Physical

11

Data Integrity:
Data Invulnerability Architecture
End-to-end data verification

Self-healing file system


Cleaning
Expired data
Defrag
Verify

Other
RAID 6
NVRAM
Snapshots

Copyright 2011 EMC Corporation. All rights reserved.

Generate
Checksum

Verify
Data

File System
Deduplication
Local Compression
RAID

Re-Checksum and Compare

Checksum
Deduplication, write to disk
Verify

Verify the file


system
metadata
integrity
Verify user data
integrity
Verify stripe
integrity

End-to-end data verification

12

Network-Efficient Replication for


True Disaster Recovery
Lowers WAN costs; improves service level agreements
15%
DB

Archive data
Data Domain system

15%

15%
Home

Data Domain system


Source:
Remote sites

One-to-many
Many-to-one
Bi-directional
System-tosystem
WAN Cascaded

Data Domain system

Backup data

Flexible
replication

Home

Data Domain
Global Deduplication Array

Destination:
Data Center Hub
Supports hundreds
of remote sites

9599% cross-site bandwidth reduction


Copyright 2011 EMC Corporation. All rights reserved.

13

DD Boost Software
Distributes parts of deduplication process to
backup server or application clients
DD
Boost

Licensable software works across Data Domain


portfolio

Supports majority of backup software market


EMC Avamar and NetWorker
Symantec NetBackup and Backup Exec

Speeds backups by up to 50 percent


Process more backups with existing
resources
2040% less overall impact to backup server
8099% less LAN bandwidth

Enables Data Domain replication


management from the backup application

Copyright 2011 EMC Corporation. All rights reserved.

14

Additional Data Domain Software


Options
Data Domain Virtual Tape
Library
Easily integrates with Fibre
Channel
Emulates multiple tape
libraries

Data Domain Replicator


Network-efficient and
encrypted
Transfers only compressed,
deduplicated data over the
WAN

Supports open systems and


IBM i operating environments

Consolidate up to 270 remote

Data Domain Retention


Lock

Data Domain Encryption

File locking to satisfy IT


governance
and compliance policies
Electronic data shredding

sites into a single system

Inline encryption of data at


rest
Satisfies internal governance
rules and compliance
regulations
Protects against theft or loss
of
a physical system

Copyright 2011 EMC Corporation. All rights reserved.

15

DD Archiver Overview
Cost-optimized, long-term retention
Data Domain system for backup and archive
Active tier: short-term data protection; less than 90
days
Archive tier: scalable long-term retention; multiple
years

High-throughput deduplication storage


Up to 9.8 TB/hr

Cost optimized for long-term retention


Up to 570 TB usable, 28.5 PB logical capacity
Low cost per gigabyte while maintaining high
throughput
Fault isolation of archive units for long-term
recoverability

Leverage existing Data Domain system


advantages
Supports DD Replicator and DD Retention Lock
Copyright 2011 EMC Corporation. All rights reserved.

16

Industrys Most Scalable Inline


Deduplication Systems
DD800
Appliance Series

Global Deduplication
Array

DD Archiver

DD600
Appliance Series

Software options:
DD Boost, DD Virtual Tape Library, DD
Replicator,
DD Retention Lock, and DD Encryption

DD160
Appliance

DD160

DD620

DD640

DD670

DD860

DD890

Global
Deduplication
Array

DD
Archiver

Speed (DD
Boost)

1.1 TB/hr

2.4 TB/hr

3.4 TB/hr

5.4 TB/hr

9.8 TB/hr

14.7 TB/hr

26.3 TB/hr

9.8 TB/hr

Speed (other)

667 GB/hr

1.1 TB/hr

2.3 TB/hr

3.6 TB/hr

5.1 TB/hr

8.1 TB/hr

10.7 TB/hr

4.3 TB/hr

Logical
capacity

40195 TB

83415 TB

0.321.6 PB

0.62.7 PB

1.47.1 PB

2.914.2
PB

5.728.5 PB

5.728.5
PB

Usable
capacity

Up to 3.98
TB

Up to 8.3
TB

Up to 32.2
TB

Up to 55.9
TB

Up to 142
TB

Up to 285
TB

Up to 570 TB

Up to 570
TB

Copyright 2011 EMC Corporation. All rights reserved.

17

Deduplication
Storage Evaluation
Criteria

Copyright 2011 EMC Corporation. All rights reserved.

18

Methodology:
Inline vs. Post-Process Deduplication
INLINE

Deduplication Before Storing


Deduplication

Other activities
unimpeded
Predictable
Simpler

POST-PROCESS

Deduplication After Storing


Store

Deduplication

3x disk
accesses to
shared store

The more processes, the more


resource contention
Copy to tape: Too slow to stream tape
Recovery: Service level agreement
predictability
Replication: Poor time-to-disaster-recovery
Deduplication: If interleaved with backup or
restore

More administration to fight these


issues
Copyright 2011 EMC Corporation. All rights reserved.

19

Performance:
CPU-Centric vs. Spindle-Bound
Data Domain

Throughput MB/s

6,000

Fibre Channel

SATA

Most
deduplication
vendors
50

50

100

150

200

Number of Disk Spindles

Copyright 2011 EMC Corporation. All rights reserved.

20

Data Domain Systems Trajectory

Throughput GB/s

Data Domain SISL Scaling Architecture: CPU-centric


e r on
l
l
ro ati
Improvement since 2004:
t
on plic
c
Throughput: ~175x
al edu
u
Capacity:
~450x
D l D ray
ba
Ar
st
5
o
o
l
o
G
B
DD
ard
d
3
tan
s
r,
e
l
ol ols
r
t
on ot oc
c
1.5
r
le
p
g
Sin

0.04
DD200 (2004)
2004

Copyright 2011 EMC Corporation. All rights reserved.

2010

2011

2014 (est.)

Future

21

Why Data Domain?


Less disk to resource, less to
manage
CPU-centric deduplication
Inline deduplication

Simple, mature, and flexible


Simple, mature appliance
Any fabric, any software, backup
or archive applications

Resilience and disaster


recovery
Storage of last resort
Fast time-to-disaster recovery
(DR) readiness
Cross-site global compression
Data center or remote office
Copyright 2011 EMC Corporation. All rights reserved.

22

Data Domain Infrastructure and


Ecosystem
Supports a variety of workloads and data types
VMware
Microsoft
Microsoft SharePoint
Oracle
SAP

Backup

Archive
NAS, SAN, DAS

Primary
storage

Midrange and
Mainframe
IBM i
EMC Bus-Tech

Backup Applications

Archive Applications

EMC
CA
IBM
Symantec
HP
Atempo
CommVault Vizioncore BakBone

EMC
F5 Networks
Symantec
CommVault

Network

Copyright 2011 EMC Corporation. All rights reserved.

Disaster Recovery
Replication
over WAN

24

Enterprise Recoverability Readiness


at Disaster Recovery Site
Data Domain
inline
deduplicated
replication

DR-ready

Replicate during backup

Adaptive
post-process
deduplicated
replication

Backup to Cache

Scheduled
post-process
deduplicated
replication

Backup to Cache

Backup time 1.7-times longer than Data Domain

DR-ready

Deduplicate and replicate less than 50% ingest speedtwo times longer if uncompressed at fixed bandwidth
Backup time 1.1-times longer than Data Domain

DR-ready

Deduplicate and replicate less than 50% ingest speedtwo times longer if uncompressed at fixed bandwidth
Backup to VTL

VTL/tape/truck

Recall tapes
Copy to tape
Truck to storage

Copyright 2011 EMC Corporation. All rights reserved.

Truck from storage

25

EMC Global Services


Strategize
CONSULTING
Strategic
Observation
service establishes
a roadmap/vision
to meet your
recovery objectives
Operational
Readiness service
recommends a
Reference
Architecture that
leverages EMC
deduplication
technologies and
optimizes your
implementation

Design

Implement

TECHNOLOGY
DEPLOYMENT

MANAGED
SERVICES

Best practice
methodologies
from architecture
through integration

Residency Services
provide onsite or
remote skilled
service
professionals with
proven best
practices and
technology
expertise

Assessment,
Design/
Implementation,
Operational
Assurance, Health
Check, Data
Migration

Copyright 2011 EMC Corporation. All rights reserved.

MAINTENANCE AND
SUPPORT
360 global,
proactive, and
preemptive
procedures and
solution support

Manage
EDUCATION
Open Storage
Technology
education, EMC
technology-specific
learning paths,
EMC Proven
Professional
Certification

Remote Managed
Services provide
cost-effective, ITILbased, 24x7
intelligent remote
monitoring and
operational
infrastructure
management

26

Why EMC Global Services ?


Save money
Significantly lower implementation and operating expenditures
Fill internal resource gaps for less
Protect investments in EMC solutions

Accelerate time to value


Reduce deployment time
Accelerate return on investment for new projects
Ease the burden of compliance while protecting critical business
information

Mitigate risk and get better results


Configure the solution to meet your requirements
Improve service levels; reduce management costs
EMC best practices and unmatched product expertise = superior
customer experience
Reduce disruption while taking advantage of the features and
benefits of the latest EMC products and solutions

Copyright 2011 EMC Corporation. All rights reserved.

27

You might also like