Professional Documents
Culture Documents
Options
Alan McSweeney
Objectives
• Backup
− Ensure efficient recoverability of data
− Does not make backup data directly available
− Optimised to bring large amounts of data back online quickly for system
recovery
− Retention management at the volume level
− Not oriented to long-term management beyond life of current environment
and media
• Archiving
− Copy from online environment to separately managed (secure) storage to
reduce cost of storage and enforce retention
− Provides easy (ideally transparent) access for retrieval
− Optimised to write and retrieve data at file granularity
− File-level retention management
− Designed to manage data over long-term, through media migration and
with access auditing and controls
− Designed to manage multiple copies of data on different media types
Tape/optical media
Retrieve from
Secondary/Tertiary to
Primary
Tertiary
Storage
Take Copy
Immediately
• Disk Storage
• Tape Storage — Manual or Automated
• Optical Storage — Manual or Automated
• Hybrid devices
− VTL (Virtual Tape Library)
− EMC Centera
− IBM DR550
− Storage gateways
Disk — Advantages
• Speed - FC and SATA disk technologies allow the data to be
housed on the appropriate disks
• SATA Drive technology has mature and can lead to decreased
acquisition costs
• FC and SATA can be used within the same storage system for
primary and secondary data
• Storage Virtualisation
− Virtualise disk arrays within a storage system
− Virtualise storage systems within a fabric
− Thin provisioning allows over commitment of disk — reducing acquisition
costs
− Single Instance Storage (Deduplication) can be used but its effectiveness
depends in the nature of the data
Disk — Disadvantages
• Acquisition cost
• Disk systems do not interoperate well
• Management - multiple skill sets may be required even
if all storage systems are from the same vendor
• Most hardware vendors focus on ensuring hardware
resilience, data resilience is not their concern
• Operating costs — power, air conditioning, maintenance
• Advantages
− Control of costs
− Keep fixed number of media within automated library unit
(could keep none)
• Disadvantages
− External media needs media management and control
• Media management is greater for smaller capacity optical disks
− Manual costs of media management
GB Hours
Tape Read Tape Write Optical Optical
Time Time Read Time Write Time
Optical — Advantages
• Reduced cost over disk
• Larger capacity media planned for the future
• Can have embedded encryption
• Long media shelf life before refresh is required
• Very reliable medium
• True WORM option
Optical — Disadvantages
• Low capacity
• Media must be managed offline unless multiple libraries
are bought
• Low data access speed — not suited to large data volume
restores
800 9,000
Capacity GB - Past and Current
8,000
700
Capacity GB - Future
7,000
600
6,000
500
5,000
400
4,000
300
3,000
200
2,000
100 1,000
0 0
1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013
Optical Media Capacity Tape Media Capacity Future Optical Media Capacity Future Tape Media Capacity
Tape — Advantages
• Cost
• Very well defined road map for LTO
− LTO4 (Dec 2006) - 1.6TB (2:1 compression) and data transfer rates of up to
240 MB/second (2:1 compression)
− LTO5 (Planned) - 3.2 TB (2:1 compression) and data transfer rates of up to
360 MB/second (assuming a 2:1 compression)
− LTO6 (Planned) - 6.4 TB (2:1 compression) and data transfer rates of up to
540 MB/second (assuming a 2:1 compression)
• High capacity media
• Designed for large data volume restore
• Multiple media can be streamed to aggregate capacity and speed
• Can have embedded encryption
Tape — Disadvantages
• Media shelf life — medium
• Media long-term reliability
• Cumbersome single file restores
• Sequential access medium
• TS7510 • TS7520
• 96 TB Capacity at 2:1 • 2.6 PB Capacity at 2:1
Compression Compression
• Maximum number of virtual • Maximum number of virtual
libraries — 128 libraries — 512
• Maximum number of virtual • Maximum number of virtual
drives — 1,024 drives — 4,096
• Maximum number of virtual • Maximum number of virtual
cartridges — 8,192 cartridges — 64,000
• Maximum number of • Maximum number of
concurrent backups – 32 concurrent backups – 32
• VLS1000i • VLS6000
• 3 TB Capacity at 2:1 • 105 TB Capacity at 2:1
Compression Compression
• Maximum number of virtual • Maximum number of virtual
libraries — 6 libraries — 16
• Maximum number of virtual • Maximum number of virtual
drives — 12 drives — 128
HSM
• HSM is a principle most products offer the same basic
functionality
− Automatic migration and management of data from one
medium to another
− Stubs or pointer are left in place of migrated files
− Speed of retrieval depends upon speed of hardware upon
which the files have been migrated to, this gives online, near-
line and off-line options
Bridgehead Software
• Small company, employee owned
− Can they offer the level of service and support required when really
needed
− Are they possible acquisition targets
• Ideal for mid — large customers
− Can it handle the levels of data over time
Caminosoft
• Major corporation — publicly listed and managed by SEC rules
and regulations
• Primary focus is on managing file server type data
• Repackaged by vendors such as CA
November 26, 2009 33
Software Options
Symantec
• Major corporation
• Two products:
− NetBackup
− Enterprise Vault
• NetBackup
− HSM does not support Windows
• Enterprise Vault
− KVS staff still provide support, separate entity within Symantec
− Focus is largely on email and compliance
− Some integration with NetBackup
− Files to be migrated are collected into CAB files
− Entire CAB file recalled
− Poor support for tape as archival medium
• Recommended that you only use tape for data that is seldom or never accessed
IBM — Tivoli
• Major corporation
• Vast knowledge within the company
• Extensive R&D budgets
• Agents
and options from most major software and
hardware vendors
HP — File Archiver
• Major corporation
• Vast knowledge within the company
• Extensive R&D budgets
• “Simple Lightweight Solution” according to HP
HSM Product
What is Required from chosen vendor / application?
• Stable and functionally bullet proof solution
• Easy to use
• Capable of handling files
• Capable of handling data volumes
• Must integrate with backup application (so as NetBackup does
not initiate a restore when backing up or restoring stubs)
• Expert support knowledge
• Expert integration knowledge
− These products are dependant on hardware vendors solutions
Without ASIS – 74
total blocks
Medical Imaging
Software Archive
Technical Pubs
Archive
DataBase
Backup
• Storage
estimates expressed as raw capacities required
to accommodate data
• Includes
overhead for effective usability, RAID,
snapshots, online spare, less than 100% utilisation, etc.
• Primary
storage after 5 years with 10% annual growth =
25,580 GB
• Equates to at least 34,533 GB of raw disk capacity
180,000
160,000
140,000
120,000
100,000
GB
80,000
60,000
40,000
20,000
0
1
10
13
16
19
22
25
28
31
34
37
40
43
46
49
52
55
58
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
M
M
Total Secondary GB Total Primary GB Total Tertiary GB
3,000
2,500
Number of Media
2,000
1,500
1,000
500
0
Month Month Month Month Month Month Month Month Month Month Month Month Month Month Month
1 5 9 13 17 21 25 29 33 37 41 45 49 53 57
Month
250,000
200,000
150,000
GB
100,000
50,000
0
1
M h7
10
13
16
19
22
25
28
31
34
37
40
43
46
49
52
55
58
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
t
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
M
M
Total Secondary GB Total Primary GB Total Tertiary GB
3,000
2,500
Number of Media
2,000
1,500
1,000
500
0
Month Month Month Month Month Month Month Month Month Month Month Month Month Month Month
1 5 9 13 17 21 25 29 33 37 41 45 49 53 57
Month
250,000
200,000
150,000
GB
100,000
50,000
0
1
10
13
16
19
22
25
28
31
34
37
40
43
46
49
52
55
58
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
M
M
Total Secondary GB Total Primary GB Total Tertiary GB
4,000
3,500
Number of Media
3,000
2,500
2,000
1,500
1,000
500
0
Month Month Month Month Month Month Month Month Month Month Month Month Month Month Month
1 5 9 13 17 21 25 29 33 37 41 45 49 53 57
Month
250,000
200,000
150,000
GB
100,000
50,000
0
1
10
13
16
19
22
25
28
31
34
37
40
43
46
49
52
55
58
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
on
M
M
Total Secondary GB Total Primary GB Total Tertiary GB
4,500
4,000
3,500
Number of Media
3,000
2,500
2,000
1,500
1,000
500
0
Month Month Month Month Month Month Month Month Month Month Month Month Month Month Month
1 5 9 13 17 21 25 29 33 37 41 45 49 53 57
Month
1,600,000
1,400,000
1,200,000
1,000,000
GB
800,000
600,000
400,000
200,000
0
Month Month Month Month Month Month Month Month Month Month Month Month Month Month Month Month Month Month Month Month
6 12 18 24 30 36 42 48 54 60 66 72 78 84 90 96 102 108 114 120
1,800
1,600
1,400
1,200
Hours
1,000
800
600
400
200
0
1
5
M h9
on 3
on 7
on 1
on 5
on 9
on 3
on 7
on 1
on 5
on 9
on 3
on 7
on 1
on 5
on 9
on 3
on 7
on 1
on 5
on 9
on 3
7
on 05
on 09
on 13
7
1
9
M th 9
10
11
th
th
1
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
th
t
on
on
on
th
th
th
th
th
on
on
on
M
M
Tape Write Time Hours 10% Growth Optical Write Time Hours 10% Growth Tape Write Time Hours 20% Growth
Optical Write Time Hours 20% Growth Tape Write Time Hours 30% Growth Optical Write Time Hours 30% Growth
• Factors:
− 2 or 3 tiers
− Optical, tape or VTL as the last tier
− Use of existing storage (HP/Dell) or new storage
− DR or no DR
• Offsite manual copy or replication
− Software HSM — use existing NetBackup or other: HT
FileStore, CaminoSoft, IBM Tivoli
• Secondary disk
− Data is retrieved to primary immediately — available within
seconds/minutes
• Secondary/tertiary VTL
− Data is retrieved to primary immediately — available within minutes
• Secondary/tertiary tape library
− Data is retrieved to primary immediately — available within minutes
• Secondary/tertiary optical library
− Data is retrieved to primary immediately — available within hours
• Manual media retrieval
− Retrieval times depends on media location and staff allocated to media
handling
• Primary storage
mirrored for
resilience
• SAN switches
• SAN controllers
• Two disks per shelf
• Entire site
Advantages
• High performance
• Low manual intervention
• Highly resilient
Disadvantages
• High cost of acquisition and operation
• Growth in data volumes means additional expense
• No upper limit on cost
November 26, 2009 77
Physical Option 3 — Existing Hardware
Advantages
• Cost
Disadvantages
• Investment in old technology
• Software based HSM product skills required
Advantages
• Cost — use of existing hardware
• Some skill sets already in organisation
• Media life is increased with UDO
Disadvantages
• Cost — UDO or new tape library
• Management of archived media — especially UDO as they are
low capacity
• Investment in old technology
• Software based HSM product skills required
• UDO retrieval speeds
Advantages
• Some skill sets already in organisation
• No new third party migration tool absolutely necessary
• Extension of NetBackup system using NetBackup Storage
Migrator
Disadvantages
• Cost — VTL with required capacity can be expensive
• Cannot take VTL backups offsite — tertiary solution still required
• Lack of vendor implementation experience
Advantages
• Speed of retrieval
• No new third party migration tool absolutely necessary
• Simplicity
• Integration with NetBackup — no effect on daily backup routines
• Information store can be split across multiple information stores
to give multiple PB capacity is required
Disadvantages
• Cost — may be expensive initially but storage can be added over
time as needed
November 26, 2009 89
Central Management — Storage Virtualisation
Disadvantages
• Vendor based skill are still ultimately required
Alan McSweeney
alan@alanmcsweeney.com