Professional Documents
Culture Documents
Ash Mate
WW Senior Solutions Architect
mate@us.ibm.com
Client workstations
Compute Farm
Site A Powered by
Spectrum Scale
Off Premise
Site B
Site C
Tape
Flash
Shared Nothing
Disk
Cluster
Supported platforms:
AIX™, xLinux,
pLinux, zLinux, Windows®
Function:
• Migration, Recall
Restore Operation:
1) Use copy to restore file.
2) mmrestorefs to restore global or independent fileset level snapshot.
– NOTE: mmrestorefs of a global snapshot requires file system to be unmounted.
Spectrum Protect
Spectrum Scale Spectrum Scale Cluster restore (GUI or CLI)
Server
mmbackup tool
coordinates processing
• Massive parallel filesystem backup • Usage of ACL & EA might lead to • If HSM is used use option
processing increased backup traffic MIGREQUIRESBACKUP=YES
• Spectrum Scale mmbackup • If HSM is used inline backup might • Prevent rename of directories
creates local shadow of Spectrum lead to unexpected tape mounts close to file system root
Protect DB and uses policy engine • Administrative operations on • Prevent ACL & EA changes if
to identify files for backup Spectrum Protect Server might not possible
• Spectrum Protect backup archive be observed from mmbackup (e.g.
client is used under the hood to file space deletion)
backup files to Spectrum Protect • Limited handling of include rules
Server using management class binding
• Spectrum Protect restore (CLI or
GUI) can be used to restore files
© Copyright IBM Corporation 2015 12
© 2016 IBM Corporation
Backup Of Large Spectrum Scale File Systems
Backup cycle:
• Restore operation:
- There is no command since restore are done by Spectrum Protect command (dsmc restore)
- Can restore a file/directory or whole file system
data
mmbackup
file operations
write
i.e. read/write
Spectrum Protect SERVER
Spectrum Scale Cluster
migration due to low
online storage
Migration
based on storage
pool threshold
Tape Library
– Process:
• Set LTFS EE or TSM
• Create Policy with threshold, use mmchpolicy command to install new policy
• Setup callback
No External
offline storage server
LTFS EE with separate GPFS nodes: LTFS EE connects to tape via LTFS LE+
Tape library can have multiple pools (3 in above example)
Multiple nodes can connect to the tape library – scalability for performance.
21 © 2016 IBM Corporation
Scale Out Backup and Restore (SOBAR)
Scale Out Backup and Restore (SOBAR) is a specialized mechanism for data protection
against disaster only for IBM Spectrum Scale™ file systems that are managed by Spectrum
Protect - Tivoli® Storage Manager (TSM) Hierarchical Storage Management (HSM).
Backup Process:
• Backup configuration data
• Pre-migrate files to HSM so there is copy of the data in the TSM
Restore Process:
• In case of disaster, recreate cluster and file system (using mmrestore config).
• Use image restore process to restore inode space (directory structure and file stubs)
• Now use normal HSM process to recall data (data will be recalled on demand)
migration
Solution by Solution by
• How do I stop buying expensive EMC appliances to meet the data growth?
• IBM is a leader in the enterprise backup software and integrated appliances magic quadrant
Published date: 06/15/2015 Source: Gartner Magic Quadrant for Enterprise Backup Software and Integrated Appliances
• By 2017, 70% of organizations are expected to have replaced their remote-office tape backup with a disk-based
backup solution that incorporates replication, up from 30% today
Published date: 06/16/2014 Source: Gartner
• By 2018, the number of organizations abandoning tape for backup is expected to double, and archiving to tape
should increase by 25%
Published date: 06/16/2014 Source: Gartner Magic Quadrant for Enterprise Backup Software and Integrated Appliances
• Reduce costs
– Lower infrastructure costs to achieve backup window and recovery objectives
– IBM Spectrum Protect’s built in enterprise class data dedup for no additional charge
– Lower admin efforts with simplified provisioning of storage for Spectrum Protect
– Higher storage utilization by leveraging a shared file system
– Build your infrastructure your way using low cost commodity storage
– Real time recovery for a longer retention period per dollar
Backup clients
Spectrum Protect
instances
Spectrum Scale shared file system
Storage
TSM Server TSM Server – File system for databases provide low latency
GPFS file systems – File system for storage pools provide high sequential
performance
DB STG
Tape
GPFS Storage
Running multiple TSM instances on one GPFS cluster provides standardizes, scalable and easy to use storage infrastructure for
the TSM backup environment
GPFS cluster provides a single file system and on demand resource sharing for all TSM instances
• Operational efficiency with one storage system for all Spectrum Protect servers
• Disaster protection with TSM or GPFS replication or GPFS native RAID (GNR)
• Spectrum Scale appliance (pre-packaged) Protocol / Application nodes (GPFS NSD clients)
– Graphical User Interface File server Database
Backup
Apps
Archive
– 3 Years Maintenance and Support
• Different models
Elastic Storage Server
– GS: small and fast (2 – 125 TB) (NSD Server)
– GL: large and scaling ( 150 – 1530 TB)
“Of course, you need fast network links as well and backup/archive
software that can use the links and back-end storage, as TSM can. If you
have these then your backup and archive, and subsequent restores, could
move data around like a dragster roaring down a speed strip.”
39
• See https://ibm.biz/TivoliStorageManagerBlueprints
40 © 2016 IBM Corporation
A Perfect Match:
Spectrum Protect and Elastic Storage
• Spectrum Scale and Spectrum Protect
Server(s)
http://escc.mainz.de.ibm.com | gaschler@de.ibm.com
• Lower Cost
– No extra storage required for TSM DB
– Use of standard infrastructure components
• Flexible Scalability
– Multiple TSM servers can share a single file system and storage
– Add more ESS building blocks as capacity and performance demands grow
• Ease of use with graphical user interface and TSM operation center
42
– TSM operations center provides advanced monitoring and reporting © 2016 IBM Corporation
A Smarter Storage Approach
The IBM Integrated Storage Portfolio
Thank you!
For more information:
Website: http://www-03.ibm.com/systems/storage/spectrum/index.html
Spectrum Scale
NSD Client File stored in blocks
Spectrum Scale File
System
All NSD servers export NSDs to all the
clients in active-active mode
Spectrum Scale stripes files across NSD
servers and NSDs in units of file-system
block-size Spectrum Scale NSD
Servers
NSD client communicates with all the
servers
File-system load spread evenly across all
the servers and NSDs. No HotSpots
Easy to scale file-system capacity and
performance while keeping the architecture
balanced
Client does real-time parallel I/O to all the NSD servers and
storage volumes/NSDs
© 2016 IBM Corporation
Spectrum Scale – The Complete Data Management Solution for Enterprise
environments
Single Worldwide Name Space
Use Spectrum Protect Backup Archive Client in combination with Spectrum Scale
mmbackup and Spectrum Protect for Space Management.
Backup + HSM*
Reason: Spectrum Protect for Space Management and backup archive client provide a close integration. Spectrum Protect
backup can‘t be combined with Spectrum Archive on the same file system.
General hint for HSM only environments: If Spectrum Protect is already in use and the customer has skills in this area
Spectrum Protect for Space Management can be integrated into the environment easily. If Spectrum Protect is not an
option for the customer Spectrum Archive Enterprise Edition is the best approach.
Category Spectrum Protect for Space Management Spectrum Archive Enterprise Edition
Backend Storage Backend storage provided from Spectrum Protect IBM Tape drives and libraries
Type server with wide range of storage medium types
supported (Disk, Tape, Optical, Object)
Backend Storage Data is stored in proprietary format. Tape cartridges Data is stored in open LTFS format. Single cartridges
Data Format containing data can be used only in combination can be used directly with Spectrum Archive SE or LE
with Spectrum Protect server and vice versa (export and import function)
Supported Tape Multi-vendor support, including LTO, IBM TS1100, IBM LTO and TS1100 tape drives with IBM TS3500,
Systems Oracle StorageTek, DLT and virtual tape libraries. TS4500 and TS3310 libraries
Tape library sharing Yes, multiple TSM servers can share the same tape All Spectrum Archive nodes share 1 tape library and all
libraries and tape drives, but not tape cartridges. tape cartridges. Each node requires dedicated drives.
(IBM plans to support sharing of 2 tape libraries in
4Q15).
Backend Storage Data can be collocated on filespace level to Can be collocated on file system, directory and file
Device Collocation implement dedicated storage volume usage name level
Backend Storage Spectrum Protect servers uses DB2 instances for Metadata stored on tape cartridge and file system
Metadata metadata (Spectrum Scale)
Platforms See slide: „Highlevel Architecture“
© Copyright IBM Corporation 2015
48
© 2016 IBM Corporation
Functionality Compared – Data Transfer And Scalability
Function Spectrum Protect for Space Management Spectrum Archive Enterprise Edition
File Migration • Premigration and Migration of single files and • Premigration and Migration of single files in one
multiple (small) files in one transaction. transaction
• Tape optimized migration • Tape optimized migration
File Recall • Normal recall (full file) • Normal recall (full file)
• Streaming recall (for streaming applications e.g. • Tape optimized recall
media player) • Cluster wide recall distribution
• Partial recall (for partial access applications e.g.
data bases)
• Tape optimized recall
• Cluster wide recall distribution
Scaling migrate • Add Space Management nodes • Add Spectrum Archive EE nodes
and recall • Add Spectrum Protect servers • Add tape resources
throughput • Add tape resources
Linear scalability Limited by number of Spectrum Protect server and By adding tape drives and Spectrum Archive EE
LAN connections to Spectrum Protect server nodes
Function Spectrum Protect for Space Management Spectrum Archive Enterprise Edition
File system backup Close integration with Spectrum Protect backup No support for file system backup
archive client (mmbackup)
Creating multiple Using copy storage pools in Spectrum Protect Server. By migrating data to more than one tape cartridge
copies Node replication feature of Spectrum Protect server. pools (up to 3 copies)
Preservation of POSIX attributes and full ACL / EA support are POSIX attributes are preserved on tape
attributes preserved in Spectrum Protect server
Frontend Disaster • Restore of files from backup (when available) Recreation (rebuild) of deleted stub files, requires to
Recovery (GPFS) • Recreation of deleted stub files read all tapes
• Recovery of full file system with SOBAR
Backend Disaster DB2 is central metadata storage and can be restored Content of damaged tapes can be repaired if multiple
Recovery (TSM or from backup and use copy pools to recover from copies have been created during migration.
LTFS Tapes) volume failues. Switch to replication node of
Spectrum Protect server.
High Availability • Automated failover of HSM service in terms of node Manual failover of Spectrum Archive EE services in a
failure in a multi-node cluster multi-node cluster
• Automated recovery of local HSM service in terms
of processing failures
© Copyright IBM Corporation 2015
50
© 2016 IBM Corporation
Case Studies
• Backup of Spectrum Scale / ESS
LTFS EE enables IBM tape libraries to replace tier 2/3 disk Benefits:
storage in Spectrum Scale-based tiered disk – Near instant access for users and applications
environments – Improved operations
Storage Virtualization with transparent tiering to tape – Significant cost savings over more traditional storage
– Rapid implementation and integration
• LTFS EE creates “nearline” access tier 2/3 storage
– Improved editing workflow
with tape at 1/51 the cost
– Retention of more raw footage
• Helps reduce storage expense for data that does not
need the access performance of primary disk
Challenge
• Evaluate and re-architect an outdated GPFS environment.
• Provide enhanced business continuity using a second datacenter
• Provide an infrastructure for rapidly developed applications and real time analytics
Solution
• Integrated solution comprising Spectrum Scale ESS systems, Spectrum Archive EE, ProtectTIER, TS4500
Tape Libraries, Professional Services
Key Client Benefit
• Increased storage utilization through integration of SSDs, disks and tape
• Higher level of business continuity through Spectrum Scale‘s ‘built-in‘ DR capabilities and ESS‘ data integrity
features
• Simplified management due to a homogeneous, scalable, ‘building block‘ based architecture
• Increased peformance
IBM Confidential
55
© 2016 IBM Corporation
Case Studies
• Backup to Spectrum Scale / ESS
Overview
Petascale Storage for Research Computing Industry: Education
Client: University of Oklahoma
Supercomputing Center for
Problem: growing data needs for research computing while meeting data Education and Research
management requirements of National Science Foundation (OSCER)
Products: GPFS, TSM, IBM disk and tape
storage and servers
Solution: Profile
• For high-capacity disk storage, the IBM System Storage DCS9900 was
selected—which is scalable up to 1.7 PB.
• For longer-term data storage, OU chose the System Storage TS3500 Tape
Library—with an initial capacity up to 4.3 PB and expandable to over 60 PB.
• To run these storage systems, six IBM System x3650 class servers were
selected, running IBM General Parallel File System (GPFS™) on the disk
system and IBM Tivoli Storage Manager on the tape library to automatically
move or copy data to tape.
Benefit: Neeman says one of the main reasons they chose IBM was the cost
effectiveness of the tape solution.
White paper available
IBM Confidential
57
© 2016 IBM Corporation
University of Colorado
Overview
PetaLibrary for Research Computing
Industry: Education
Client: University of Colorado
Research Computing Dept
Problem: growing data needs for research computing while meeting data Products: GPFS, TSM, IBM disk and tape
management requirements of National Science Foundation storage and servers
Solution:
Profile
• For high-capacity disk storage, the IBM System Storage DCS3700 was
selected
• For longer-term data storage, CU chose the System Storage TS3500 Tape
Library
• IBM General Parallel File System (GPFS™) for the disk system and IBM Tivoli The PetaLibrary is a National Science
Storage Manager on the tape library to automatically move or copy data to tape. Foundation-subsidized service for the
storage, archival, and sharing of research
data. It is available for a modest fee to any
Benefit: Scalable high performance solution that meets needs of broad set of US-based researcher affiliated with the
customers University of Colorado Boulder.
Video available
IBM Confidential
58
© 2016 IBM Corporation
Recent ESS Win for Fast Backup Pool Use Case
IASIS Healthcare (not yet publicly referenceable)
IASIS Healthcare chose IBM Elastic Storage Server to meet their needs for a 4. Cost - The Data Domain and other proposed backup
TSM fast backup pool. IASIS chose ESS for the following reasons: appliances (ExaGrid, Sepaton) are expensive. Cost
for Data Domain solution was around $3700/TB. The
cost for ESS solution was around $700/TB
1. Multi-use capability - No need to spend premium dollars on a backup
appliance that can only serve a singular function. With ESS, IASIS is
leveraging it not only for a backup target and replication, but for several 5. Customer desired encryption capability
other use cases. One example is an image store for an ambulatory EHR
that stores millions of images and needs these served up in a timely
manner
The Solution
2. Scalability - The existing backup appliances do not scale effectively, and
•2 GL4 Elastic Storage Server systems – with
IASIS had felt the pain of a "rip and replace." The ability of the ESS to
approximately 600TB usable space)
scale while adding performance was a great differentiator.
•GPFS Advanced Edition (includes
encryption)
3. Performance - IASIS was having challenges around backup windows and
restores. The ESS, utilizing 40GbE, will allow IASIS to shrink their backup
window and hasten their restore window within their RTO also allowing
them to meet their RPO.
IBM and IBM Business Partner Use Only © 2016 IBM Corporation