You are on page 1of 16

White Paper

Symantec NetBackup OpenStorage


Data Domain Deduplication Storage Best Practices Guide

Abstract
Conceptually understanding OpenStorage software and the API-based integration with Symantec NetBackup provides a clear view of the business value and technical merits of the integration. This guide moves past the conceptual stage to solution planning and deployment. Best practice guidelines are covered with the goal of eliminating implementation challenges. Knowledge and experience gained from assisting early adopters is logically presented for the overall benet of those deploying an OpenStorage solution with Data Domain deduplication storage.

DEDUPLICATION STORAGE

Symantec NetBackup OpenStorage


Data Domain Deduplication Storage Best Practices Guide

Table of Contents
1 2 INTRODUCTiON .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 1.1 Target AuDIence. . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 1.2 EXecutIVe Summary . . . . . . . . . . . . . . . . . . . . . . . . 3 PLANNiNG.. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 2.1 NamIng ConVentIons. . . . . . . . . . . . . . . . . . . . . . . 3 2.2 NetWorks. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 2.3 DocumentatIon. . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 5 MULTipLE NETBACKUp DOMAiN CONFiGURATiONS .. . 12 6 DiSASTER RECOVERY.. . . . . . . . . . . . . . . . . . . . . . . . . . . 13 6.1 WItHIn tHe Same NetBackup DomaIn. . . . . . . . 14 6.2 To a DIfferent NetBackup DomaIn. . . . . . . . . . 14

7 ADDiTiONAL REFERENCES.. . . . . . . . . . . . . . . . . . . . . . . 14 8 CONCLUSiON. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15 9 AppENDiX MiGRATiON TO OpENSTORAGE .. . . . . . . . 15 9.1 MuLtIpLe ProtocoLs on One Data DomaIn SYSTEM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15 9.2 EXIstIng Backups RetaIn or DupLIcate?. . . . . 15 9.3 Are Storage LIfecycLe PoLIcIes ReQuIreD?. . . . 15 9.4 NetBackup PoLIcy MoDIfIcatIon . . . . . . . . . . . . 15 9.5 Legacy RepLIcatIon. . . . . . . . . . . . . . . . . . . . . . . . 15 9.6 DeLetIng Legacy Storage UnIts. . . . . . . . . . . . . 16

3 OpTiMiZED DUpLiCATiON. ......................... 8 3.1 Storage UnIts anD Storage SerVer Access . . . 8 3.2 NetWork ConsIDeratIons . . . . . . . . . . . . . . . . . . . 9

3.3 THrottLIng OptImIZeD DupLIcatIon. . . . . . . . . . . 9 3.4 OptImIZeD DupLIcatIon FaILures. . . . . . . . . . . . . . 9 3.5 SeeDIng Remote Data DomaIn SYSTEMs. . . . . . 10 3.6 DupLIcatIon Job ConfIguratIon OptIons. . . . 10

4 DUpLiCATiON TO TApE.. . . . . . . . . . . . . . . . . . . . . . . . . . 11 4.1 HIerarcHIcaL DupLIcatIon . . . . . . . . . . . . . . . . . 11 4.2 Tape CreatIon from tHe PrImary NetBackup Copy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 4.3 Tape CreatIon from a Non-PrImary NetBackup Copy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12

SYMANTEC NETBACKUP OPENSTORAGE

1 Introduction
Data Domain deduplication storage with Symantec NetBackup OpenStorage software is not difcult to install or congure. Deployment is straightforward in most environments. However, deployments involving multiple sites and a complex environment may experience issues with naming conventions, network infrastructure, or site-generated documentation detailing the installation. Therefore, OpenStorage implementations should be well planned and documented so that they can be deployed more quickly with fewer challenges when compared to the use of ad-hoc techniques. Deployment is often followed by a series of trials or tests to prove that the solution is functioning as planned. In this guide, OpenStorage best practices are examined and discussed to assist in eliminating the bottlenecks associated with deployment and functional testing of the solution.

1.1 Target Audience


This guide is intended to help system users and vendor staff doing the OpenStorage deployment take advantage of substantial real-world knowledge Data Domain has gained from assisting other customers. A working knowledge of OpenStorage terms and concepts is recommended and can be gained from the following: 4 Data Domain OpenStorage Primer 4 Data Domain OpenStorage (OST) User Guide 4 Symantec NetBackup Shared Storage Guide

4 NetBackup media server load balancing, eliminating the need to manually divide client backups across NetBackup media servers utilizing OpenStorage storage units. 4 Tape consolidation Backup images from remote locations and branch ofces can be replicated to a centralized location where they can be duplicated to tape under the control of NetBackup.

2 Planning
Deciding to change naming conventions halfway through a deployment can be painful, more so if production backups were executed to previously-named components that later need to be deleted in order to be renamed. Likewise, reconguring portions of the IP network that connect NetBackup media servers and Data Domain deduplication storage systems halfway through a deployment can be problematic when testing with production backups. Combining name changes with network changes is made worse with poor or nonexistent documentation. While conguration changes are both possible and supported, a production environment may not be the best place to learn these techniques for the rst time. Production environments differ from lab environments in that the severity of a setback is likely to be less pronounced in the lab. Creating a plan and documenting the conguration forms the foundation for a successful deployment and subsequent test phases.

1.2 Executive Summary


This guide focuses on backups and the creation of duplicate backup copies with OpenStorage. Network congurations, optimized duplication, and disaster recovery are also examined. Recommended best practices, as well as strategies that are not recommended, are covered with the goal of enhancing OpenStorage solution planning and deployment. OpenStorage software provides API-based integration between Data Domain storage systems and NetBackup. The API gives NetBackup visibility into the properties and capabilities of the Data Domain storage system, control of the backup images stored in the system, and access to network-efcient replication to remote Data Domain storage systems. Supported with NetBackup 6.5 and higher, OpenStorage-enabled Data Domain storage systems and the Symantec NetBackup OpenStorage option provide key enhancements for disk-based data protection: 4 NetBackup-optimized duplication: Backup image duplication using Data Domain deduplication storage and network-efcient replication that is controlled, monitored, and cataloged by NetBackup. 4 Integrated NetBackup reporting of Data Domain replication job status. 4 Recovery of replicated backup images in their entirety or at a granular level via the NetBackup user interface. 4 Sharing of OpenStorage storage units among heterogeneous NetBackup media servers.

2.1 Naming Conventions


Nomenclature, the assigning of names to OpenStorage specic components, is considered important for numerous reasons. The objective is to use a naming convention that will be easily understood by the user, system engineer, and potentially any support personnel involved with the OpenStorage solution.

SYMANTEC NETBACKUP OPENSTORAGE

2.1.1 Data Domain Hostname


This is the hostname used to identify a system. Hostnames are resolved via DNS or Hosts les. Hostnames are not specically NetBackup objects, but when used as storage server names they tell the media servers at the TCP/IP level how to connect to the storage server. 4 Use the assigned fully-qualied hostnames. 4 Do not use IP addresses in place of hostnames when registering storage servers as this will limit the ability to route optimized duplication trafc exclusively through the registered interface. 4 Generally speaking, use DNS to resolve hostnames to IP addresses that are routable through the environment. 4 Use Hosts les to resolve those same hostnames to non-routable IP addresses. 4 When possible, avoid creating secondary hostnames to associate with alternate or local IP interfaces. High performance Data Domain systems can accommodate data transfer rates in single- or multi-stream modes that exceed the maximum performance of 1 GbE (Gigabit Ethernet) networks. Explicitly not recommended is the use of hostnames based on different network interfaces so as to increase the total available bandwidth to a single Data Domain system congured as a storage server. See the Networks section of this document for details.

At present, multiple LSUs can be congured on a single Data Domain system. This can create conicts for advanced NetBackup features such as media server load balancing, capacity reporting, etc., and is only a recommended best practice when the Data Domain system is being congured in multiple NetBackup installations.

2.1.4 Disk Pool


A disk pool is a NetBackup object that correlates to an LSU. 4 Ideally there should only be one disk pool per LSU. When multiple storage servers have been dened on a single Data Domain system, create one disk pool per storage server. 4 Name the object with a -dp extension to allow for easy identication as a disk pool object.

2.1.5 Storage Unit


This is a logical target that is created within NetBackup. A storage unit that is created of the type disk and the disk type of OpenStorage (Data Domain) has three properties: a storage unit name, a disk pool target, and a list of media servers. 4 Ideally, create one storage unit per disk pool. The creation of multiple storage units that assist in segregating disk pool resource usage based on workload requirements is also acceptable. 4 Name the object with an -su extension to allow for easy identication as a storage unit object type. 4 Only media servers that have credentials dened for the storage server can use the storage unit. 4 Select either use any available media server or specic media servers as appropriate. The selection of multiple media servers within the storage unit denition effectively enables media server load balancing. Client backups that can potentially connect to two or more media servers can take advantage of media server load balancing, where the least loaded media server (best candidate media server) is selected for use by the NetBackup master server. .

2.1.2 Storage Server


This is a logical object dened within NetBackup that points to a Data Domain system. NetBackup communicates with the storage server and uses credentials supplied by the tpcong utility to request use of a disk pool for backup and restore operations. 4 There should be only one storage server dened per Data Domain system. 4 Use the Data Domain systems fully qualied hostname as the storage server name. 4 This name must be unique across the enterprise. 4 The OpenStorage plug-in uses standard TCP/IP name resolution to nd the corresponding Data Domain system.

2.1.6 Storage Unit Group 2.1.3 Logical Storage Unit


Also referred to as an LSU, a Logical Storage Unit is a disk target within a storage server. 4 Create one LSU per Data Domain system when deployed in a single NetBackup domain. Create additional LSUs as applicable for any additional NetBackup domains. 4 Name the object with a -lsu extension to allow for easy identication as a logical storage unit object. This is a logical target that is created within NetBackup. A storage unit group is a collection of storage units that can be used based on selection criteria. 4 To allow for N+1 redundancy, consider the optional use of storage unit groups. 4 Create one storage unit group per OpenStorage storage unit using a -sug extension. 4 The preferred storage unit should be the rst storage unit in the list of storage units. 4 A second storage unit can be added to the group for use should the rst storage unit be rendered non-operational.

SYMANTEC NETBACKUP OPENSTORAGE

4 Select the Failover storage unit selection algorithm. 4 Note that there may be cases where a storage unit group has been created for use in conjunction with multiple storage units using disk pools on a common Data Domain system. In this case use of the Media server load balancing storage unit selection algorithm is recommended. Using storage unit groups is optional. Use of the Failover selection algorithm is a best practice as it facilitates sending the same backups to the same Data Domain system which will equate to a higher data deduplication ratio. In the event that the preferred storage unit enters a non-operational state, backups will be sent to an alternate storage unit. This methodology may be of interest for mission critical or otherwise important backup jobs. Alternatively, you may elect not to use storage unit groups for backups that do not require N+1 redundancy.
NetBackup Master Server NetBackup Media Server
LAN

NetBackup Media Server

LAN

LAN

WAN

Data Domain OpenStorage Storage Units


Figure 2: Geographically dispersed conguration Two NetBackup media servers and two Data Domain systems are shown. Each NetBackup media server has a local network connection to a co-located Data Domain system. A WAN also connects the Data Domain systems. Backup trafc uses the local network connection between a given NetBackup media server and its co-located Data Domain system. Optimized duplication trafc uses a separate NIC on each Data Domain system. This conguration may be preferred in cases where backup and restore data transfer rates require the use of a 10 GbE network, and a lower bandwidth network is able to accommodate optimized duplication trafc.

2.2 Networks
Varying degrees of network complexity are associated with a given OpenStorage deployment. At a minimum, a single Data Domain system congured as a storage server is network connected to a NetBackup media server. NetBackup optimized duplication adds additional requirements as does a conguration that leverages media server load balancing.

2.2.1 Example Network Topologies


This section reviews a sampling of possible network topologies: 4 NetBackup media server and Data Domain systems sharing a common LAN congured for optimized duplication (Figure 1) 4 NetBackup media servers and Data Domain systems in geographically different locations congured for optimized duplication (Figure 2) 4 NetBackup media server load balancing with a Data Domain system (Figure 3)

NetBackup Media Servers

NetBackup Client
LAN LAN

Data Domain OpenStorage Storage Unit

NetBackup Media Server

Figure 3: NetBackup media server load balancing

LAN Data Domain OpenStorage Storage Units

A NetBackup client that can be backed up through a number of different NetBackup media servers. The OpenStorage storage unit has been congured so that each NetBackup media server can access its resources. This enables NetBackup media server load balancing, where the least loaded media server is used to fulll a backup request. Additionally, this conguration allows NetBackup to bypass an ofine media server when fullling a backup or restore request.

Figure 1: Optimized duplication with a common LAN This is a simple example of a NetBackup master/media server LAN connected to two Data Domain systems. In this use case both backup and optimized duplication trafc use the same NIC (Network Interface Card) on a given Data Domain system.
SYMANTEC NETBACKUP OPENSTORAGE 5

2.2.2 Network-Recommended Configurations


Best practice recommendations are based on known reference deployments that exhibit desirable behavior and performance characteristics. Simplicity is preferred over complexity. Ease of deployment, simplied administration, and predictable results have yielded these general themes: 4 Data Domain recommends interconnecting NetBackup media servers and Data Domain systems using a dedicated backup area network. NetBackup clients and NetBackup media servers should ideally be interconnected through a separate front-end network. (Figure 4) 4 Dedicated backup networks segregate NetBackup media server and storage unit trafc from other network trafc. Contention issues are constrained to backup and recovery jobs. Known available bandwidth can be managed from the perspective of achieving aggressive data protection and recovery service levels. 4 Dedicated backup networks lay the foundation for a scalable infrastructure should data protection network bandwidth requirements change over time. 4 Data Domain recommends the use of a 10 GbE network infrastructure in cases where single stream or aggregate data transfer rates in excess of 125 MB/s are required between a single NetBackup media server and the Data Domain system.

deployments, the use of a dedicated backup network is preferred when compared to mixed use network congurations. When deploying a Data Domain system that can accommodate data transfer rates exceeding the capabilities of 1 GbE networks, using a 10 GbE infrastructure can overcome data transfer rate bottlenecks. Single stream performance that exceeds 125 MB/s dictates the need for a 10 GbE network connection. 4 Network topologies without a 10 GbE infrastructure The network connecting NetBackup media servers to a given Data Domain system can optimally incorporate the use of multiple 1 GbE interfaces so long as there is only one connection per NetBackup media server. (Figure 5) An alternative topology that utilizes multiple 1 GbE network connections on one or more NetBackup media servers is also possible.

1 GbE Private Network 1 GbE Private Network 1 GbE Private Network 1 GbE Private Network

10 GbE Private Network

Figure 5: Recommended use of multiple 1 GbE networks In the example shown above, each of four NetBackup media servers connects to a specic NIC on a Data Domain system congured as a single storage server. Each NetBackup media server is congured to use DNS or a local host le such that the storage server name resolves to a specic interface on the Data Domain system. This conguration accommodates NetBackup media server load balancing as it utilizes a single storage server, single disk pool, and a single storage unit. By default the storage unit is dened to allow all four NetBackup media servers to use the shared disk pool resource. This 1 GbE topology imposes limits on maximum single stream as well as aggregate data transfer rates from any single NetBackup media server to the Data Domain system. The combined data transfer rate of all NetBackup media servers can result in an aggregate data transfer rate that seeks to better utilize resources and achieve the maximum throughput possible on the Data Domain system.

Figure 4: Dedicated backup network Dedicated backup networks, as shown in gure 4, provide a number of tangible benets

In many cases there may be no fundamental reason to commingle NetBackup client network trafc with the network that connects NetBackup media servers and storage servers. When possible, the network used for NetBackup media server to Data Domain system communications should be segregated from other production networks. While not always possible based on customer requirements and pre-existing NetBackup media server and network infrastructure

SYMANTEC NETBACKUP OPENSTORAGE

NetBackup Media Server Eth0 Eth1 Eth2 Eth3

Data Domain OpenStorage Storage Unit

need arise to modify, alter, or enhance the solution, documentation lays the groundwork for moving forward. In the event that technical support or other assistance is required, documentation can assist in rapid problem isolation and resolution. 4 Topology Diagram: This basic diagram consists of a map of physical components labeled using the recommended naming conventions. Also included are the individual networks and IP addresses of the components. This common sense approach makes it possible for others within or outside of the organization to quickly understand the overall view of the deployed solution when needed. 4 Data Collection: Collecting and recording relevant conguration information is consistent with the creation of best practice documentation. On the NetBackup master server and each NetBackup media server used for the OpenStorage solution, the following commands (or their equivalent) should be executed:

Figure 6: Recommended use of multiple 1 GbE networks In the example above, each of four NetBackup media servers connects to a specic NIC on a Data Domain system congured as a single storage server. Each NetBackup media server is congured to use DNS or a local host le such that the storage server name resolves to a specic interface on the Data Domain system.

Multiple 1 GbE connections between a NetBackup media server and Data Domain system should be congured as follows: 4 Congure a single LSU on the Data Domain system. 4 Congure a unique storage server for each network connection. 4 Congure a unique disk pool for each storage server using the single LSU. 4 Congure a storage unit for each unique disk pool. 4 Optionally, create a storage unit group that includes the storage units congured earlier. The storage unit group should use the Media server load balancing storage unit selection algorithm. Backup policies can be congured to use the storage unit group versus specic storage units. Note that the following limitations exist with this conguration: 4 Capacity reporting within NetBackup is skewed as NetBackup believes that there are multiple separate physical storage servers. 4 Manual assignment of NetBackup policies to one of four storage units is required. Alternatively, a storage unit group may be used. 4 NetBackup media server load balancing may not function normally. To overcome this challenge, storage unit groups may be created that utilize Media Server load balancing selection criteria. 4 Administrative complexity and overhead is increased with additional network names, storage servers, disk pools, storage units, and possibly storage unit groups.

For host configuration:


4 uname a 4 hostname

For network configuration:


4 ifcong 4 cat /etc/hosts 4 route 4 cat /etc/resolv.conf 4 grep hosts /etc/nsswitch.conf

For NetBackup configuration:


4 /usr/openv/netbackup/bin/admincmd/bpstulist L 4 cat /usr/openv/netbackup/bp.conf 4 cat /usr/openv/netbackup/bin/version

For OpenStorage configuration:


4 /usr/openv/netbackup/bin/admincmd/bpstsinfo pi stype DataDomain 4 /usr/openv/netbackup/bin/admincmd/nbdevquery listdp 4 /usr/openv/netbackup/bin/admincmd/nbdevquery listdv stype DataDomain 4 /usr/openv/netbackup/bin/admincmd/nbdevquery liststs 4 /usr/openv/volmgr/bin/tpcong dsh all_hosts 4 /usr/openv/volmgr/bin/tpcong dsh stype DataDomain On each Data Domain system the following commands should be executed:

2.3 Documentation
With multiple NetBackup media servers, multiple Data Domain systems, and the potential use of multiple networks combined with different geographical locations, the importance of documenting the deployed solution cannot be overemphasized. Proper documentation enables various site and vendor groups including management, data protection administrators, and network administrators to understand and maintain the deployed solution. Should the

For host configuration:


4 hostname 4 system show version 4 user show list

SYMANTEC NETBACKUP OPENSTORAGE

For network configuration:


4 net show settings 4 net show hardware 4 net show dns 4 net hosts show 4 net aggregate show 4 net cong
10 GbE Private Network 10 GbE Private Network NetBackup Media Server NetBackup Media Server

For OpenStorage configuration:


4 ost status 4 ost lsu show 4 ost show user-name 4 ost show connections
1 GbE Public Network Optimized duplication source storage unit Optimized duplication destination storage unit

3 Optimized Duplication
Optimized duplication replicates deduplicated data between source and destination targets, and typically requires only a fraction of the network bandwidth consumed by backup or recovery jobs. Simple in principle, optimized duplication is also simple to congure once requirements are understood.

Figure 7: Separate source and destination NetBackup media servers Figure 7 depicts optimized duplication between two OpenStorage storage units. The NetBackup media server initiating an optimized duplication job needs to have credentials to access both the source and destination OpenStorage storage units.

3.1 Storage Units and Storage Server Access


The rst item to consider is source and destination storage units. If both reside on the same NetBackup media server, that NetBackup media server needs credentials to access both the source and destination Data Domain OpenStorage Servers. When the source and destination storage units reside on different NetBackup media servers, the NetBackup media server initiating the optimized duplication job requires credentials to access both the source and destination storage servers. What this means is that even though a particular NetBackup media server may never directly backup to or recover from a particular Data Domain OpenStorage Server, it still needs access credentials when the Data Domain OpenStorage Server is an optimized duplication destination.

Credentials are set by means of the NetBackup tpcong command on each NetBackup media server requiring access to a given OpenStorage storage unit. This allows the NetBackup media server to use the OpenStorage storage unit for backup and recovery jobs, as well as for optimized duplication. In cases where optimized duplication uses a destination OpenStorage storage unit that may be geographically distant from the NetBackup media server initiating optimized duplication, the storage unit denition should not allow the geographically distant NetBackup media server to use the storage unit for backup or recovery jobs. This is easily accomplished from within the NetBackup storage unit dialog window as shown in gure 8.

SYMANTEC NETBACKUP OPENSTORAGE

3.3 Throttling Optimized Duplication


Throttling is controlled at a global level on each Data Domain system. Limiting the rate of network bandwidth used by the replication process can be based on various criteria such as a scheduled or temporary rate. Caution should be exercised as throttling back network bandwidth consumption may extend optimized duplication run times. The net effect of network bandwidth throttling may impact recovery point objectives for disaster recovery. Other effects might include the queuing of jobs, as optimized duplication jobs contribute to destination storage unit concurrent jobs. Once a storage units maximum concurrent jobs parameter has been reached, new jobs requiring the use of the storage unit will be queued as they await storage unit resource availability. Based on service level requirements, it might be possible to limit the quantity of network bandwidth required by limiting the amount of data that needs to be replicated. One possibility worth considering is the optimized duplication of only full backups, where incremental backups are not duplicated. Different storage lifecycle policies can be employed for full and incremental backups should this methodology align with service level objectives.

Figure 8: Storage Unit dialog Use only specic media servers In the example above, the NetBackup storage unit named dd120b-stu has been congured to allow only the NetBackup media server named NBU65OST_Media2 to use it for backup and restore jobs. The NetBackup media server named NBU65OST_Media1 has credentials to access the storage unit for the purpose of initiating optimized duplication jobs.

3.4 Optimized Duplication Failures 3.2 Network Considerations


Replicating backup images under the control of NetBackup optimized duplication includes the ability to use the same network that is used for backup and restore operations, or to use a different network. In cases where optimized duplication trafc ows between geographically different locations, some customers have chosen to use a separate dedicated network connection. This connection links source and destination Data Domain systems specically for the purpose of replication controlled by NetBackup-initiated optimized duplication. User requirements to track WAN link usage may also prefer this approach. The network used for optimized duplication is based on network name resolution on the source Data Domain system. The destination Data Domain system is known to the source Data Domain system based on the IP address supplied by DNS, or by a local hosts le entry. Populating the source Data Domain systems hosts le with the desired IP address of the destination Data Domain system is all that is required to use a specic NIC and network. If this value is not present, NetBackup will perform optimized duplication using the same network it uses to access the source and destination Data Domain systems for backup and restore jobs. In the event that an optimized duplication job fails, the resulting action taken by NetBackup is dependent on the version of NetBackup being used. 4 Prior to NetBackup version 6.5.4, when an optimized duplication job fails, duplication job retry will attempt to use conventional duplication. This equates to sending the fully inated complete backup image from the source OpenStorage storage unit through one (or possibly two) NetBackup media server(s) to the destination OpenStorage storage unit. 4 NetBackup version 6.5.4, by default, will not retry a failed optimized duplication job unless it was initiated by means of a storage lifecycle policy. When retried via a storage lifecycle policy, the job will attempt to use optimized duplication only. The default behavior can be overridden by conguring NetBackup to retry with conventional duplication. See the Veritas NetBackup Shared Storage Guide UNIX, Windows, Linux Release 6.5.4 document for detailed information. At present there is no known way to congure a NetBackup storage lifecycle policy such that a failed duplication job will not be retried. However, a manually driven NetBackup utility nbstlutil can be used to cancel pending duplication operations.

SYMANTEC NETBACKUP OPENSTORAGE

3.5 Seeding Remote Data Domain Systems


New deployments looking to utilize optimized duplication may have network bandwidth limitations between sites that could cause the rst weeks duplication jobs to extend substantially. One solution to this dilemma is to seed the remote Data Domain OpenStorage Server locally, and then after a week or so relocate the Data Domain system to the intended site. The risk this presents is that while the remote system is in transitoptimized duplication failures may occur. One solution is to adjust the appropriate NetBackup storage unit so that no failures occur. By setting the storage unit Maximum concurrent jobs parameter to a value equal to zero, optimized duplication jobs will enter a queued state instead of failing, as shown in gure 9.

Beginning with DD OS version 4.6, stream count information is available in the Data Domain Operating System User Guide.

NetBackup monitors the number of jobs running on a particular storage unit in order to enforce the maximum number of current jobs value. Monitored jobs include backup and duplication operations. The storage unit maximum concurrent jobs parameter however, does not distinguish between backup and duplication jobs. There is no facility within NetBackup that can be used to limit backup and duplication jobs separately. Solution architects should be cautious and monitor the number of simultaneously executing backup and duplication jobs based on prescribed limits in order to assure optimal performance.

3.6 Duplication Job Configuration Options


NetBackup storage lifecycle policies provide the ideal vehicle for initial backups as well as the ability to create duplicate backup images. Storage lifecycle policy duplication tasks initiate optimized duplication jobs on OpenStorage storage units. 4 Data Domain recommends using NetBackup storage lifecycle policies to control optimized duplication. 4 Storage lifecycle policies facilitate setting different retention periods for backup and duplication jobs. 4 Data Domain recommends using Fixed retention periods versus Staged capacity managed and Expire after duplication retention period types. 4 Data classication can be used in conjunction with storage lifecycle policies if desired, but doing so is not a requirement for optimized duplication.
Figure 9: Storage unit dialog maximum concurrent jobs The storage unit Maximum concurrent jobs parameter is set to zero. Use this technique when relocating a Data Domain system from a local site to a nal destination site so that related optimized duplication jobs will enter a queued state instead of failing.

4 Storage unit groups can be used as backup destinations within storage lifecycle policies, but are not recommended for use as duplication destinations.

3.5.1 Storage Unit Maximum Concurrent Jobs


Data Domain systems have NVRAM-, memory-, model-, and operating system-dependent recommendations regarding maximum write, read, and replication stream counts. The best practice recommendation is to set the NetBackup storage unit maximum concurrent jobs parameter equal to the combined backup and replication stream count values for a given Data Domain system based on NVRAM, memory, model, and DD OS version. Restore jobs are not considered in this setting as they are typically performed infrequently.

Figure 10: Storage lifecycle policy This example contains a backup storage destination equal to storage unit dd120a-stu with a xed retention period of one week. The example also includes a duplication destination equal to dd120b-stu with a xed retention period of six months. The storage lifecycle policy has optionally been assigned a data classication value equal to Platinum. When the duplication task is executed it will result in an optimized duplication job that appears in the NetBackup activity monitor.

Stream count information is available from Data Domain for setting the NetBackup storage unit maximum concurrent jobs parameter.

10

SYMANTEC NETBACKUP OPENSTORAGE

Storage lifecycle policy duplication relies upon certain default settings that control the point at which a duplication job will be launched. An optional conguration le can be created to customize lifecycles to run duplication jobs based on customer requirements. Out of the box defaults as for NetBackup version 6.5 include: 4 MIN_KB_SIZE_PER_DUPLICATION_JOB = 8 GB 4 MAX_KB_SIZE_PER_DUPLICATION_JOB = 25 GB 4 MAX_MINUTES_TIL_FORCE_SMALL_DUPLICATION_JOB = 30 minutes Optimized duplication testing with backup images less than 8 GB in size may be delayed by up to 30 minutes as a result of the default settings. In a test only environment it may make sense to alter the default value for the MAX_MINUTES_TIL_FORCE_SMALL_ DUPLICATION_JOB to a value of less than 30 minutes. In some environments the default settings may be appropriate. The default settings can be adjusted by creating a LIFECYCLE_ PARAMETERS le. The Veritas NetBackup Administrators Guide, Volume I should be consulted for additional information before adjusting these values.

Figure 11: Maximum backup copies By default the NetBackup global attribute Maximum backup copies is set to a value of two. As shown above, altering the value to accommodate additional copies is easily performed via the administrative GUI.

4.1 Hierarchical Duplication


NetBackup version 6.5.4 introduces a new storage lifecycle policy capability referred to as hierarchical duplication. Hierarchical duplication provides the ability to create new duplicate backup image copies from a specic prior backup image copy. (Figure 12)

4 Duplication to Tape
Requirements to retain long term copies of backup images on removable tape media are easily integrated with OpenStorage solutions. Multiple means of accomplishing this objective currently exist, including enhanced functionality with NetBackup version 6.5.4. 4 NetBackup supports the duplication of backup images, not media or specic tape cartridges. 4 NetBackup supports the creation, cataloging, and tracking of up to ten copies of a particular backup image. 4 The default value for Maximum backup copies is two. 4 The Maximum backup copies parameter can be adjusted with the NetBackup administrative GUI via Host Properties > Master Servers > Global Attributes. (Figure 11)

Figure 12: Storage lifecycle policy with hierarchical duplication This storage lifecycle policy example includes an initial backup destination followed by two duplication destinations. The rst duplication destination will be created from the initial backup copy. The second duplication destination (indented) will be created from the previous duplicated copy. Copy 1 is the initial backup, copy 2 is created from copy 1, and copy 3 is created from copy 2

SYMANTEC NETBACKUP OPENSTORAGE

11

4.2 Tape Creation from the Primary NetBackup Copy


Versions of NetBackup prior to version 6.5.4 do not support hierarchical duplication from within storage lifecycle policies. The default NetBackup behavior is to create all duplicates using the primary backup image copy as source data. Regardless of backup image copy number, the primary copy (by default) will be used to fulll the duplication request. Four basic methods of initiating a duplication job using the primary copy currently exist within NetBackup: 4 Ad-hoc GUI-based duplication job initiation using the NetBackup catalog utility 4 Command line or script driven bpduplicate commands 4 Storage lifecycle policies 4 NetBackup Vault option The underlying requirement for these techniques is that the desired source backup image is the primary NetBackup copy, which may not always be the case. Take, for example, the execution of a storage lifecycle policy used in conjunction with optimized duplication. The initial backup to the rst Data Domain OpenStorage Server will be copy 1, and it will also be the primary copy. Optimized duplication of this copy to a second Data Domain OpenStorage Server will result in the creation of copy 2. Copy 2 will be a non-primary copy as long as copy 1 is still being retained, or until copy 2 is manually set to primary. Users seeking to create tape-based copies from copy 2 may elect to allow copy 1, the primary copy, to expire. At the point where copy 1 expires, copy 2 is set to primary by NetBackup. Execution of a properly congured NetBackup Vault Option policy will then use copy 2 to create copy 3. The criteria for conguring a NetBackup Vault Policy that accomplishes this objective is based on criteria that selects backups which occurred in the past, such that copy 1 no longer exists.
Figure 13: NetBackup Vault option prole The NetBackup Vault option provides the ability to specify granular selection criteria of backup images for duplication. In the example shown above, backups started between 16 and 15 days prior to the execution of a Vault job will be selected for inclusion. This strategy allows copy 1 of a backup image to expire, and copy 2 (which has been set to primary by NetBackup) to be used to fulll a duplication request.

4.3 Tape Creation from a Non Primary NetBackup Copy


The bpdulicate command can be executed via command line or a script based solution with the copy number (cn) parameter. This enables the creation of tape-based duplicate backup images from non-primary NetBackup image copies. See the NetBackup Command Guides for additional information.

5 Multiple NetBackup Domain Configurations


By denition a NetBackup domain refers to a unique NetBackup installation or instance. A NetBackup domain will consist of one master server, some number of media servers, and some number of clients. This section examines the use of Data Domain systems that are congured for use in multiple NetBackup domains, where two or more NetBackup master servers have congurations that relate to a single physical Data Domain system.

12

SYMANTEC NETBACKUP OPENSTORAGE

Data Domain recommends limiting the maximum number of NetBackup domains that share a Data Domain system based on the number of unique network interface connections present on the Data Domain system. For instance, a Data Domain system with two network interface connections could be shared by a maximum of two NetBackup domains. While this type of conguration provides obvious cost benets, there are applicable cautionary statements: 4 NetBackup capacity reporting may be skewed as each NetBackup domain sees the Data Domain system as a unique storage pool and believes that is has complete and total usage of the available capacity. 4 The maximum number of concurrent jobs supported by a single Data Domain system is effectively shared across more than a single NetBackup domain. There is no mechanism available to arbitrate the usage of available backup, restore, or optimized duplication streams between different NetBackup domains. These values are manually set by the user and may result in sub-optimal resource utilization. 4 Backup images cannot be shared between different NetBackup domains by means of conguring a Data Domain system for usage in two or more NetBackup domains.

Figure 14: Data Domain systems in multiple NetBackup domains This gure depicts two Data Domain systems, each of which is shared between two different NetBackup domains. Note that each Data Domain system has been congured with two LSUs, one for each NetBackup master server. Highly recommended in this type of deployment are naming conventions that assist in eliminating any potential confusion over what resources are used by each NetBackup domain.

Deploying a single Data Domain system in multiple NetBackup domains provides cost advantages when compared to the alternative of using different Data Domain systems for each NetBackup domain. Take, for example, the scenario where there are two NetBackup domains. Each NetBackup domain is in a different data center. Data center 1 performs local backups to a Data Domain system and then uses optimized duplication to create a copy of the backup images at data center 2. Likewise, at data center 2 local backups are performed to a Data Domain system and optimized duplicates are created at data center 1. Using non-shared Data Domain systems would require four separate systems. The shared model is cost effective in that it only requires 2 separate systems. Note: It is important to understand that there is no ability to share backups or NetBackup catalog data between NetBackup domains with this implementation. Data Domain recommends using naming conventions that assist in simplifying the shared use of Data Domain systems in multiple NetBackup domains. For instance, a single Data Domain system shared by two NetBackup domains would have two LSUs congured. Both LSUs will be visible to each different NetBackup domain. Naming the LSUs such that they include the Data Domain system name, the NetBackup master server name, and the -lsu sufx assists in properly conguring each LSU as a disk pool within NetBackup.

6 Disaster Recovery
6.1 Within the Same NetBackup Domain
This scenario assumes that the NetBackup instance in which recovery is to be performed is the same NetBackup instance that performed the initial backup and subsequent optimized duplication job. The rst copy of a backup image created by NetBackup is known as copy 1. When initially written, copy 1 is also known as the primary copy. As of NetBackup version 6.5.4 and prior, the primary copy is the copy that is used to fulll restore requests. An optimized duplication job will create copy 2 of the backup image. If copy 1 has a retention period of two weeks and copy 2 has a retention period of one year, copy 1 will be the primary copy until it expires, at which point copy 2 will become the primary copy. In the case where copy 1 has not expired and is still the primary copy, and a need arises to recover data from copy 2, copy 2 must be set to primary such that it can be used to fulll the restore request. Setting a particular copy to primary can be performed via the NetBackup GUI catalog utility.

SYMANTEC NETBACKUP OPENSTORAGE

13

6.1 Within the Same NetBackup Domain


This scenario assumes that the NetBackup instance in which recovery is to be performed is the same NetBackup instance that performed the initial backup and subsequent optimized duplication job. The rst copy of a backup image created by NetBackup is known as copy 1. When initially written, copy 1 is also known as the primary copy. As of NetBackup version 6.5.2 and prior, the primary copy is the copy that is used to fulll duplication requests as well as restore requests. An optimized duplication job will create copy 2 of the backup image. If copy 1 has a retention period of two weeks and copy 2 has a retention period of one year, copy 1 will be the primary copy until it expires, at which point copy 2 will become the primary copy. In the case where copy 1 has not expired and is still the primary copy, and a need arises to recover data from copy 2, copy 2 must be set to primary such that it can be used to fulll the restore request. Setting a particular copy to primary can be performed via the NetBackup GUI catalog utility.

Figure 17: Setting primary copy Setting a non-primary backup image copy to primary can be accomplished via a pop-up menu. This enables the use of a particular backup image to fulll restore requests. This can be useful in cases where recovery from a specic geographical location is desired or in cases where the original primary copy is not available.

6.2 To a Different NetBackup Domain


OpenStorage currently provides no automated ability to synchronize or share NetBackup catalog data across multiple NetBackup domains. OpenStorage backups can be imported into foreign NetBackup domains using standard NetBackup import procedures.

7 Additional References
Data Domain secure access customer support site: https://my.datadomain.com/
Figure 15: NetBackup catalog copy 1 primary copy The NetBackup catalog utility can be used to select the primary copy of a backup image.

OpenStorage (OST) User Guide OpenStorage (OST) Quick Start Data Domain Operating System User Guide Symantec: http://www.symantec.com/business/support/documentation.jsp? language=english&view=manuals&pid=15143 NetBackup Administration Guides NetBackup Shared Storage Guide NetBackup Vault Administrators Guide NetBackup Command Guides

6.1.1 From a Non-Primary Backup Copy


Assuming a particular backup image has been duplicated, copy 2 can be used to fulll a restore request if it is the primary copy. Similar to the way copy 1 can be queried from the NetBackup catalog utility, copy 2 can be queried using the same methodology.

Figure 16: NetBackup catalog copy 2 not primary The NetBackup catalog utility can be used to select copy 2 of a backup image.

Setting a non-primary copy to primary is easily accomplished by right-clicking the image and then selecting Set Primary Copy from the pop-up menu.

14

SYMANTEC NETBACKUP OPENSTORAGE

8 Conclusion
Data Domain support for Symantec NetBackup OpenStorage advances the ability to use disk as disk, store more data on disk with inline deduplication, and simplify the creation of backup copies with optimized duplication. Creating duplicate backup copies with optimized duplication enables advanced disaster recovery strategies. Disaster recovery copies of backup images are created faster, and are available at the disaster recovery location sooner when compared to tapebased solutions. Creating any required tape-based duplicate backup copies, sourcing the duplicate from either the initial backup or the optimized duplicate copy, facilitates long-term retention of backup images. Consolidated tape operations, where fewer data centers create tape-based backup copies, is made possible via the exibility to duplicate to tape at either source or destination data centers.

9.2 Existing Backups Retain or Duplicate?


At some point during the migration process, all ongoing backup jobs will theoretically use newly congured OpenStorage storage units. Backups performed to legacy basic disk or VTL media manager storage units will still exist based on the retention periods of the backup policies used to perform the backups. Should these backups be duplicated to new OpenStorage storage units and then be expired, or should they simply be left to expire naturally? When the retention period of backups written to legacy basic disk or VTL media manager storage units is relatively short, natural expiration may be the logical choice as it imposes no additional administrative overhead into the migration process. Other factors to consider are the number of backups that would need to be duplicated, and the quantity of data that would need to be duplicated. Duplicating a large number of backup images or a large quantity of backup data may not be realistic based on the additional workload it will impose on the backup infrastructure. Duplicating existing backups to new OpenStorage storage units has they key advantage of allowing the legacy infrastructure to be deleted. Once all backup images on basic disk or VTL media manager storage units have been duplicated to OpenStorage storage units, the old storage units can be deleted.

8 Appendix Migration to OpenStorage


Existing deployments using Data Domain systems as basic disk storage units or VTL media manager storage units may at some point be migrated to an OpenStorage solution. This appendix item explores the questions and strategies involved with migration. 4 Can a Data Domain system be used simultaneously as both an OpenStorage storage unit and a basic disk storage unit? 4 Can a Data Domain system be used simultaneously as both an OpenStorage storage unit and a VTL media manager storage unit? 4 Should existing backup images be retained until their expiration, or should they be duplicated to a new OpenStorage storage unit? 4 Are storage lifecycle policies required for use with OpenStorage? 4 How are existing NetBackup policies modied to use OpenStorage storage units? 4 Should legacy Data Domain replication be de-congured? 4 Can existing replicas be imported into the NetBackup catalog?

9.3 Are Storage Lifecycle Policies Required?


Storage lifecycle policies provide a plan-based view of backup and duplication jobs, and can be associated with a data classication rank. While not explicitly required they provide a simple and effective vehicle to perform backups followed by optimized duplication between two storage servers. In cases where a single Data Domain system is congured as a storage server, or where optimized duplication between storage servers will be performed using alternative means, the use of a storage lifecycle policy is optional.

9.4 NetBackup Policy Modification


Once OpenStorage storage units are congured, existing NetBackup policies that use basic disk or VTL media manager storage units can be updated to use the OpenStorage storage units. Change existing NetBackup policies by selecting the appropriate OpenStorage storage unit or storage lifecycle policy as appropriate and save the NetBackup policy. All backups performed with the NetBackup policy from this point moving forward will use the new OpenStorage storage unit. Optimized duplication will be invoked automatically by NetBackup if congured within a selected storage lifecycle policy.

8.1 Multiple Protocols on One Data Domain System


While not specically recommend, the simultaneous use of multiple protocols on the same Data Domain system is supported. One Data Domain system can simultaneously serve as a storage server, basic disk storage unit, and VTL. This functionality is particularly useful when migrating from basic disk or VTL usage to an OpenStorage solution.

9.5 Legacy Replication


At the point where legacy replication is no longer required, it can be disabled. OpenStorage optimized duplication requires that the Data Domain replication license remain intact on both source and destination storage servers.

SYMANTEC NETBACKUP OPENSTORAGE

15

9.5.1 Can Legacy Replicas Be Imported?


Existing replicated backup images that were created without OpenStorage continue to have the same limitations they had before the OpenStorage solution was implemented. If the source copy of a particular backup image is cataloged by NetBackup, any replica copies of the backup image are in effect already cataloged and cannot be imported. It is important to note that legacy replicas are not known to NetBackup as copy 2 of a particular backup image. These replicas were created without the knowledge of NetBackup and have the same backup identier as the source image. Note that duplicate copies of backup images created with OpenStorage optimized duplication do not need to be imported because the NetBackup catalog is already aware of their existence and is tracking them accordingly.

9.6 Deleting Legacy Storage Units


Once the OpenStorage solution has been implemented and all pre-existing backup images have expired, the legacy storage unit components can safely be deleted from the NetBackup conguration.

Data Domain | 2421 Mission College Blvd., Santa Clara, CA 95054 | 866-WE-DDUPE, 408-980-4800
Copyright 2010 Data Domain LLC. All rights reserved. Data Domain, Inc. believes information in this publication is accurate as of its publication date. This publication could include technical inaccurancies or typographical errors. The information is subject to change without notice. Changes are periodically added to the information herein; these changes will be incorporated in new additions of the publication. Data Domain, Inc. may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time. Reproduction of this publication without prior written permission is forbidden. The information in this publication is provided as is. Data Domain LLC makes no representations or warranties of any kind, with respect to the information in this publication, and specically disclaims implied warranties of merchantability or tness for a particular purpose. Data Domain and Global Compression are trademarks of Data Domain LLC. All other brands, products, service names, trademarks, or registered service marks are used to identify the products or services of their respective owners.WP-OSTBPG-0310

DEDUPLICATION STORAGE

www.datadomain.com

You might also like