Professional Documents
Culture Documents
Abstract
The HP Matrix Operating Environment 7.1 Recovery Management User Guide contains information on installation, configuration, testing, and troubleshooting HP Matrix Operating Environment recovery management (Matrix recovery management).
Copyright 2012 Hewlett-Packard Development Company, L.P. Confidential computer software. Valid license from HP required for possession, use or copying. Consistent with FAR 12.21 1 and 12.212, Commercial Computer Software, Computer Software Documentation, and Technical Data for Commercial Items are licensed to the U.S. Government under vendor's standard commercial license. Warranty The information contained herein is subject to change without notice. The only warranties for HP products and services are set forth in the express warranty statements accompanying such products and services. Nothing herein should be construed as constituting an additional warranty. HP shall not be liable for technical or editorial errors or omissions contained herein. Acknowledgments Microsoft, Windows, and Windows Server are U.S. registered trademarks of Microsoft Corporation. VMware, VMware Server, GSX Server, ESX Server, and VMotion are trademarks of VMware, Inc. Adobe and Acrobat are trademarks of Adobe Systems Incorporated. Revision history For supported operating systems, see the HP Insight Management 7.1 Support Matrix available at http://www.hp.com/go/matrixoe/docs.
Table 1
Document part number 59002276 59002276 59002276 59002276 5900-2035 Software version 7.1.0 7.1.0 7.1.0 7.1.0 7.0.0 Documentation edition 4 3 2 1 1 Publication date September 2012 August 2012 July 2012 June 2012 February 2012
Contents
1 Matrix recovery management Overview........................................................5 2 Installing and configuring Matrix recovery management..................................8
Installation and configuration overview........................................................................................8 Installation and configuration prerequisites...................................................................................8 Installing and licensing Matrix recovery management....................................................................8 Uninstalling Matrix recovery management...............................................................................9 Setting up Networking..............................................................................................................9 Setting up Storage..................................................................................................................10 General storage setup notes................................................................................................11 HP P6000 storage setup notes.............................................................................................11 HP P9000 storage setup notes.............................................................................................12 HP 3PAR storage setup notes...............................................................................................12 Creating and installing a User Defined storage adapter..........................................................13 Setting up Local Site logical servers...........................................................................................16 Setting up Remote Site logical servers........................................................................................16 Configuring Matrix recovery management.................................................................................17 Matrix recovery management GUI overview..........................................................................17 Matrix recovery management configuration steps...................................................................18 Matrix recovery management export and import operations....................................................20 DR protection for IO services....................................................................................................22 DR Protection of IO services configuration overview................................................................22 Configure IO properties......................................................................................................23 Configure OO workflow for optional email notification...........................................................24 Network configuration........................................................................................................25
6 Troubleshooting........................................................................................51
Configuration troubleshooting..................................................................................................51 Configuration error messages...................................................................................................53 Warning messages.................................................................................................................56 Matrix recovery management Job troubleshooting.......................................................................57 Failover error messages...........................................................................................................60 Matrix recovery management log files.......................................................................................61 DR Protected IO serivces troubleshooting...................................................................................61 DR Protected IO services configuration troubleshooting...........................................................62 DR Protected IO services failover troubleshooting...................................................................64
Glossary....................................................................................................68
Contents
Features and benefits of Matrix 7.1 recovery management Provides an automated failover mechanism for DR Protected logical servers, DR Protected IO services, and associated storage. Provides a disaster recovery solution for logical servers and IO services managed by the HP Matrix Operating Environment. NOTE: Supports DR Protection of IO services with virtual server groups only. DR Protection of IO services associated with physical server groups is not supported at this time. Extends the HP Matrix Operating Environment value of a common management interface to the realm of disaster recovery for logical servers running on physical or virtual machines and for IO services running on virtual machines. Supports hypervisor hosted virtual machines. Supports standalone Hyper-V. Supports Microsoft Windows Server 2008 with Hyper-V R2 in a clustered configuration with CSV. Supports flexible cross-technology logical servers. VC hosted logical servers can fail over to become VM hosted logical servers, and VM hosted logical servers can fail over to VC hosts. Supports multiple logical servers and IO services in a Recovery Group. Supports bidirectional failover of Recovery Group Sets between two sites, allowing both activated and deactivated Recovery Groups to exist at the same site. Supports HP P6000 Continuous Access Software storage replication in synchronous and asynchronous modes. Supports HP P9000 Continuous Access Software storage replication in synchronous, asynchronous, and asynchronous journal modes. Supports HP 3PAR synchronous and asynchronous data replication. Supports integration with the remote failover features of installed storage replication products, other than HP P6000, HP P9000, or HP 3PAR. These storage replication products are referred to as User Defined storage in Matrix recovery management.
Includes Recovery Group startup order settings that let you determine which Recovery Groups are recovered first during a site failover. Includes a Copy feature that makes it easy to create multiple Storage Replication Groups with the same configuration parameters.
By reading this HP Matrix Operating Environment 7.1 Recovery Management User Guide, you will gain a better understanding of Matrix recovery management concepts and configuration testing. The Matrix recovery management graphical user interface (GUI), online help, and tooltips provide task-specific guidance.
Setting up Networking
It is assumed that networking links are present between the Local Site and the Remote Site. You can use Matrix recovery management in a variety of networking configurations, but it is important that you take note of the following Matrix recovery management networking configuration parameters: In a Matrix recovery management configuration, workloads can be running on DR Protected logical servers at both sites. A Recovery Group can only be activated (workloads running) on one site at any time, but it can be activated at either site such that workloads associated with different Recovery Groups can be running at both sites simultaneously. For this reason, network services such as DNS, DHCP, WINS, and AD must be available locally at both sites. If one site becomes inoperative due to a disaster, network services continue to be available at the other site based on the native disaster recovery capability in these services. This ensures that workloads can be failed over from the failed site to the other site in the Matrix recovery management configuration. Matrix recovery management must not be used to failover network services. NOTE: The Matrix recovery management startup order feature is intended to start up critical applications first, not to ensure that startup dependencies are met between applications and infrastructure services such as networking. Matrix recovery management does not perform DNS updates or update the IP configuration of recovered logical servers during a failover operation. Your Network Administrator is responsible for making the necessary modifications to ensure that network services are available if you configure a logical server to use a different IP or subnet at each site in the Matrix recovery management configuration. When running on physical targets (VC hosted) or non VMware ESX virtual targets (VM hosted), Matrix recovery management does not ensure that logical servers use the same MAC addresses at both sites. When running on VMware ESX hosted virtual targets, Matrix recovery management does ensure that logical servers use the same MAC address at both sites. Your Network Administrator needs to plan for this in the networking configuration for DR Protected logical servers, if you are using DHCP. NOTE: For MAC address details for cross-technology logical servers (logical servers that are capable of running on both VC hosts and VM hosts), see Dynamic workload movement with CloudSystem Matrix. When running on HP Virtual Connect hosted physical targets, the Portable Images Network Tool (PINT) must be used to prepare the server image to execute on targets with different network interface configurations and MAC addresses. To use PINT, the Local and Remote Sites must be on the same network, and the OS image must be a Linux version that is supported by Matrix recovery management. PINT ensures that the static network configuration from the source server is successfully transferred to the destination server network interfaces, despite the different environment. The executables and README are in the <SMP>/PI/PINT folder, where <SMP> is the folder where HP Insight Control server migration is installed. Copy the executable cp011231.exe to the physical server where the image is currently running. Run cp011231.exe to install PINT and start the PINT service. For information on supported Linux versions and HP Insight Control server migration, see the HP Insight Management 7.1 Support Matrix at http://www.hp.com/go/matrixoe/docs.
Setting up Networking 9
For information on PINT, see the Portable Images Network Tool (PINT) Linux readme version 1.0.0 available at http://h20000.www2.hp.com/bc/docs/support/SupportManual/ c01726723/c01726723.pdf?jumpid=reg_R1002_USEN. If the Local Site and corresponding Remote Site managed servers share a common subnet, you must ensure that there is no conflict between MAC addresses assigned by HP Virtual Connect Enterprise Manager (VCEM). For example, if the default address range provided by VCEM is used at both sites, conflict can be avoided by using the VCEM exclusion ranges feature. For example, on the Local Site CMS, exclude addresses from 00-21-5A-9B-00-00 to 00-21-5A-9B-FF-FF, and on the Remote Site CMS, exclude addresses from 00-21-5A-9C-00-00 to 00-21-5A-9C-FF-FF. For DR protected logical servers and IO services, ESX port group names and Hyper-V virtual network names must be identical at the Local Site and the Remote Site.
Setting up Storage
Matrix recovery management depends on storage array replication to enable failover of logical servers. It is assumed that storage replication links are present between the Local Site and the Remote Site. HP P6000 Continuous Access Software Disaster Recovery Groups are a good example of this concept. To set up Matrix recovery management Storage Replication Groups: 1. Boot and data LUNs for VC or VM hosted logical servers that are to be DR Protected must be replicated using supported storage replication methods, for example, HP P6000 Continuous Access, HP P9000 Continuous Access, HP 3PAR, or a User Defined storage replication method. User Defined storage replication is supported when you create and install a storage adapter in your Matrix recovery management configuration for a storage type other than HP P6000, P9000, or HP 3PAR. For example, if you create and install a storage adapter named EMC, the Storage server type drop-down menu for configuring a Storage Management Server includes EMC as a storage server type. For more information see Creating and installing a User Defined storage adapter (page 13). If a DR Protected logical server at the Local Site is VC hosted, the replicated boot and data LUNs on the array at the Remote Site must be presented to the corresponding recovery logical server. If a DR Protected logical server at the Local Site is VM hosted, the replicated boot and data LUNs on the array at the Remote Site must be presented to the VM host (for example, ESX) at the Remote Site that is targeted to run the recovery logical server (for example, ESX guest).
NOTE: Each Recovery Group has a single Storage Replication Group that is used by the logical servers in that Recovery Group only. All boot and data LUNs used by these logical servers must be included in the same Storage Replication Group. A Storage Replication Group is a set of storage LUNs on a particular disk array that are replicated with write order preserved. This corresponds to the HP P6000 Continuous Access DR Group concept as well as the HP P9000 Continuous Access consistency group concept.
2. 3.
Create one Storage Replication Group for each set of logical servers that will be included in a Recovery Group. Record the following details about the Storage Replication Group configuration. This information is required when you configure Matrix recovery management for replicated storage: Local and Remote Site storage identifiers, for example, an HP P6000 storage array WWN or an HP P9000 array serial number
10
NOTE: In the same way that conflict in the configuration of MAC addresses at the Local and Remote Sites is avoided in Setting up Networking (page 9), conflict must also be avoided in the configuration of WWNs, if the WWNs are not private to the respective sites. The same technique using VCEM exclusion ranges is available for array WWN configuration. Storage Management Server FQDN names and credentials for the Local and Remote Sites, for example, HP P6000 Command View server name and credentials to access the Command View server Storage Replication Group name given for the boot and data LUNs of the logical servers that will be part of the same Recovery Group, for example, the HP P6000 DR Group name NOTE:
HP P6000, HP P9000, and User Defined Storage Replication Groups must use the same Storage Replication Group name at the Local Site and the Remote Site. HP 3PAR remote copy Storage Replication Groups will have different names at the Local Site and the Remote Site.
Storage port WWN and LUN for the replicated LUN, in the case of raw LUNs used by DR Protected logical servers
HP P6000 Continuous Access Software documentation available at http:// h20000.www2.hp.com. Cick Manuals, then go to StorageStorage SoftwareStorage Replication SoftwareHP P6000 Continuous Access Software. HP P6000 Command View Software documentation available at http:// h20000.www2.hp.com. Click Manuals, then go to StorageStorage SoftwareStorage Device Management SoftwareHP P6000 Command View Software.
If the password for a Storage Management Server is changed, take the following actions to refresh the Storage Management Server password on the CMS and in the Matrix recovery management configuration: 1. Discover the Storage Management Server with the changed password. 2. Go to the Matrix recovery management user interface Storage Management Servers tab. 3. Select the Storage Management Server that has the changed password, and click on Edit. 4. Select the Refresh SIM Password box and click Save.
HP P9000 Cluster Extension Software documentation available at http:// h20000.www2.hp.com. Click Manuals, then go to StorageStorage SoftwareStorage Replication SoftwareHP Cluster Extension Software. HP P9000 RAID Manager Software documentation available at http:// h20000.www2.hp.com. Click Manuals, then go to StorageStorage SoftwareStorage Device Management SoftwareHP P9000 RAID Manager Software.
If the password for a Storage Management Server is changed, take the following actions to refresh the Storage Management Server password on the CMS and in the Matrix recovery management configuration: 1. Discover the Storage Management Server with the changed password. 2. Go to the Matrix recovery management user interface Storage Management Servers tab. 3. Select the Storage Management Server that has the changed password, and click Edit. 4. Select the Refresh SIM Password box and click Save.
12
Command Line Software. Both must be installed on the Central Management Server where Matrix recovery management is installed. For more information, see the following:
HP 3PAR Cluster Extension Software documentation available at http:// h20000.www2.hp.com. Click Manuals, then go to StorageStorage SoftwareStorage Replication SoftwareHP Cluster Extension Software. HP 3PAR InForm Command Line Software documentation available at http:// h20000.www2.hp.com. Click Manuals, then go to StorageStorage SoftwareStorage 3PAR Device Management SoftwareHP 3PAR InForm Software.
You need an encrypted password file to manage storage replication on an HP 3PAR storage system. An encrypted password file can be created by running HP 3PAR InForm Command Line Software commands. For more information, see the setpassword command in the HP 3PAR InForm Command Line Reference. The encrypted password file must be present in the STORAGE\3PAR\conf directory where Matrix recovery management is installed. The encrypted password file replaces the need for a user name, domain name, and password required with other types of storage management servers. The encrypted password file for both the Local Site and Remote Site Inserv storage servers must be available on the CMS at each site, and the name of the password file must be the same on the CMS at each site. If you upgrade the HP 3PAR InForm Command Line Software on the CMS with a software version that is supported by Matrix OE and HP 3PAR Cluster Extension Software, you must modify a property in the Matrix recovery management properties file. Change the INFORM_CLI_VERSION property in conf\hp_ir.properties where Matrix recovery management is installed on the CMS. The default value of the property is set to 3.1.1.
NOTE: After you install and configure Matrix recovery management at the Local Site, you must manually failover 3PAR remote copy groups to the Remote Site before attempting to configure install and configure Matrix recovery management at the Remote Site. For 3PAR periodic (asynchronous) remote copy, the manual failover action will not synchronize the data in the remote copy group volumes. To prevent data loss, synchronize the remote copy groups before performing stop and failover operations on the remote copy groups.
Setting up Storage
13
Managing nonintegrated storage with Matrix recovery management If your DR Protected logical servers use a nonintegrated storage system that is supported by Matrix OE, and you want Matrix recovery management to automatically invoke storage failover for the nonintegrated storage using the Matrix recovery management Activate operation, you must: 1. Implement and thoroughly test the three commands defined in the User Defined storage adapter interface specification, then perform testing at the Local Site and at the Remote Site. 2. Create a new subdirectory under the STORAGE directory where Matrix recovery management is installed. Give the subdirectory a name that identifies the storage type being managed. This name will appear in drop-down menus in the Matrix recovery management GUI. Perform this step at the Local Site and at the Remote Site. NOTE: The name of the subdirectory must be exactly the same at the Local Site and at the Remote Site. 3. 4. Place your implementation of the User Defined storage adapter commands in the subdirectory created for the storage type. Perform this step at the Local Site and at the Remote Site. Define local and remote Storage Management Servers for the nonintegrated storage by selecting the storage type from the drop-down menu in the Storage Management Servers tab in the Matrix recovery management GUI. The storage type in the drop-down menu has the same name as the subdirectory created in step 2. Using management tools for the nonintegrated storage system, create a Storage Replication Group for the storage used by the logical servers that will be DR Protected. Create a Storage Replication Group for the nonintegrated storage by using the Matrix recovery management GUI. The storage type in the drop-down menu has the same name as the subdirectory created in step 2. Create a Recovery Group by using the Matrix recovery management GUI, and associate that Recovery Group with the Storage Replication Group for the nonintegrated storage. Perform a Matrix recovery management Export operation at the Local Site to generate an exportconfig file, then perform an Import operation to import that exportconfig file at the Remote Site.
5. 6.
7. 8.
User Defined storage adapter interface specification The following commands are defined in the User Defined storage adapter interface specification: validatesms.cmdValidates a Storage Management Server during configuration validatesrg.cmdValidates a Storage Replication Group during configuration failoversrg.cmdFails over a Storage Replication Group while Recovery Group activation occurs For validatesms.cmd: sms_name=<name of a Storage Management Server> sms_username=<login name for a Storage Management Server> For validatesrg.cmd: sms_name=<name of the Storage Management Server at the Local Site> sms_username=<login name for the Storage Management Server at the Local Site> srg_name=<name of the Storage Replication Group to be validated> local_storage_id=<unique identifier of the storage system at the Local Site>
Command-line arguments
14
remote_storage_id=<unique identifier of the storage system at the Remote Site> NOTE: Volumes in the Storage Replication Group (identified by srg_name) are replicated between the local storage system (identified by local_storage_id) and the remote storage system (identified by remote_storage_id). For failoversrg.cmd: local_sms_name=<name of the Storage Management Server at the Local Site> local_sms_username=<user name for the Storage Management Server at the Local Site> local_storage_id=<unique identifier of the storage system at the Local Site> remote_sms_name=<name of the Storage Management Server at the Remote Site> remote_sms_username=<user name for the Storage Management Server at the Remote Site> remote_storage_id=<unique identifier of the storage system at the Remote Site> srg_name=<name of the Storage Replication Group to be failed over> use_non_current_data=<yes or no> NOTE: A value of yes requires the Storage Replication Group to be failed over even in cases where the data at the destination site may not be current. A value of no requires storage failover to fail if the data at the destination is not current. Example invocations of Matrix recovery management User Defined adapter implementation During Storage Management Server configuration in Matrix recovery management:
<Matrix recovery management installed directory>/STORAGE/EMC/validatesms.cmd sms_name=EMC_SMS1 sms_username=admin
Command return code Commands must return 0 on successful completion, and a nonzero error code indicates failure. Multiple User Defined storage adapters Matrix recovery management supports multiple User Defined storage adapters to co-exist in a Matrix recovery management configuration. For each User Defined storage adapter type, you can create a new subdirectory and place your implementation of the three commands for that storage type. For example, to add a storage adapter named EMC, create an EMC subdirectory under
Setting up Storage
15
<Matrix recovery management installed directory>/STORAGE and copy all three commands to the newly created directory: <Matrix recovery management installed directory>/STORAGE/EMC/validatesms.cmd <Matrix recovery management installed directory>/STORAGE/EMC/validatesrg.cmd
<Matrix recovery management installed directory>/STORAGE/EMC/failoversrg.cmd If your implementation of the storage adapter commands requires passwords to manage storage replication, the storage adapter command implementation must handle passwords securely. It is your responsibility to encrypt/decrypt passwords while saving and retrieving them.
In the case of a cluster shared volume or shared cluster disks, the replicated disk on the Remote Site Hyper-V host must be configured with the same cluster resource name that is assigned to the Local Site disk it is replicated from. When creating recovery logical servers, you must specify the datastore name for the logical server. The datastore name selected must be the same as the datastore name for the Local Site logical server. The names of recovery logical server storage entries must be the same as the names of the logical server storage entries on the Local Site. The names of recovery logical server storage pool entries must be the same as the names of the logical server storage pool entries on the Local Site.
b. 4.
Refresh Virtual Machine resources using the Logical Servers Refresh operation in the Tools menu of the Visualization tab in Matrix OE visualization.
Use Matrix OE visualization to create recovery logical servers. Specify the replicated LUNs information as part of logical server creation. When entering storage information during VM hosted logical server creation, select the datastore name of the datastore where the VM hosted logical servers were created.
NOTE: Do not activate the recovery logical servers at this time. During the Matrix recovery management configuration process, the recovery logical servers will be further configured in the configuration import process at the Remote Site and there will be an opportunity to activate and deactivate the recovery logical servers at that time. To avoid confusion, HP recommends adopting a best practice of using the same logical server name at the Remote Site as the one that was used for the associated logical server at the Local Site. For information on cross-technology logical servers (logical servers capable of being VC hosted or VM hosted), see Dynamic workload movement with CloudSystem Matrix. For sites with a large number of logical servers, partitioning logical servers into portability groups can reduce activation time during failover. HP recommends that the portability group associated with logical servers on both the Local Site and the Remote Site be limited to a subset of virtual machine hosts and virtual connect blades that are capable of hosting these logical servers. For additional information on configuring portability groups, see Logical serversMenus & screensManage portability groups in the HP Matrix OE Visualization and Logical Servers online help.
17
Matrix recovery management user interface tabs Home Information on the most recent Matrix recovery management Job is displayed at the top of the Matrix recovery management Home screen, including the Latest Job status:, the Job Id:, the Start Time: (if the job is in progress), or the End Time: (if the job has completed). This is followed by a list of the other Matrix recovery management configuration tabs with Configured or Not configured icons to indicate if the configuration tasks for each tab have been completed. Sites Configure Preferred and Secondary Sites. Activate or Deactivate Recovery Groups. Edit or delete existing Site configurations. Export or import Site configurations. Storage Management Servers Define new Storage Management Servers. View, edit, or delete Storage Management Servers. Storage Replication Groups Create new Storage Replication Groups. View configuration details for Storage Replication Groups. Copy, edit, or delete Storage Replication Groups. Recovery Groups Create new Recovery Groups. View configuration details for Recovery Groups. Import, edit or delete Recovery Groups. Enable or Disable Maintenance Mode on Recovery Groups. Jobs Monitor Job progress and view Job details. Cancel Jobs in progress. Restart failed Jobs. View logs for Jobs and Sub Jobs. Delete Job information. The Matrix recovery management online help and tooltips provide answers to questions you may have while using the GUI.
configuration process and to help ensure that the two sites have synchronized configurations, you can export the Matrix recovery management configuration information to a file at the Local Site, move that file to the Remote Site, and then import the Matrix recovery management configuration information at the Remote Site. Figure 3 Configuration steps
NOTE: Make sure all of the steps in Setting up Remote Site logical servers (page 16) have been completed, before you attempt the following Matrix recovery management configuration steps at the Local Site. 1. 2. 3. 4. From the Sites tab, configure the Local Site. From the Storage Management Servers tab, configure Storage Management Servers at the Local Site. From the Storage Replication Groups tab, configure Storage Replication Groups at the Local Site. From the Recovery Groups tab, configure Recovery Groups at the Local Site. NOTE: Matrix recovery management allows failover of the logical servers or the IO services in a Recovery Group, independent of the associated Storage Replication Groups. This capability is referred to as Storage Decoupling. 5. 6. From the Sites tab, create an export file at the Local Site. For information on export and import parameters, see Matrix recovery management export and import operations (page 20). From the Sites tab at the Remote Site, import the Local Site Matrix recovery management configuration. For more information, see Matrix recovery management export and import operations (page 20). Test the recovery logical servers. All imported Recovery Groups are in maintenance mode, allowing activation of the recovery logical servers. For more information, see Testing Recovery Groups (page 26).
Configuring Matrix recovery management 19
7.
8. 9.
Deactivate the recovery logical servers and disable Maintenance mode at the Remote Site for more information see Testing Recovery Groups (page 26) Fail back the Storage Replication Groups to the Local Site, then activate the Local Site logical servers. If there are VM hosted logical servers, use the VMware Virtual Center or the Microsoft Hyper-V Management Console to rescan and refresh virtual machine resources.
Import
The site locality of each Storage Management Server configured at the importing site must match the site locality of a Storage Management Server in the exportconfig file. The type of Storage Management Server must match if the name matches. For HP P6000, the user name and port number fields must match. For HP P9000, the user name and RAID instance number must match. For HP 3PAR storage system, the password file name must match.
If any one of the above items is not matched between the exporting site and the importing site, the import operation fails.
20
NOTE: To manage HP 3PAR remote copy, the encrypted password file for both the Local Site and Remote Site Inserv storage servers must be available on the CMS at each site, and the name of the password file must be the same on the CMS at each site. Storage Replication Group information associated with activated Recovery Groups in the exportconfig file is imported if the importing site has no Storage Replication Group configuration. If the importing site already contains Storage Replication Group information:
If the Storage Replication Group is included in another Recovery Group that belongs to the same Recovery Group Set, the import is allowed. If the Storage Replication Group is included in another Recovery Group that belongs to a different Recovery Group Set, the import fails. It does not matter whether this Recovery Group is activated or deactivated at the importing site.
All entities referenced by imported Recovery Groups are imported, including Sites, Storage Management Servers, and Storage Replication Groups. At the end of an import, only the imported Recovery Groups and their associated entities remain. Storage not attached to the activated Recovery Groups being imported is deleted. If a Recovery Group in the exportconfig file has the same name as a Recovery Group at the importing site, but it belongs to a different Recovery Group Set, the import fails. Storage Management Servers and Storage Replication Groups that are not referenced by the imported Recovery Groups are not imported. If the Recovery Groups in the exportconfig file already exist at the importing site (Recovery Groups with the same name are at the importing site), they are deleted and replaced with the imported Recovery Groups. The Storage Replication Groups referenced by Recovery Groups that are deleted are also deleted.
Single Recovery Group import If you add or modify (edit) a Recovery Group at a Matrix recovery management site, there is a single Recovery Group import feature that saves time by propagating those changes to the other site without performing a site import operation. For example, if you add or modify (edit) a Recovery Group at the Local Site, you can import the new or modified Recovery Group to establish its peer Recovery Group at the Remote Site without importing the entire Local Site configuration. The procedure at the Local Site is the same as the site export procedure. From the Sites tab at the Local Site you export the Matrix recovery management configuration to an exportconfig file and move that file to the Remote Site. From the Recovery Groups tab at the Remote Site, click import...Select import file.... The Import Wizard window appears allowing you to select a Recovery Group to import from the exportconfig file. See the Matrix recovery management online help for more information on the import procedure for single Recovery Groups.
21
NOTE: Recovery Groups can be imported one at a time only. You must repeat the import...Select import file... procedure for each Recovery Group that you import. If a Recovery Group in the exportconfig file has the same name as a Recovery Group at the importing site, it is not imported. If you are importing a Recovery Group that already exists in the Matrix recovery management configuration but has been modified, you will need to delete the Recovery Group of the same name at the importing site, before you can import the modified version of that Recovery Group. If a Recovery Group in the exportconfig file references a Storage Replication Group that is being used by another Recovery Group at the importing site, the import fails.
5.
22
NOTE: When you use Matrix recovery management to DR protect VMware ESX based IO services that have been deployed from an IO template using an ICVirt template or a VM template on the vCenter server at the Local Site, an ICVirt or VM template with same name must be available at the Remote Site before you perform a Matrix recovery management import operation to import the site configuration. If an ICVirt or VM template with the same name is not available at the Remote Site, DR protected IO services will not be imported. When you use Matrix recovery management to DR protect Microsoft Hyper-V based IO services that have been deployed from an IO template using an ICVirt template on the CMS at the Local Site, an ICVirt template with same name must be available at the Remote Site before you perform a Matrix recovery management import operation to import the site configuration. If an ICVirt template with the same name is not available at the Remote Site, DR protected IO services will not be imported.
6.
7. 8.
Add IO services to Matrix recovery management Recovery Groups. Recoverable IO services can be selected from a drop-down menu when you configure IO services Recovery Groups. For more information, see the Recovery Groups tab in the Matrix recovery management GUI. Export site configuration at Local Site. For more information, see Matrix recovery management export and import operations (page 20). Import site configuration at Remote Site. For more information, see Matrix recovery management export and import operations (page 20).
Configure IO properties
To enable DR Protection for IO services, edit the following hpio.properties and dr.properties files located in C:\Program Files\HP\Matrix infrastructure orchestration\conf directory: hpio.properties Enable DR Protection for IO services: enable.dr.protection = true Specify the datastore used for provisioning DR Protected IO services:
At the Local Site, specify the volumes to use for DR Protected IO services, for example: volume.dr.list = /vmfs/volumes/ds_1;C:\\ClusterStorage\\Volume3;C:\\ClusterStorage\\Volume4 At the Remote Site, specify the volumes to use for replica IO services, for example: volume.dr.replica.list = /vmfs/volumes/ds_1;C:\\ClusterStorage\\Volume3;C:\\ClusterStorage\\Volume4
NOTE: If one datastore is specified in volume.dr.list, the DR Protected IO services are provisioned on the datastore specified. If multiple datastores are specified in volume.dr.list, the DR Protected IO services are provisioned on the datastore in volume.dr.list that is both available for that server pool and also has the most free disk space. If multiple datastores are specified in volume.dr.list, and the IO template specifies one of the datastores in volume.dr.list, the DR Protected IO services are provisioned on the datastore specified in the IO template. Set the static.ip.replica.list property to reserve static IP addresses for replica IO services at the Remote Site. For example if the networks configured in IO at the Remote Site are subnetX and subnetY, one or more IP addresses can be reserved for replica services: static.ip.replica.list = subnetX:192.16.0.10;subnetX:192.16.0.15-192.16.0.20;subnetY:192.16.0.30-192.16.0.40
DR protection for IO services 23
dr.properties On the Local Site and Remote Site CMS specify a name identifying the associated site where IO is running in the dr.properties file, for example: local.site = siteA at the Local Site, and local.site = siteB at the Remote Site. The name in the dr.properties file can be different than the site name configured in Matrix recovery management. If the service owner domain name and username on the Local Site and Remote Site are not the same, set the owner.username.<site_a>.<owner_domain_a\\username_a> = owner.username.<site_b>.<owner_domain_b\\username_b> property to specify the service owner domain and username mapping. For example, if the service owner domain name and username configured for IO on the Local and Remote Sites are domainA\userA and domainB\userB: owner.username.siteA.domainA\\userA = owner.username.siteB.domainB\\userB If the domain names on the Local Site and Remote Site are not the same, set the owner.domain.<site_a>.<owner_domain_a> = owner.domain.<site_b>.<owner_domain_b> property to specify the domain name mapping. For example, if the domain name configured for IO on the Local and Remote Sites is domainA and domainB: owner.domain.siteA.domainA = owner.domain.siteB.domainB
For more information, see the HP Matrix Operating Environment 7.1 Infrastructure Orchestration User Guide available at http://www.hp.com/go/matrixoe/docs.
24
Network configuration
To allow the same IP addresses for primary and replica IO services, when the subnet is spanned between the Local and Remote Site: The Local Site and the Remote Site must define the same static IP range and it must contain the IP range of the primary IO service. At the site with the replica IO service, the IO administrator must specify a list or a range of IPs (IP exclusion list) in the hpio.properties file to avoid IP address conflicts. These IP addresses will be reserved for replica IO services only and cannot be used by any other IO services.
If the subnet is not spanned between the Local and Remote site, the primary and replica IO services can be assigned IP addresses specific to each site. NOTE: Matrix recovery management does not perform DNS updates or update the IP configuration of logical servers associated with IO services during a failover operation. Your Network Administrator is responsible for making the necessary modifications to ensure that network services are available if you configure a logical server to use a different IP address or subnet at each site in the Matrix recovery management configuration.
25
3.
4.
in the Matrix infrastructure service to activate the IO services in the Recovery group. When the test is complete, gracefully shut down the operating systems, then deactivate the logical servers or IO services. NOTE: If the Matrix recovery management configuration includes Hyper-V logical servers or IO services, bring the cluster disk resource used for logical server or IO service storage offline. 5. 6. At the Remote Site, take the Recovery Group out of Maintenance Mode by using the Disable Maintenance Mode button in the Matrix recovery management Recovery Groups tab. At the Local Site, repeat the storage failover, rescan and refresh (if needed) sequences, then activate the logical servers in the Recovery Group using Matrix OE visualization.
When you import a Matrix recovery management configuration at the Remote Site, all Recovery Groups that are imported have Maintenance Mode enabled by default, so they can be tested. Similarly, a Recovery Group created at the Remote Site has Maintenance Mode enabled by default. After testing is completed for all imported Recovery Groups at the Remote Site, deactivate the recovery logical servers or replica IO services belonging to each Recovery Group , then disable Maintenance Mode on each Recovery Group. Recovery Groups that have Maintenance Mode enabled are not included in an Activate operation. HP recommends that you carefully test Recovery Group failover at the Remote Site prior to disabling Maintenance Mode on imported Recovery Groups.
Failover operations
This section explains the difference between planned and unplanned failovers, and provides the procedure to follow in either case.
Planned failover
A planned failover typically involves an expected outage. For example, a planned failover might be necessary to perform scheduled maintenance, or to react to a severe weather forecast. The following section describes the steps necessary to perform a planned failover using Matrix recovery management. A planned failover includes a series of steps performed at both the Local site and the Remote site. A planned failover includes a series of steps performed first at the site you plan to shut down, and then at the other site in the Matrix recovery management configuration. At the site you want to shut down: 1. Shut down the applications and operating system on each Matrix recovery management DR Protected logical server and each server associated with DR Protected IO services. 2. Click on the Deactivate... button and the Deactivate Recovery Groups at the Local Site window will appear. For more information about the Recovery Groups contained in a Recovery Group Set, select the Recovery Group Set and click View Recovery Group. A window will appear displaying the following parameters for each Recovery Group in the Recovery Group Set: Name Status Type Preferred Site Secondary Site Storage Replication Group Name
Failover operations
27
3. 4.
Click the check-box on the left side of the banner of the Deactivate Recovery Groups at the Local Site window to select all of the Recovery Group Sets at the Local Site for deactivation. Click Deactivate Recovery Groups to start the deactivation operation. A window will appear asking if it is OK to proceed. Click OK and you will be directed to the Jobs tab where you can monitor the progress of the deactivation Job. NOTE: If the Matrix recovery management configuration includes Hyper-V logical servers or IO services, bring the cluster disk resource used for logical server or IO service storage offline.
At the site you want to fail over to: 1. Ensure that enough resources are available to run the logical servers that will be activated at this site. 2. Click on the Activate... button and the Activate Recovery Groups at the Local Site window will appear. For more information about the Recovery Groups contained in a Recovery Group Set, select the Recovery Group Set and click View Recovery Group. A window will appear displaying the following parameters for each Recovery Group in the Recovery Group Set: 3. Name Status Type Preferred Site Secondary Site Storage Replication Group Name Start Order Power-Up Delay
4.
Select each Recovery Group Set that you want to activate or click the check-box on the left side of the banner of the Activate Recovery Groups at the Local Site window to select all of the Recovery Group Sets for activation. Click Activate Recovery Groups to start the activation operation. A window will appear asking if it is OK to proceed. Click OK and you will be directed to the Jobs tab where you can monitor the progress of the activation Job.
NOTE: A successful Activate or Deactivate operation ensures that all of the Recovery Groups within a Recovery Group Set are in the same state (activated or deactivated). However, certain operations (for example, a Recovery Group edit to change site preference) may result in some Recovery Groups within a Recovery Group Set being activated and others being deactivated. You must run an Activate or Deactivate operation on a Recovery Group Set to ensure that all of the Recovery Groups in that Recovery Group Set are in the same state ( all activated or all deactivated).
Unplanned failover
An unplanned failover typically involves the occurrence of a site-wide event, without prior warning. This event may be a regional disaster (earthquake, massive flood), or a local problem (power loss or water main leak in the data center). An unplanned failover includes a series of steps performed at both sites in the Matrix recovery management configuration. The following procedure assumes that the CMS and managed resources running DR Protected logical servers survived an unplanned local event (for example, a power
28 Testing and failover operations
loss). If the event is more severe, resulting in the permanent loss of the CMS or managed resources, reconstruction of the site may be necessary. At the site where the site-wide event occurred: 1. Ensure that the DR Protected logical servers are no longer running in order to prevent a split-brain situation. As long as the DR Protected logical servers have stopped running, Matrix recovery management will prevent them from automatically powering up when power is restored. Matrix recovery management is able to prevent split-brain from occurring during an unplanned failover by regulating the auto-power configuration of managed nodes (whether virtual or physical) that are assigned to DR Protected logical servers so they do not automatically power-up after an outage. If, for example, a site loses power and site failover is invoked, the site where the power outage occurred will not resume running the DR Protected logical servers when power is restored. The managed nodes (whether VC blades or virtual machines) assigned to DR Protected logical servers stay powered down (and resources remain unassigned) until an Activate operation is invoked. At the failover destination (recovery) site: 1. Ensure that enough resources are available to run the recovery logical servers. 2. From the Matrix recovery management Sites tab, click the Activate... button and the Activate Recovery Groups at the Local Site window will appear. For more information about the Recovery Groups contained in a Recovery Group Set, select the Recovery Group Set and click View Recovery Group. A window will appear displaying the following parameters for each Recovery Group in the Recovery Group Set including: 3. Name Status Type Preferred Site Secondary Site Storage Replication Group Name Start Order Power-Up Delay
4.
Select each Recovery Group Set that you want to activate at the recovery site. The objective is for all Recovery Group Sets that were previously activated at the site where the site-wide event occurred to be activated at the recovery site. Click Activate Recovery Groups to start the activation operation. A window will appear asking if it is OK to proceed. Click OK and you will be directed to the Jobs tab where you can monitor the progress of the activation Job.
Failover operations
29
NOTE: If the Matrix recovery management configuration has been changed since the failover occurred (for example, a new Recovery Group was created), the sites must be brought into sync by making appropriate configuration changes. The Matrix recovery management Site configuration export and import operations can be used for this purpose. A successful Activate or Deactivate operation ensures that all of the Recovery Groups within a Recovery Group Set are in the same state (enabled or disabled). However, certain operations (for example, a Recovery Group edit to change site preference) may result in some Recovery Groups within a Recovery Group Set being enabled and others being disabled. You must run an Activate or Deactivate operation on a Recovery Group Set to ensure that all Recovery Groups in that Recovery Group Set are in the same state (enabled or disabled).
30
The fluid, two-way movement of workloads across dissimilar servers described in this chapter is different from the movement enabled by traditional migration tools. Those tools are oriented towards enabling a one-way, permanent or semi-permanent migration, between physical and virtual or between dissimilar physical servers. The movement typically requires manual intervention and a relatively long period of time to complete. The importance of the ability to fluidly move a workload from a physical server to a virtual machine and back can be understood from the following examples: You want to move your online workload running on a physical server during daily off-peak hours to a virtual machine host, to free up the physical server to run a batch workload. When the off-peak period is over, the batch workload is retired and the online workload is moved back to its original execution environment. During the time the online workload is parked on a virtual machine host, it has minimal resource requirements; hence it has minimal impact on other workloads that may be running on that host. Because this pattern repeats daily, the physical to virtual and virtual to physical moves must be achieved quickly (in minutes, rather than hours) and automatically. You have two data centers located at two different sites. The production workloads run on physical servers at one site and are configured to be failed over to the other (recovery) site, in case of a disaster. The recovery site is equipped with a set of servers configured as virtual machine hosts. In this use case, planned or unplanned failovers require physical to virtual and virtual to physical moves across sites. The configuration of the recovery site as a set of virtual machine hosts may be driven by the needs of test and development activities that are carried out at that site. Or, it may be driven by a need to reduce the cost of disaster recovery by running the workloads on virtual machines hosted by a smaller set of servers. Recovery time objectives require that the moves be achieved quickly and automatically, as in the previous example.
31
32
The target WWN values used to present the Logical Unit must be the same across virtual and physical targets. NOTE: The recovery logical server that provides DR protection at the Remote Site has its own set of target WWN/LUN values that differ from the target WWN/LUN values for the logical server at the Local Site.
The network name used by an ESX Host must match the network name used in the Virtual Connect Enterprise Manager configuration, as displayed in Figure 5 (page 34) and Figure 6 (page 34).
33
When moving a logical server between physical and virtual servers within a site, the following server IDs are not preserved:
Network MAC addresses Server/Initiator WWNs (On a virtual machine, the storage adapter is a virtual SCSI controller.) Logical Serial Number Logical UUID
In a DR configuration, the site that you configure first must have both physical servers and virtual machine hosts available so a logical server can be configured to run and be tested on
34
both types of servers. The recovery site can have a physical/virtual combination also, or have only virtual machine hosts.
Supported platforms
The procedures for enabling movement between physical and virtual servers described in this chapter apply to physical servers, hypervisors, and workload operating systems supported by Matrix recovery management. For more information, see the HP Insight Management 7.1 Support Matrix at http://www.hp.com/go/matrixoe/docs. For supported hypervisors, see VMware Hypervisor versions specified as supported on managed systems by the Matrix Operating Environment. For supported workload operating systems, see Windows 2008 versions specified as supported on managed systems by the HP Matrix Operating Environment.
The procedures for enabling movement between physical and virtual servers are not supported on Integrity managed nodes. The procedures for enabling movement across different physical servers documented in this chapter are supported for managed systems specified as supported by the HP Matrix Operating Environment, with the following restriction: Matrix recovery management, the component of the HP Matrix Operating Environment that provides disaster recovery across sites, does not support Integrity managed nodes.
A physical server target configured for cross-technology movements must be an HP c-Class blade with HP Virtual Connect.
Configuring logical servers for movement between physical and virtual targets
1. Prepare a logical server with a portable image. Start with a logical server configured to run on a physical server, and prepare its system image for movement between physical and virtual servers. a. Storage configuration The Portable Images Storage Assistant (PISA) tool prepares the storage configuration of the server image so it can be booted in both physical and virtual environments. PISA is part of the HP Insight Control server migration product on the HP Insight Management DVD. The executable and README are in the <SMP>\PI\PISA folder, where <SMP> is the directory where Insight Control server migration is installed (the default install directory is C:\Program Files\HP\Insight Control server migration). i. Copy the executable hppisa.exe (under PI\PISA) to the physical server where the image is currently running. ii. In the command-line window, type: > hppisa e For more information, see Portable Images Storage Assistant (PISA) (page 38). b. Network configuration Portable Images Network Tool (PINT) prepares the image to execute on targets with different network interface configurations and MAC addresses. It ensures that the static network configuration from the source server is successfully transferred to destination server network interfaces despite the differing environment. The executables and README are in the folder <SMP>\PI\PINT.
35
i. ii. 2.
Copy the executable cp011231.exe to the physical server where the image is currently running. Run cp011231.exe to install PINT and start the PINT service.
For more information, see Configuring and managing portable OS images (page 38). Create a portability group that includes all potential physical and VM host targets. This step sets up the portability group that defines the list of potential targets for the logical server. The group should include both physical servers and VM hosts as targets. See Figure 7 (page 36). Figure 7 Creating a portability group
For more information, see Portability groups (page 39). 3. Configure the logical server for activation on both physical and VM host targets. Modify the logical server configuration as follows: In the Create logical server: identity screen, set the portability group of the logical server to the portability group created in Step 2. In the Create logical server: storage screen, select a VM datastore (this datastore will be used to store VM configuration information). For more information, see Defining cross-technology logical servers (page 41). 4. Present the boot/data storage to the VM hosts in the portability group using the same LUN and WWN values. For more information, see Storage definition (page 43).
Perform physical to virtual and virtual to physical movements to verify OS configuration. Move the logical server from physical to virtual and back again. After each move, ensure that the logical server has booted successfully and the static network configurations have been applied as expected to the network interfaces present in the new environment. For more information, see Moving between technologies (page 44).
36
NOTE: When the logical server is first moved to a virtual machine, you may want to add additional tools to the server, for example, VMware tools. In the HP Matrix Operating Environment, the VM configuration created does not include a virtual CD/DVD drive. You can use the VM management console to modify the VM configuration to include a virtual CD/DVD drive. 5. Configure inter-site movement between physical and virtual targets (disaster recovery use case). In this step, Matrix recovery management capabilities are used to set up inter-site movement between physical and virtual targets. a. At the Local Site, create an array replication group containing the LUNs used by the logical server for boot/data and the LUN containing the VM datastore as illustrated in Figure 8 (page 37). The VM datastore should be used exclusively to store the VM configuration of the logical server. Figure 8 Creating an array replication group
b. c. d. e. f.
g. h.
i.
At the Local Site, create a Recovery Group containing the logical server. Export the Matrix recovery management configuration to a file. Deactivate the logical server at the Local Site and failover the array Replication Group to the Remote Site. Perform VM host rescan and Matrix OE visualization refresh procedures to ensure that the VM configuration datastore is accessible to the logical server configuration. Create a portability group at the Local Site. The portability group can contain physical servers, VMs, or a combination of both. Create the recovery logical server with the same configuration values used at the Local Site, but adjusted to point to the remote storage LUN/WWN values. Do not activate the logical server at this time. Using the exported configuration from the Local Site, import the Recovery Group that was created at the Local Site to contain the recovery logical server. The Recovery Group is in Maintenance Mode. Activate the logical server on a VM host. If one or more physical servers are available in the portability group, perform virtual to physical and physical to virtual movements. At each stage, check for successful boot and confirm correct network configuration, as in step 4. Deactivate the logical server and clear Maintenance Mode. Fail the array replication group back to the Local Site. At the Local Site, perform rescan and refresh procedures, and activate the logical server.
Overview of physical to virtual cross-technology configuration 37
NOTE: The Matrix recovery management Site configuration can be set up to preferentially activate the logical server on a physical server at one site and a VM host at the other site. For more information, see Setting a failover target type preference (page 46).
The HP Matrix Operating Environment supports mechanisms to prepare a server image (logical server) so it continues to work when moved to a different physical or virtual server. The following areas are key to achieving this objective: Driver installation HBA configuration
NIC configuration The tools developed for this purpose are as follows: Portable Images Storage Assistant (PISA) Portable Images Network Tool (PINT)
38
PISA is a simple command-line tool that accepts only a few command-line options. It needs to be executed only once after Windows has been installed on a physical server. The changes it makes are persistent and do not need to be repeated or reversed. However, repeatedly running PISA has no negative impact. PISA can also be used to disable the driver used by the virtual machine. The command-line interface for PISA is described below. The options are mutually exclusive. PISA runs on supported versions of Windows only, and it requires that the user be a member of the Administrator user group.
Usage: hppisa -h, -?, -help Show this information -e, -enable Enable the LSI driver -d, -disable Disable the LSI driver
After these changes are made, the OS image can be moved back and forth between physical servers and virtual machines.
Portability groups
When you create a logical server, you must specify the portability of the logical server. You do this by selecting a portability group from within the identity page of the logical server configuration wizard. After a logical server is associated with a particular portability group, it can be moved to any target system (HP Virtual Connect physical server or virtual machine hypervisor) within that portability group. Logical server resource constraints, such as CPU/memory requirements and network/SAN connectivity, are evaluated solely within the context of the portability group that the logical server is associated with. Portability groups come in two classes: Default and User Defined.
39
The HP Matrix Operating Environment provides default portability groups depending on the resources found within your data center. The Default portability groups include: ESXAll ESX Hypervisors HYPERVAll Hyper-V Hypervisors Each Virtual Connect Domain GroupEach VCDG has its own Default portability group.
You can also create User Defined portability groups that extend the portability of a logical server to unlike technologies. For example, moving logical servers between a Virtual Connect physical server and a VMware ESX virtual machine host. User Defined portability groups are defined by selecting ModifyLogical Server Portability Groups in Matrix OE visualization as illustrated in Figure 9 (page 40). Figure 9 Modifying logical server portability groups
If you have selected one or more targets in Matrix OE visualization, they are presented as potential targets. Otherwise, all resources are presented. Figure 10 (page 41) provides an example of the Modify Portability Group screen.
40
Provide a name and optional description for the portability group. The name will be used for defining logical servers. The set of Group Types is selected automatically based on the targets inserted into the portability group. Valid combinations of targets include: A single Virtual Connect Domain Group (VCDG) A set of ESX Hypervisors A set of Hyper-V Hypervisors A set consisting of a single VCDG plus a set of ESX Hypervisors.
41
To view the portability group for any logical server, click the View movable logical server details icon in Matrix OE visualization as illustrated in Figure 12 (page 42). Figure 12 View movable logical server details icon
The details for this logical server are displayed as illustrated in Figure 13 (page 42). Figure 13 View logical server details
42
Logical servers can be made portable through techniques described in Portability groups (page 39). NOTE: You must determine whether the provisioned operating system within a logical server performs as desired on a variety of platforms. If a logical server has never been active on a platform type, the HP Matrix Operating Environment shows a warning for each target of that type in the Target Selection page during moves and activations. You must determine whether the target is valid.
Storage definition
Storage can be defined through Storage Pool Entries or Storage Entries tied directly to a logical server. The storage for cross-technology logical servers must be SAN-based. This approach uses the normal SAN-boot approach within Virtual Connect and leverages ESX RDM technology which presents boot and data LUNs directly to the virtual machine. For examples, see Figure 14 (page 43) and Figure 15 (page 43). Figure 14 Storage Pool Entries
43
When defining storage for a portable logical server, you must select SAN Storage Entry. For flexibility and movement between underlying technology types, storage must be presented to the WWNs tied to the Virtual Connect server profile, and storage must also be presented to any ESX VM hosts that are potential targets for the logical server. Storage Pool Entries must be created within the same portability group associated with the logical servers that will use the storage as illustrated in Figure 16 (page 44). Figure 16 Managing Storage Pools
44
Targets for a logical server are selected from that logical server's portability group. The portability group members are then further filtered based on resource availability, including CPU and memory resources as well as network and SAN connectivity. NOTE: Networks in Virtual Connect must be named identically to their corresponding networks (port groups) on ESX Hypervisors. Differences in names prevent the Unlike Move operation from identifying networks with similar connectivity.
Target attributes
You can track where a logical server has been successfully activated or moved by using logical server target attributes. Target attributes provide a greater number of most suitable targets where you can activate or move a logical server. To view or modify target attributes on a logical server, select the logical server and then click ModifyLogical Server Target AttributesManage as illustrated in Figure 18 (page 45). Figure 18 Modifying logical server target attributes
Types of targets can be selected and added or removed from the logical server's target attributes. Selecting a server from the list below and clicking Add adds that type of server with associated resources to the list of most suitable targets for the logical server. Selecting a type of server from the Target Attributes Available to Remove list and clicking Remove causes the type of server specified to no longer be listed as most suitable. Figure 19 (page 45) illustrates the screen where you can manage logical server target attributes. Figure 19 Managing logical server target attributes
45
46
You must specify the target type preferred for all sites on the CMS at each site: If you specify Virtual as the target type preferred for a site, all cross-technology logical servers whose Recovery Groups prefer that site are activated on VM hosts during an Activate operation at that site. A physical server is chosen only if no VM hosts are available. If you specify Physical as the target type preferred for a site, all cross-technology logical servers whose Recovery Groups prefer that site are activated on physical servers during an Activate operation at that site. A VM host is chosen only if no physical servers are available.
By specifying appropriate target preferences for sites, you can set up a configuration where DR Protected cross-technology logical servers run on physical servers at one site and on VM hosts at the other site. For example, you might choose to have DR Protected cross-technology logical servers run on physical servers at the preferred site, but run on VM hosts when they failover to the secondary site.
47
Limitations
Only IO services that include virtual servers and on-premise (not cloud) resources are supported. A Recovery Group can contain logical servers only, or IO services only, but not a mix of logical servers and IO services. Replica IO services cannot be flexed (add disk, add server, remove server). Replica IO services cannot be imported to create primary services in a Matrix recovery management configuration.
Minor issues
Firefox browser cannot be used for site export operations
Firefox browser cannot be used to perform Matrix recovery management site configuration export operations, due to a bug in Adobe Flash Player 10.3. For more information, see Bug 2980517 on the Adobe web site. Suggested actions Use Internet Explorer 8 or later to perform Matrix recovery management site configuration export operations.
48 Issues, limitations, and suggested actions
ESX configuration setting required for VMFS datastores of Matrix recovery management managed logical servers to be visible at Remote Site
Under the following conditions, Matrix recovery management requires a specific ESX configuration setting to retain the signature of a VMFS datastore so it will be visible at the Remote Site: You have asymmetric HP P6000 Continuous Access Software array models at the Local and Remote Site. You are using HP P9000 Continuous Access Software storage arrays at the Local and Remote Site. You are using HP 3PAR storage systems. For ESX3.X hosts, set Lvm.DisallowSnapshotLun to 0 using Virtual CenterConfigurationAdvanced Settings. For ESX4.X hosts, to mount the Local Site datastore with an existing signature, see Mount a VMFS Datastore with an Existing Signature in the ESX Configuration Guide Update 1, ESX 4.0, vCenter Server 4.0 available at http://www.vmware.com/pdf/vsphere4/r40_u1/ vsp_40_u1_esx_server_config.pdf. For additional information, see the VMware Knowledge Base at http://kb.vmware.com/kb/1015986.
Suggested actions
recommends as a best practice that you keep LUN numbers the same for corresponding disks across sites. Suggested actions Assess the impact of these discrepancies on any licensing arrangements in use for the operating system and applications running on DR Protected logical servers.
One RAID Manager instance per HP P9000 Storage Management Server and One RAID Manager instance per HP P9000 device group
Each HP P9000 Continuous Access Software Storage Management Server configured in Matrix recovery management has only one HP P9000 RAID Manager Software instance managing the HP P9000 device groups. Each HP P9000 Continuous Access Software Storage Replication Group configured in Matrix recovery management is managed by only one HP P9000 RAID Manager Software instance at each site. Suggested actions: There is no workaround for this issue.
One active Matrix recovery management configuration operation at any point in time
If multiple users attempt to run configuration operations in Matrix recovery management, only one operation succeeds. All other configuration operations receive an error message indicating that some other configuration operation is in progress. Suggested actions There is no workaround for this issue.
Site delete operation in Matrix recovery management does not remove HP SIM tools
If HP P9000 Continuous Access Storage Management Servers are configured and if a site delete operation is performed, the HP SIM tools used to manage HP P9000 storage replication on the local Storage Management Server are not deleted. Suggested actions The mxtool command can be run manually to remove the HP SIM tools. The names of the tools are Insight Recovery_Failover and Insight Recovery_Group validation. For example, if the name of the Storage Management Server is stgmgmtA.cup.hp.com, the names of the tools are STGMGMTA_CUP_HP_COM_Insight Recovery_Failover and STGMGMTA_CUP_HP_COM_Insight Recovery_Group validation.
50
6 Troubleshooting
This chapter provides troubleshooting information in the following categories: Configuration troubleshooting (page 51) Configuration error messages (page 53) Warning messages (page 56) Matrix recovery management Job troubleshooting (page 57) Failover error messages (page 60) Matrix recovery management log files (page 61) DR Protected IO serivces troubleshooting (page 61)
Configuration troubleshooting
To troubleshoot Matrix recovery management configuration operations, take note of any error messages that appear, then review this section for relevant information. You can also view the mxdomainmgr log files for additional information. The following troubleshooting issues are addressed in this section: Unable to add or edit Site information Possible causes include:
The local or remote CMS name provided is not valid (not a fully qualified domain name or locatable in the DNS). The local or remote CMS name does not include a fully qualified name associated with the local host.
Unable to add or edit Storage Management Server information Possible causes include:
The Storage Management Server is not discovered in the HP Matrix Operating Environment user interface. The credentials associated with the Storage Management Server in the HP Matrix Operating Environment user interface do not include the user name provided as part of a Storage Management Server configuration that has been added or edited.
The HP Matrix Operating Environment user interface is unable to communicate with the Storage Management Server. Communication Failure with Storage Management Servers may result if Open SSH is not being installed or configured on the target system. Unable to add or modify HP P6000 Storage Management Server Possible causes include:
The CIMOM Server is not running on the Storage Management Server. The CIMOM server is configured to use a different port than the one specified for add or edit operations. The specified user does not have valid login credentials on the Storage Management Server with appropriate privileges.
Configuration troubleshooting
51
Unable to add or edit HP P6000 Storage Replication Group Possible causes include:
Matrix recovery management is unable to obtain Storage Replication Group information from Command View servers to validate the Storage Replication Group information provided by the user.
Unable to add or edit HP P9000 Storage Replication Group Possible causes include:
The Storage Replication Group is not configured to be managed by the RAID manager instances. The RAID manager service is not running on the local Storage Management Server. The parameters entered for the Storage Replication Group (local array, remote array, serial number, and replication mode) do not match the Storage Replication Group information on the disk array.
Unable to add or modify HP 3PAR Storage Management Server Possible causes include:
The password file for the HP 3PAR storage system is missing from the /storage/3par/ conf directory where Matrix recovery management is installed. The password for the HP 3PAR storage system has been changed since the password file was created.
Unable to add or edit HP 3PAR Storage Replication Group Possible causes include:
The Storage Replication Group is not configured to be managed by HP 3PAR remote copy. The following information entered for the Storage Replication Group cannot be validated with the HP 3PAR storage system configuration: Volume group name on the local HP 3PAR storage system Volume group name on the remote HP 3PAR storage system Local array remote copy group name Remote array remote copy group name Local HP 3PAR storage system serial number Remote HP 3PAR storage system serial number Replication mode
Matrix recovery management logical server configuration is inconsistent with Matrix OE logical server management logical server configuration Possible causes include:
The HP Logical Server Automation service was not running when Recovery Groups were configured in Matrix recovery management. The logical server was being actively managed when the Matrix recovery management configuration was invoked (logical server modification, logical server activation, or logical server deactivation was in progress).
52
Troubleshooting
An Activate, Deactivate, or Import operation is in progress. Another configuration operation may be in progress
Unable to import Storage Management Servers as part of an import operation Possible causes include:
The Storage Management Server was not discovered in the HP Matrix Operating Environment user interface. The credentials associated with the Storage Management Server do not include the user name specified in the Matrix recovery management configuration at the Local Site. The HP Matrix Operating Environment user interface is unable to communicate with the Storage Management Server specified in the Matrix recovery management configuration at the Local Site.
The import file is not valid. The HP Logical Server Automation service is not running. There were recovery logical servers in an active state at the time of the import.
Cause Action
Error: Matrix recovery management is quiesced. No configuration changes are allowed at this time. The user attempts to change the configuration of Matrix recovery management when the system has been quiesced. Wait for Matrix recovery management to be unquieced and retry the configuration change.
You are not authorized to access this page. The user is not authorized to use Matrix recovery management , or the local CMS hostname cannot be resolved by the DNS. Check the HP SIM and Matrix recovery management log files for details, then contact the network administrator to add the local CMS hostname to the DNS.
53
Cannot verify the host name specified. The hostname specified for the CMS for either the Local Site or the Remote Site is not locatable in the DNS. Verify that a valid DNS entry with a fully qualified domain name exists for each CMS.
Cannot create/edit the site information. The hostname specified for the CMS does not include a fully qualified domain name associated with the local CMS. Ensure that the CMS for the Local Site has the fully qualified domain of the local host.
Storage Management Server not discovered by CMS. Discover server and retry. Each Storage Management Server to be configured in Matrix recovery management must be discovered in the HP Matrix Operating Environment user interface with appropriate credentials specified. Ensure that the Storage Management Server is discovered in the HP Matrix Operating Environment user interface. If not, discover the server by using OptionsDiscovery in the HP Matrix Operating Environment user interface.
Action
Error: Invalid storage manager username and/or domain name. The login credentials for the server stored in the HP Matrix Operating Environment user interface do not include the user name specified as part of the Storage Management Server configuration operation in Matrix recovery management. For the server specified, ensure that the login credentials stored in the HP Matrix Operating Environment user interface include the credentials for the user name specified.
Action
Error: Failed to add/modify EVA Storage Management Server. Credential for Storage Management Server does not exist. Check input and retry. The hostname, port number, and user name specified cannot be validated. Ensure that the server identified by the hostname is an HP P6000 Command view server. Ensure that CIMOM on the Command View server is configured to use the port number specified and that the user name specified is a valid user on that server.
Unable to add/edit Storage Management Server The server is not properly configured to be managed by the HP Matrix Operating Environment user interface. Ensure that Open SSH is installed and configured on the server, and ensure that the managed nodes and CMS are trusted. This can be achieved by running ConfigureConfigure or Repair Agents in the HP Matrix Operating Environment user interface.
Unable to add/edit EVA Storage Replication Group Unable to obtain information about the Storage Replication Group from the HP P6000 Command View Server. Confirm that the Storage Replication Group exists on the arrays that were listed when the HP P6000 Storage Replication Group configuration was added or edited in Matrix recovery management. Ensure that the HP P6000 arrays are managed by HP P6000 Command View servers. 1. Confirm that the SIM-S component of HP P6000 Command View has been installed on the HP P6000 Command View server. For information on installing SMI-S, see the HP P6000 Command View Software Installation Guide available at http://h20000.www2.hp.com. Click on Manuals,
54
Troubleshooting
then go to StorageStorage SoftwareStorage Device Management SoftwareHP P6000 Command View Software. 2. Confirm that the port number specified during Storage Management Server configuration in Matrix recovery management is the same as the WBEM port number configured on the HP P6000 Command View server (for example, 5989). For more information, see the CIMOM server configuration section in the HP P6000 Command View Software Installation Guide. 3. Refresh the HP P6000 Command View server CIM database by running the refresh command. For details on the Discover command, see the Configuring HP SMI-S EVA to Discover HP Command View EVA Arrays section in the HP P6000 Command View Software Installation Guide. 4. Ensure that the HP P6000 Command View Software service and CIMOM service are running. Start or restart these services if necessary.
Unable to add/edit XP Storage Replication Group Matrix recovery management is unable to obtain information about the Storage Replication Group from the RAID manager, or the information obtained does not match the information provided when the HP P9000 Storage Replication Group configuration was added or edited in Matrix recovery management. Ensure that the RAID manager instance on the local Storage Management Server is running. From the install directory of the RAID Manager, run pairdisplay g <group name> (where <group name> is the name of the Storage Replication Group being added or edited) to see if the Storage Replication Group is configured and the local array serial number, remote array serial number, and storage replication type matches the data provided when the HP P9000 Storage Replication Group configuration was added or edited in Matrix recovery management.
Action
Unable to add CLX credentials for this HP 3PAR storage system. The encrypted password file for the corresponding HP 3PAR storage system is incorrect or the password for the storage system has been changed and cannot be authenticated using the existing password file. Ensure the correct password file is present in the /storage/3par/conf directory where Matrix recovery management is installed for both the local and remote HP 3PAR storage systems. If necessary, regenerate the password files. Retry the Storage Management Server Add operation.
Action
Error: Unable to locate CLX/3PAR install path. Check if CLX/3PAR is installed on the system. The required software HP 3PAR Cluster Extension Software is not installed, or the CLI version does not match the one in the hp_ir.properties file. Install only the CLI component of the HP 3PAR Cluster Extension Software by using the custom install option, or update the INFORM_CLI_VERSION property in conf\hp_ir.properties where Matrix recovery management is installed on the CMS.
Unable to validate this 3PAR Storage Replication Group. The parameters entered cannot be validated against those configured on the HP 3PAR storage system. Verify and correct the parameters (including volume group name on the local HP 3PAR storage system, volume group name on the remote HP 3PAR storage system, local array remote copy group name, remote array remote copy group name, local and remote HP 3PAR storage system serial numbers, and replication mode) and retry the operation.
55
Error message
Unable to run Matrix recovery management operations because Matrix recovery management Job is in progress or another Matrix recovery management configuration operation is in progress. If an Activate or Deactivate operation is in progress, no configuration operation is allowed, because the Job is in progress. If a Matrix recovery management configuration operation is in progress, no other Matrix recovery management configuration operations are allowed. Ensure that the failover process is not taking longer than usual and that the backend job process hp_lsdt_automation.exe is running. Wait for the Activate or Deactivate operation to finish.
Cause
Action
Unable to import Storage Management Servers. A Storage Management Server is not properly configured in the HP Matrix Operating Environment user interface. Ensure that the Storage Management Server is discovered in the HP Matrix Operating Environment user interface. Ensure that Open SSH is installed and configured on the Storage Management Server. If not, run ConfigureConfigure or Repair Agents from the HP Matrix Operating Environment user interface. Ensure that the credentials for the user are also the login credentials for the server in the HP Matrix Operating Environment user interface.
Import Failed. Possible causes include an invalid import file, HP Logical Server Automation service is not running, or one or more recovery logical servers are in an active state. Ensure that a valid file exported from the Local Site is used to import the Matrix recovery management configuration at the Remote Site. Confirm that the HP Logical Server Automation service is running on the CMS. Ensure that all recovery logical servers to be managed by Matrix recovery management are in a deactivated state at the time of the Matrix recovery management configuration import.
Import succeeded but not all storage managers have been fully configured. Check Matrix recovery management log files for details. Credentials were not configured for one or more remote Storage Management Servers. 1. Look at the mxdomainmgr.*.log file to find the remote Storage Management Servers with credentials that were not configured. 2. Make sure that the Storage Management Servers are up and healthy. 3. Discover the Storage Management Servers on the CMS. 4. Click on the Matrix recovery management user interface Storage Management Servers tab. 5. Select the Storage Management Server that you want to configure, and click Edit. 6. Select the Refresh SIM Password box and click Save.
Warning messages
This section lists Matrix recovery management warning messages.
Warning message Cause Action Warning: Matrix recovery management is being quiesced. Currently running operations will be allowed to complete. No new operations can be started. Matrix recovery management is being quiesced. All configuration buttons (Create, Edit, Delete, etc) are disabled. Wait for Matrix recovery management to be unquiesced.
56
Troubleshooting
Warning: Matrix recovery management is quiesced. No new operations will be allowed. Matrix recovery management has been quiesced. All configuration buttons (Create, Edit, Delete, etc) are disabled. Wait for Matrix recovery management to be unquiesced.
Warning message
Warning: Unable to remove CLX credentials for <storage_management _server_name> (these server credentials may not exist in CLX). Matrix recovery management will continue to delete <storage_management _server_name>. HP recommends that you check and manually remove the CLX credentials. Failure to clean a CLX credential after successfully deleting a Storage Management Server. To check and clean a CLX credential, open a command prompt window on the CMS: Change to the CLX bin directory. For example, cd c:\Program Files\Hewlett-Packard\Cluster Extension 3PAR\bin Execute CLX3PARCONFIG array /remove name=<storage_management_server_name> Check the command output message.
Cause Action
To identify the subsystem where the failure occurred, review the remainder of this section for information on which log files you need to look at and what action you need to take. For additional information, you can view the lsdt.log file in the logs directory where Matrix recovery management is installed on the system. A Job in Matrix recovery management is an automated, multistep process that activates or deactivates selected Recovery Group Sets on a site, imports a specific Recovery Group, or imports a Site configuration. For example, the Job with a Job Id of 3288 shown in Figure 21 (page 58) is
Matrix recovery management Job troubleshooting 57
an Activate Job. It has an Entity of type site and an Operation of type activate. You will also notice the Failed icon in the Status column indicating that Job 3288 has failed. Figure 21 Jobs screen
For a failed Job, click the check box next to the Job Id to get detailed information about the associated Sub Jobs. A site Job contains a Sub Job for each Recovery Group. Similarly, each Recovery Group has Sub Jobs for its Storage Replication Group and logical server, respectively. To troubleshoot a site Job and identify the source of the error, drill down to each of the associated failed Sub Jobs to determine what operation failed and the reason for the failure, as shown in Figure 22 (page 58). Figure 22 Expanded Jobs screeen
For additional information on troubleshooting, see the HP Matrix Operating Environment 7.1 Logical Server Management User Guide and the HP Matrix Operating Environment 7.1 Infrastructure Orchestration User Guide available at http://www.hp.com/go/matrixoe/docs. After the problem has been corrected, from the Jobs screen you can restart the Job by clicking restart in the Status column for the failed Job, as illustrated in Figure 23 (page 59).
58
Troubleshooting
NOTE: Restarting the Job retries only Sub Jobs that previously failed; servers associated with completed Jobs or Sub Jobs are not impacted. IMPORTANT: If correcting the problem that caused the Job to fail included reconfiguration of logical servers, before you restart the Job, go to the Recovery Groups tab and delete the Recovery Groups that contain the reconfigured logical servers. Recovery Groups that have been deleted due to the reconfiguration of logical servers can be recreated with the reconfigured logical servers after the Job is successfully completed. The following troubleshooting issues are addressed in this section: Failover job failed because storage failover failed. Possible causes include:
Storage Management Servers were not available at the time of the failover.
Failover job succeeded but recovery logical servers are not activated. Possible causes include:
Recovery Groups that contain logical servers that are in Maintenance Mode at the Remote Site.
Failover job failed because there are no sufficient licensed physical servers or virtual machines to host the logical servers. Possible causes include:
Physical servers are running other workloads. Hypervisor management software (for example, VMware vCenter Server) is not running on the Remote Site.
Matrix recovery management Job troubleshooting 59
Matrix recovery management job failed because of unlocatable logical server in Matrix OE logical server management. Possible causes include:
A logical server managed by Matrix recovery management was removed from Matrix OE logical server management before it was unmanaged in Matrix recovery management.
Matrix recovery management job failed because an operation failed in Matrix OE logical server management for the logical server. Possible causes include:
Failover job fails because of unlocatable logical server in LSM A logical server may have been deleted using Matrix OE visualization without removing the Recovery Group containing the logical server from Matrix recovery management. Remove the logical server from the Recovery Group in Matrix recovery management and rerun the failover job. Also look at the lsdt.log file located in the logs directory where Matrix recovery management is installed.
Failover job succeeds but certain logical servers are not activated Recovery Groups containing those logical servers may be in Maintenance Mode. Disable Maintenance Mode on the Recovery Groups and rerun the failover operation. Also look at the lsdt.log file located in the logs directory where Matrix recovery management is installed.
Failover job fails because of insufficient servers There may not be enough licensed physical resources capable of hosting the logical servers. Ensure that there are enough physical servers licensed for the HP Matrix Operating Environment available to host the logical servers managed by Matrix recovery management. Also look at the lsdt.log file located in the logs directory where Matrix recovery management is installed.
60
Troubleshooting
61
Matrix infrastructure orchestration Windows service is not running. There are no IO services that are DR protection enabled. All DR protection enabled IO services are already configured in IO services Recovery Groups. Communication failure with IO.
The import file is invalid. The replica service to be created already exists and is running locally. A primary service with the same name already exists. The values of the properties in the hpio.properties and dr.properties files are not correctly set. DR protection is not enabled (enable.dr.protection = false). The datastores specified in volume.dr.replica.list do not match the datastores for the IO service at the Remote Site. User name and domain name mapping is not specified.
Resources (network and storage) are not available. The Matrix infrastructure orchestration Windows service is not running. There is a communication failure with IO.
One or more Recovery Groups in the Matrix recovery management configuration are inconsistent with the Matrix OE logical server management or IO configuration Possible causes include:
The Matrix OE logical server management service, the Matrix infrastructure orchestration Windows service, or both of these services were not running when Recovery Groups were configured in Matrix recovery management. The logical sever or IO service was being actively managed when the Matrix recovery management configuration was invoked (modification, activation, or deactivation was in progress).
62
Troubleshooting
Error message
Error: HP Matrix infrastructure orchestration is down, causing either an empty list of services, or an unfiltered list of available logical servers (which includes the ones managed by Matrix IO services). Restart HP Matrix infrastructure orchestration and retry. When an IO services Recovery Group was created or modified using the Matrix recovery management GUI, Matrix recovery management was not able to communicate with IO to obtain a list of available IO services to be included in the Recovery Group. Restart the Matrix infrastructure orchestration Windows service and retry the operation.
Cause
Action
Error: Failed to filter out logical servers managed by Matrix IO services. Restart Matrix IO and retry. When an IO services Recovery Group was created or modified using the Matrix recovery management GUI, Matrix recovery management was not able to communicate with IO to obtain a list of IO services to filter out the VM logical servers that are not already part of an IO service that has DR Protection enabled. Restart the Matrix infrastructure orchestration Windows service and retry the operation.
Action
Error: Failed to import Matrix IO service. A primary service with the same name already exists. During import, a primary IO service with the same name as the imported replica IO service exists in IO. Either delete the primary IO service at the Local Site or recreate the primary IO service with a different name.
Error message
Error: One or more Recovery Groups in the Matrix recovery management configuration were found to be inconsistent with the LSM or Matrix infrastructure orchestration configuration. Please delete them. Check the Matrix recovery management log files to identify the inconsistency that caused the error. Matrix recovery management was not able to communicate with Matrix Operating Environment or IO to confirm that a logical server or IO service is being managed by Matrix recovery management. Delete the Recovery Group containing the logical server or IO service from Matrix recovery management. Confirm that the Matrix OE logical server management service and the IO service are running. Ensure that the logical server and IO service are not being actively modified by another user. Retry adding the Recovery Group in Matrix recovery management.
Cause Action
63
Storage resources are not available. The IO service is in an invalid state for activation. The IO service does not exist. The Matrix infrastructure orchestration Windows service is not running. There is a communication failure with IO.
The Matrix infrastructure orchestration Windows service is not running. The IO service does not exist. The IO service is in an invalid state for deactivation. There is a communication failure with IO.
Service activation/deactivation cannot be performed at this time since the service is in IN_PROGRESS state. The IO service was being operated on when the Matrix recovery management activation or deactivation operation was performed (modification, activation, or deactivation was in progress). Wait for the IO service to exit the IN_PROGRESS state and retry the activation or deactivation operation.
64
Troubleshooting
How to contact HP
Use the following methods to contact HP technical support: See the Contact HP Worldwide website: http://www.hp.com/go/assistance Use the Contact hp link on the HP Support Center website: http://www.hp.com/go/hpsc In the United States, call 1-800-3345144 to contact HP by telephone. This service is available 24 hours a day, 7 days a week. For continuous quality improvement, conversations might be recorded or monitored.
Warranty information
HP will replace defective delivery media for a period of 90 days from the date of purchase. This warranty applies to all HP Insight Management products.
HP authorized resellers
For the name of the nearest HP authorized reseller, see the following sources: In the United States, see the HP U.S. service locator website: http://www.hp.com/service_locator In other locations, see the Contact HP worldwide website: http://welcome.hp.com/country/us/en/wwcontact.html
Documentation feedback
HP welcomes your feedback. To make comments and suggestions about product documentation, send a message to: docsfeedback@hp.com Include the document title and manufacturing part number in your message. All submissions become the property of HP.
Related information
For support, software updates, and additional information on Matrix recovery management and other products used with Matrix recovery management, see the following websites: Matrix recovery management website at http://www.hp.com/go/insightrecovery. HP Insight Orchestration website at http://www.hp.com/go/insightorchestration. HP Matrix Operating Environment website at http://www.hp.com/go/matrixoe. HP Insight Control website at http://www.hp.com/go/insightcontrol. HP Insight Control virtual machine management website at http://www.hp.com/go/vmmanage . HP Insight Control server deployment website at http://www.hp.com/go/rdp. VMware Documentation at http://www.vmware.com/support/pubs.
66
Matrix recovery management white papers Matrix recovery management white papers are available at http://www.hp.com/go/matrixoe/ docs.
Matrix recovery management online help The Matrix recovery management online help provides information on operations that are performed from the Matrix recovery management user interface. It is accessible from the Matrix recovery management user interface and from the Help menu on the HP Matrix Operating Environment home page.
Related information
67
Glossary
bidirectional failover A Matrix recovery management feature that allows Recovery Group Sets to be activated or deactivated at either the Local Site or the Remote Site. At any point in time there can be activated and deactivated Recovery Group Sets at both sites. In the event of a disaster, or to accommodate site maintenance, all of the Recovery Group Sets in the Matrix recovery management configuration can be deactivated at one site, and activated at the other site. HP Systems Insight Manager (HP SIM) Central Management ServerA system in the management domain that executes the HP SIM software. All central operations within HP SIM are initiated from this system. Consistency groups are an important property of asynchronous mode volumes. A consistency group is a group of LUNs that need to be treated the same from the perspective of data consistency (I/O ordering). A consistency group is equal to a device group in the Raid Manager configuration file. Logical servers that can be moved back and forth between being VC hosted (hosted by a physical server) or VM hosted (hosted by a virtual machine). See VC hosted logical server and VM hosted logical server for more information. Cluster shared volumes. A feature within a management application that finds and identifies network objects. In HP management applications, discovery finds and identifies all the HP systems within a specified network range. (Disaster Recovery Group) The HP P6000 Continuous Access Software term for a Storage Replication Group. Logical servers and Matrix infrastructure orchestration services that are managed by Matrix recovery management are referred to as DR Protected (disaster recovery protected) logical servers and IO services. A component of the HP Matrix Operating Environment software that manages and automates operations associated with logical servers, including resource provisioning, startup, and shutdown. HP Systems Insight Manager HP Matrix Operating Environment infrastructure orchestration services. A logical server that is associated with IO services. An automated, multistep process associated with: Local Site logical server A Matrix recovery management Activate or Deactivate operation. A Matrix recovery management Site import operation. A Matrix recovery management Recovery Group import operation.
CMS
consistency group
DR Group DR Protected
HP Matrix OE logical server management HP SIM IO services IO services logical server Job
In a Matrix recovery management configuration, this is the set of managed nodes and corresponding CMS that your browser is interacting with. A logical server is a management abstraction that simplifies and optimizes the provisioning, management, and movement of a server, whether it is a physical server or a virtual machine. A logical server is the complete software image of a server, including operating system, applications, configuration data, and metadata that you create and assign to operate on a physical server or virtual machine. Logical servers are managed by HP Matrix OE visualization. Maintenance Mode is used to test Recovery Groups to ensure they will function properly when an Activate operation is performed. Maintenance mode temporarily stops Matrix recovery management from managing a DR Protected logical server or IO service. One use of maintenance mode is to perform failover rehearsal. By setting Maintenance Mode on a Recovery Group, all of the logical servers and Matrix infrastructure orchestration services in that Recovery Group can be activated through the Matrix Operating Environment. Once you are satisfied with the failover
Maintenance Mode
68
Glossary
rehearsal, the Recovery Group and its corresponding logical servers and IO services can be brought back under the control of Matrix recovery management. Matrix infrastructure orchestration services Matrix recovery management configuration process Matrix infrastructure orchestration services (IO services) quickly provision infrastructure to automatically activate physical and virtual servers, storage, and networking from pools of shared resources. More information on Matrix infrastructure orchestration is available at http:// www.hp.com/go/insightorchestration. The following table lists the six steps in the Matrix recovery management configuration process.
2 3
5 6
N_Port ID Virtualization A failover of all Recovery Group Sets from one site to another site, initiated in anticipation of an imminent disaster or for planned downtime. A portability group is a group of virtual machines or c-Class blades equipped with HP Virtual Connect (or a combination virtual machines and c-Class blades) that a logical server can be moved between. A Recovery Group configuration setting that determines the minimum time delay (in minutes) between the power-up of any two logical servers in a Recovery Group. The actual time delay may be more than the minimum time delay specified. The Preferred Site is the site where you prefer the Recovery Group to be activated unless circumstances require activation of the Recovery Group at the Secondary Site. A primary IO service is the first instance of an IO service that you select for DR Protection in a Matrix recovery management configuration. A replica IO service is a service Matrix recovery management creates during an import operation at a recovery site based on the primary IO service definition in the Matrix recovery management export file. A subnet that is not routed outside the data center and typically contains addresses only in the 192.x.x.x or 10.x.x.x address ranges. A subnet that is accessible to the Internet and cannot contain IP addresses in the 192.x.x.x or 10.x.x.x address ranges. Raw Disk Mapping Matrix infrastructure orchestration services that have been configured so they can be included in a DR Protected Matrix recovery management IO services Recovery Group. A pairing of one or more logical servers and a single Storage Replication Group. A Recovery Group has a Recovery Group Start Order number associated with it. During a site failover, Recovery Groups are failed over from one site to another site in the order specified.
Power-Up Delay
private
69
A set of Recovery Groups that share the same Preferred and Secondary sites. Recovery Groups cannot be activated or deactivated individually. Instead, all Recovery Groups that share the same Preferred and Secondary site must be activated or deactivated as a set. Recovery Group Sets can be selected for activation or deactivation at the Local site. An optional number that specifies the order in which a Recovery Group is to be started during a site failover. Recovery Groups without a start order number are started after all Recovery Groups that have an associated start order number. Logical servers at the Remote Site that are included in a Recovery Group. They are associated with logical servers at the Local Site that are included in the same Recovery Group. Normally they are deactivated and disabled. They are enabled and activated upon site failover. The duplication of components to prevent failure of the SAN solution. A site in a Matrix recovery management configuration that is not the Local Site. A replica IO service is a service Matrix recovery management creates during an import operation at a recovery site based on the exported IO service definition in the Matrix recovery management export file. Logical servers that are associated with replica IO services. A Storage Area Network (or subnetwork) that connects data storage devices with associated data servers. A SAN is typically part of an overall network of computing resources. The Secondary Site is the site where you prefer the Recovery Group to be on standby (in a deactivated state), unless circumstances require activation of the Recovery Group at the Secondary Site. Split brain occurs when two or more instances of the same application are active simultaneously, which might lead to data corruption. A Matrix recovery management feature that allows failover of the logical servers or IO services in a Recovery Group without failing over the Storage Replication Groups that are associated with those logical servers or IO services. As part of the Matrix recovery management configuration process, servers that manage HP P6000, HP P9000, HP 3PAR, or User Defined storage devices must be defined. These servers are referred to as Storage Management Servers. A set of LUNs across which storage replication preserves write-order at the replication target storage array. In HP P6000 Continuous Access Software terminology, this is known as a DR Group. In HP P9000 Continuous Access Software terminology this is known as a consistency group. A Sub Job is component of a Matrix recovery management activate, deactivate, Recovery Group import or Site import Job. For example, a Site activate Job would include a Recovery Group activate Sub Job for each Recovery Group that is activated on that site, and each Recovery Group activate Sub Job would include a logical server activate Sub Job for each logical server in that Recovery Group. A failover of all Recovery Group Sets from one site to another site, initiated in response to an unforeseen event that caused an outage at the site where the Recovery Group Sets where activated. Matrix recovery management provides a User Defined storage adapter interface specification to enable one-step Matrix recovery management failover capability for storage types that are supported by Matrix OE, but not yet integrated with Matrix recovery management. When you install a storage adaptor in your Matrix recovery management configuration for a storage type other than HP 3PAR, HP P6000, or HP P9000, you can define (and edit) Storage Management Servers based on that storage adaptor. These are referred to as User Defined Storage Management Servers in a Matrix recovery management configuration. For example, if you create and install a storage adaptor named EMC, the Storage server type drop-down menu for configuring a Storage Management Server will allow you to select EMC as the storage server type. For more information, see the HP Matrix Operating Environment 7.1 Getting Started Guide available at http://www.hp.com/go/matrixoe/docs. A logical server running on a c-Class blade equipped with HP Virtual Connect.
Recovery Group Start Order recovery logical servers redundant SAN Remote Site replica IO service
Sub Job
unplanned failover User Defined storage User Defined Storage Management Servers
71