Professional Documents
Culture Documents
Copyright
Copyright Mark's Work Wearhouse. All rights reserved.
This document is confidential and proprietary to Mark's Work Wearhouse and/or its affiliated or related entities. This document
shall not be duplicated, transmitted, used, or otherwise disclosed, in whole or in part, to anyone other than the organization or
specific individuals whom which this document is delivered on a need-to-know basis. The recipient may only use this
document to assist the recipient in its provision of services to Mark's Work Wearhouse and/or its affiliated or related entities.
These restrictions are applicable to the entirety of this document and all of its constituent parts individually. Mark's Work
Wearhouse reserves the right to require the recipient to return all copies of this document at any time. In the event that the
organization or specific individual to whom this document is given and Mark's Work Wearhouse enter into an agreement
applicable to any of the information contained in this document, in the event of a conflict between such agreement and this
notice, the use, disclosure and return of the copies of this document will be governed by the terms and conditions of such
agreement.
Mark's Work Wearhouse
30, 1035 - 64 Ave. S.E.
Calgary, AB T2H 2J7
TELEPHONE 403.255.9220
FACSIMILE 403.255.6005
WEB SITE WWW.MARKS.COM
1 | Page
Contents
1. About IBM InfoSphere CDC.......................................................................................................3
2. Current specifications for CDC server in Production..................................................................5
3. Major issues faced in CDC so far are as below:..........................................................................5
4. Resolution/immediate actions taken for above issues:................................................................6
5. Major Factors affecting performance of CDC for Oracle:..........................................................6
A.
Disk Space:..................................................................................................................................6
B.
RAM allocation for each instance:..............................................................................................7
C.
Archive log retention period:......................................................................................................7
D.
Maintaining active TCP connections in a network environment................................................7
E.
Local Log reading and Remote log reading................................................................................7
6. Improvement suggestions:...........................................................................................................8
2 | Page
The key components of the InfoSphere CDC architecture are described below:
Access ServerControls all of the non-command line access to the replication
environment. When you log in to Management Console, you are connecting to Access
Server. Access Server can be closed on the client workstation without affecting active
data replication activities between source and target servers.
Admin APIOperates as an optional Java-based programming interface that you can
use to script operational configurations or interactions.
Apply agentActs as the agent on the target that processes changes as sent by the
source.
3 | Page
Command line interfaceAllows you to administer datastores and user accounts, as well
encoding conversions, and other data to propagate to the target datastore engine.
Source database logsMaintained by the source database for its own recovery purposes.
The InfoSphere CDC log reader inspects these in the mirroring process, but filters out
the tables that are not in scope for replication.
Target transformation engineProcesses data and value translations, encoding
conversions, user exits, conflict detections, and other data on the target datastore
engine.
There are two types of target-only destinations for replication that are not databases:
JMS MessagesActs as a JMS message destination (queue or topic) for row-level
operations that are created as XML documents.
4 | Page
mcdcesb4
ap01
mcdcesb4ap
02
Server
IP
OS
Name
(Assigned
(Allocate by Server
d
by team)
Server
team)
cdc01.pd
10.100.1
RHL 6.4
16.59
cdc02.pd
10.100.116
.61
RHL 6.4
vCPU
Memo
ry
(GB)
Dis Services
k
Siz
e
(G
B)
150
1.
Datastage
Services
Tier
2.
CDC
(Change
Data
Capture) Access
Data
15
0
1.
Datastage
Services
Tier
2. CDC (Change
Data Capture) Access
Data
We have above configuration for CDC servers. Access server for CDC is installed in above
location and is utilised by Access manager and management console. For each subscription
we have source and target CDC agent installed in respective servers
ExOracle AIX Server [for Oracle],
I-series server [for DB2] &
Linux box [For Datastage]
Each instance is configured as Datastore in Access Server and then used by CDC
subscription for Replication.
5 | Page
Recovery of Archive logs. [Usually take longer time if logs are backed up in Disc/tape].
Memory cleanup/Addition of memory to instance having issue.
Instance restart to clear out hung connections and to restart respective Datastores.
Set bookmark value to last commit position, Mark table capture and restart
Subscriptions.
Clearing of logs/staging store information/temp files to free memory.
Increase Retention period of Archive logs and increase Global disk space.
Reset journal entry for DB2 instance.
Changing source id/name of subscriptions based on input from IBM.
CDC. The minimum is 1 GB. Although the minimum is 1 GB, prepare for more disk
space since there is a staging store on the source. Use the InfoSphere
CDCconfiguration tool to configure disk space for this quota.
5 GBFor installation files, data queues, and log files .
Global disk quotaDisk space is required on your source system for this quota which is
used to store in-scope change data that has not been committed in your database.
The amount of disk space required is determined by your replication environment
6 | Page
and the workload of your source database. Use the mirror_global_disk_quota_gb system
parameter to configure the amount of disk space used by this quota.
CDC target system:
1 GBThe minimum amount of disk space allowed for the disk quota for each
instance of InfoSphere CDC. The minimum value for this quota is sufficient for all
instances created on your target system. Use the InfoSphere CDC configuration tool
to configure the disk space for this quota.
5 GBFor installation files, data queues, and log files.
Global disk quotaDisk space is required on your target system for this quota which is
used to store LOB data received from your InfoSphere CDC source system. The
amount of disk space required is determined by your replication environment and the
amount of LOB data you are replicating. To improve performance, InfoSphere
CDC will only persist LOB data to disk if RAM is not available on your target system.
Use the mirror_global_disk_quota_gb system parameter to configure the amount of disk
space used by this quota.
InfoSphere CDC may require additional disk space in the following situations:
You are running large batch transactions in the database on your source system.
You are configuring multiple subscriptions and one of your subscriptions is latent. In
this type of scenario, InfoSphere CDC on your source system may persist transaction
queues to disk if RAM is not available.
You are replicating large LOB data types.
Each instance of CDC requires memory for the Java Virtual Machine (JVM). The
following default values for memory are assigned:
1024 MB of RAM Default value for each 64-bit instance of InfoSphere CDC. This will
change depending of no of subscription/usage of instance .
InfoSphere CDC source deployments may require additional RAM in the following
scenarios:
You are replicating large LOB data types with your InfoSphere CDC source
deployment. These data types are sent to target while being retrieved from the
source database. The target waits until all LOBs (for each record) are received before
applying a row. LOBs are stored in memory as long as there is adequate RAM,
otherwise they are written to disk on the target.
You are replicating "wide" tables with hundreds of columns.
You are performing large batch transactions in your source database rather than
online transaction processing (OLTP).
If multiple subscriptions are using same Datastore/CDC instance.
6. Improvement suggestions:
Increase Archive log retention period to minimum of 72 hrs. for Source Oracle
instances (IBM recommendation is 5 days)
We would need to increase disk allocated to Archive directory to accommodate 72
hrs. of Archive logs.
Increase Staging Area disk quota to minimum of 10 GB, where it is less than 10 GB.
MDR is being used as source instance and Target instance, creating additional
overhead for Datastore.
We can create a new instance which will be used as source instance for subscriptions
reading from MDR.
At the end we will be having 19 subscriptions which are using MDR as source and 39
subscriptions writing to MDR from various source databases.
8 | Page
CDC
recommendations.xlsx
9 | Page