Professional Documents
Culture Documents
Attunity Change Data Capture (CDC) delivers up-to-the-minute data and dramatically
reduced resource consumption when used as part of ETL and data synchronization
processes. For enterprises that need to use data stored in mainframe and legacy
data sources as part of their ETL/BI initiatives, Attunity offers a robust and flexible
solution for moving the right enterprise data to the right place at the right time.
Page 1 of 15
Page 2 of 15
Table of Contents
Page 3 of 15
Business Globalization and 24/7 operations. In the past, enterprises could stop online systems
during the night or weekend, to provide a window of time for running bulk ETL processes.
Nowadays, running a global business with 24x7 operations, means smaller or no downtime
windows.
Need for up-to-date, current data. Customer demand, competitive pressure, and improved decisions,
require more up-to-date data. To make the most of BI in todays ever-accelerating business climate,
managers should not be working with last weeks data. Today, decision makers need data that is
updated a few times a day, or even in real-time.
Data volumes are increasing. The more the business grows, the bigger the data volumes in
operational data stores become. Larger data volumes mean increased CPU and network resources
when performing a bulk ETL process, while the bulk extract windows are getting smaller over time.
Cost reduction. Bulk ETL operations are costly and inefficient, as they require more processing
power, more memory, and more network bandwidth. In addition, as bulk ETL processes run for long
periods of time, they also require more administration and IT resource to manage.
To stay ahead of these changing business conditions and increase the value of BI implementations, a
generation of intelligent ETL is required. The power behind it is Change Data Capture:
Change Data Capture (CDC) is an innovative approach to data integration, which is based
on the identification, capture, and delivery of the changes made to enterprise data sources.
Page 4 of 15
No downtime window for ETL. CDC enables organizations to move changes to data while the
operational systems are running, without the need for a downtime window.
Current, up-to-date data. By constantly identifying changes, CDC delivers new data more
frequentlyeven real-timeproviding more current data for enterprise users.
Reduced cost. By moving only the changed data, CDC requires significantly fewer resources for
moving and transforming data. Cost is reduced in hardware, software, and human resources.
Change data capture, in batch or real-time (for next generation ETL processes)
To learn more about the Attunity products, visit us at www.attunity.com or email info@attunity.com.
The rest of this document provides in-depth information about CDC solution patterns and how Attunity
enables enterprises to leverage CDC today for more efficient ETL and more effective BI.
Change Data Capture next generation ETL
Copyright 2004 Attunity Ltd. All rights reserved
Page 5 of 15
Enable consumers of changed data to receive changes quickly, either by asking for the changes in
more frequently (e.g. every hour, or every 20 minutes), or by automatically sending the changes as
soon as they are identified.
Change Capture Agents. Change capture agents are live software components that are
responsible for the identification and capture of changes to the source operational data store. These
agents also prepare the changes for delivery to the target database/application. Change capture
agents are typically built and optimized for specific data stores (i.e. monitoring a journal, or using
dedicated hooks), though generic agents exist as well. To capture changes, CDC agents typically
scrape system or database journals, use hooks or triggers or user exits, to collect changes and
notify the receiving systems.
Change Delivery Mechanisms. Change delivery mechanisms are responsible for the reliable
delivery of changed data to the change consumertypically the ETL tool or program that will
complete the ETL process (i.e. adding the transform and load steps). Change delivery mechanisms
can either use a pull model where the change consumer initiates the request to get the changes; or
a push model, where changes are pushed to the consumer as soon as they are captured.
Operational
Data Source
Data Warehouse
Hooks,
Triggers
Read Changes
(pull)
Change
Data
Capture
ETL Tool or
Program
Send Changes
(push)
Load Data
Load Data
Data Marts
Monitor Journals
Journal/Log
The above diagram provides an overview of a CDC-enabled ETL process. The following paragraphs provide
an overview of two CDC scenarios and the components that take part in the process.
Page 6 of 15
Page 7 of 15
Attunity Server. The server manages the Attunity components on the data server, the client
connections, security, and load balancing.
Attunity Metadata Repository. The repository defines the data models that expose the data source
for direct access or change data capture. For non-relational data sources, Attunity enables to define
mappings to an enhanced relational data model, including import for existing metadata.
Page 8 of 15
Attunity Data Drivers. The data drivers provide standard access to various data sources, relational
and non-relational. The drivers enable to read/write data from operational data sources for bulk data
extraction or for enterprise reporting.
Attunity Change Capture Agents and Change Queue. The change capture agents are live
software components that continuously monitor for changes in data sources and prepare a change
queue, that may be virtual or physical. A virtual change queue is essentially a virtual layer that
reflects an existing log (e.g. CICS logstream) as a change queue, without actually copying the
change records. A physical change queue acts as a staging area and creates a copy of the change
records.
Attunity Standard Client Interfaces. The client interfaces are components that client applications
can use to perform queries using the back-end Attunity drivers or a change queues. These include
ODBC, JDBC, OLE/DB, ADO, and ADO.NET. A change reader can use any one of these interfaces.
Attunity Event Queue Services. The event queue services provide an event delivery mechanism
that uses a message queue transport to send events to event listeners. On the data server, event
publishers can publish events from change queues or from legacy applications, and have them
routed to event listeners on other platforms.
The following diagram describes the high-level Attunity Connect CDC architecture:
(* The Attunity Server and Metadata Repository are omitted for clarity)
Page 9 of 15
VSAM CDC
Wizard uses
the CICS logs
Choosing
tables for
CDC
Mapping
VSAM to a
relational
metadata
model
Page 10 of 15
VSAM-Batch (Mainframe)
Page 11 of 15
Informatica PowerCenter
Cognos DecisionStream
Hummingbird Genio
Attunity ODBC Clients enable immediate interoperability with all of these tools to support metadata browsing,
data extraction and change data capture. By employing the batch-oriented CDC scenario described earlier, it
is easy to set up ETL processes that retrieve changes every hour or every few minutes.
In this case, implementing a CDC process is similar to implementing a traditional bulk ETL processes. To set
it up, Attunity Connect is configured to recognize the change queue as a data source, and users of the ETL
tool can simply extract the records from the change queue, referring to it as the source data store. Each time
the process is run, a batch of changed records will be returned and processed by the ETL tool. Attunity
Connect keeps track of the last changed record read, and the next time the ETL tool reads the change
queue (i.e., select * from myChanges), it receives the next batch of changes that occurred since the last
request.
In addition, Attunity takes care of normalizing non-relational data by virtually mapping it to a relational data
model. This facilitates the processing of non-relational data in ETL tools and makes it easier to transform and
load this data into a relational database as the target data store.
Real-time change data capture is supported by integrating an Attunity Event Router into any of these tools.
Attunity is planning to add support for MQSeries by the end of June 2004, which will facilitate interoperability
with ETL tools that support this popular messaging middleware.
Furthermore, Attunity products enable direct access to operational data stores, as well as federated access
to historical (DW) and operational data. These capabilities complement the BI offerings provided by the
vendors mentioned abovefurther increasing the value of Attunity Connect for the end-user.
Page 12 of 15
The Challenge
The State agency chose to host its HIPAA-compliant solution on a Windows platform, using SQL Server and
BizTalk Server. This presented an immediate challenge in that the States existing healthcare systems run
on a mainframe and use VSAM tables as their operational data store. To guarantee success, both systems
have to remain in sync to ensure data integrity.
The Solution
The State agency implemented an end-to-end solution based on BizTalk Server and Attunity Connects
Change Data Capture modules. The result is a new system that:
Page 13 of 15
Page 14 of 15
About Attunity
Attunity is a leading provider of connectivity solutions for enterprise data and legacy applications. Founded
in 1987 and traded on the NASDAQ exchange, Attunitys worldwide operations support over 1,000 direct end
users including many of the Fortune 1000. Through distribution and OEM agreements with global-class
partners such as Oracle and HP, Attunity-based solutions are deployed on tens of thousands of systems
worldwide.
The Attunity Connect product family provides standards-based access to over 30 data sources on 20
different computing platforms. Attunity Connect engines reside natively on each target platform and provide
enterprise-class integration capabilities such as real-time read/write access, federated data access between
relational and non-relational data sources, bulk data extraction and change data capture.
Attunitys products are available through direct sales and support offices (listed below) as well as distributors
in Japan, S.E. Asia, Europe, and Latin America. For more information, visit www.attunity.com or email
info@attunity.com.
Corporate Headquarters / USA
Attunity Inc.
40 Audubon Road
Wakefield, MA 01880, USA
t +1 (781) 213-5200
f +1 (781) 213-5240
sales@attunity.com
United Kingdom
Attunity (UK) Ltd.
Unit 6
Beacontree Plaza
Reading
RG2 0BS
United Kingdom
t +44(0)118 975 3330
f +44(0)118 975 3005
info-uk@attunity.com
France
Attunity (France) S.A.
51, Blvd. Bessires
75017 Paris
France
t +33 1 53 06 80 80
f +33 1 53 06 80 89
info-france@attunity.com
Israel
Attunity (Israel) Ltd.
8 Hagalim Street
POB 12227
Herzliya 46733
Israel
t +972 9 960 2600
f +972 9 960 2601
info-il@attunity.com
Page 15 of 15