White Paper
This document contains Confidential, Proprietary and Trade Secret Information (“Confidential Information”) of
Informatica Corporation and may not be copied, distributed, duplicated, or otherwise reproduced in any manner
without the prior written consent of Informatica.
While every attempt has been made to ensure that the information in this document is accurate and complete, some
typographical errors or technical inaccuracies may exist. Informatica does not accept responsibility for any kind of
loss resulting from the use of information contained in this document. The information contained in this document is
subject to change without notice.
The incorporation of the product attributes discussed in these materials into any release or upgrade of any
Informatica software product—as well as the timing of any such release or upgrade—is at the sole discretion of
Informatica.
Protected by one or more of the following U.S. Patents: 6,032,158; 5,794,246; 6,014,670; 6,339,775; 6,044,374;
6,208,990; 6,850,947; 6,895,471; or by the following pending U.S. Patents: 09/644,280;
10/966,046; 10/727,700.
Table of Contents
Executive Summary
Conclusion
Lack of a data loading solution for multiple target systems, delaying time to market
• Cannot keep systems “live” and contributing to workload execution
• Unable to smooth peak processing
• Takes days or weeks to move data among production and QA systems
[Figure 1 diagram. Recoverable labels: Replication, Dual Load, ETL, Administration, Operational Control, Teradata Query Director, Business Intelligence, Teradata Demand Chain Manager, New York Headquarters, Teradata System B (Jersey City).]
Figure 1. Teradata Dual Active Architecture
First, at the left, you can see how three data synchronization methods—table copy, replication, and
dual load—help ensure the primary system and secondary system are current and synchronized.
The dual load solution highlighted in blue uniquely enables the extract-transform-load (ETL)
processing of bulk data into the multiple Teradata target systems at any data volume at low
latency. This dual load technique co-exists with other data synchronization techniques such as
table copy and data replication. Table copy helps move, archive, and restore data within tables
across Teradata systems in medium-to-long latency scenarios. Data replication captures changes
in the primary database and applies the in-database changes from the primary system to the
secondary system in low-volume instances. The dual load solution also communicates with
Teradata Multi-System Manager (TMSM) about the status of data loading jobs. TMSM serves as
a centralized management system that performs monitoring, administration, and operational
controls across Teradata systems. Teradata Query Director uses IP routing data to route queries
that come from users in multiple locations and from multiple applications, including BI systems,
Teradata Relationship Manager, and Teradata Demand Chain Manager.
If users are located closer to the primary system than to the secondary system, the query can be
routed to the primary system by the Teradata Query Director. If another user is in the vicinity of the
secondary system, the query can be routed to the secondary system to ensure optimal speed.
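The routing rule described above can be illustrated with a minimal Python sketch. It is not Teradata Query Director's actual logic or API: the `TeradataSystem` class, the `route_query` function, and the distance values are hypothetical names and numbers chosen for illustration, and skipping an unavailable system is an added assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TeradataSystem:
    name: str        # illustrative label, e.g. "primary"
    available: bool  # whether the system can currently accept queries


def route_query(user_distance, systems):
    """Return the nearest available system for one user.

    user_distance maps a system name to a hypothetical distance metric
    (for example, network hops); a smaller value means closer.
    """
    # Assumption: an unavailable system is never chosen, even if it is closer.
    candidates = [s for s in systems if s.available]
    if not candidates:
        raise RuntimeError("no Teradata system is available")
    return min(candidates, key=lambda s: user_distance[s.name])


primary = TeradataSystem("primary", available=True)
secondary = TeradataSystem("secondary", available=True)

# Hypothetical distances for two users in different locations.
near_primary = {"primary": 1, "secondary": 5}
near_secondary = {"primary": 5, "secondary": 1}

print(route_query(near_primary, [primary, secondary]).name)
print(route_query(near_secondary, [primary, secondary]).name)
```

In this sketch each user is sent to the closer system, and a user near an unavailable system is sent to the remaining one.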
• Future-proof the data warehousing environment by actively balancing the loads, thereby making
it more reliable, resilient, and robust to handle mission-critical business intelligence and other
applications
“[...] reduce overall operating costs.”
Stephen Brobst
Chief Technology Officer, Teradata
How does it mitigate risk as part of the business continuity and disaster recovery mandate?
• Continue data loading even if one system is unavailable
• Ensure control and transparency over recovery state and restartability
• Demonstrate data loss protection and security from sources to loading
Figure 2. High-Level Solution Architecture: Informatica Dual Load Solution for Teradata
How does it work? First, Informatica PowerExchange® Adapters secure and sustain direct
connectivity to any data in source systems, whether it’s relational data, mainframe, packaged
applications, data in the cloud, or unstructured and semi-structured data. The data is then
extracted into the Informatica Platform where PowerCenter® Advanced Edition™ with its Metadata
Manager performs lineage analysis and creates a metadata catalog to enhance transparency and
collaboration between IT and the business. The PowerCenter High Availability Option™ configures
multiple backup services and minimizes service disruptions across the entire platform to provide
resiliency, restartability, failover, and recoverability. Within the PowerCenter environment, binary
staging files are maintained and saved for pushing the data into the Informatica Dual Load
Solution environment. The Informatica Dual Load Solution consists of the following capabilities:
Dual Load Staging Adapter
• Defines dual load connections as extensions to the Teradata Parallel Transporter (TPT) adapter
component of PowerExchange for Teradata
• Defines run-time session-level parameters specifying connectivity attributes to external systems
• Makes all data repeatable and staged into binary files for recovery and restartability
With all these capabilities together, Informatica’s dual load solution helps IT organizations quickly
design, test, and populate data warehouses to meet stringent SLAs and other business demands.
Based on the Informatica data integration platform, it also uniquely empowers IT to extend current
data warehousing projects and modernize the environment with highly reliable, secure, and
resilient data integration processing.
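The idea of staging data once and replaying it to several targets can be sketched in Python. This is only an illustration under stated assumptions, not the Informatica implementation or the TPT adapter API: `stage_rows`, `load_target`, `dual_load`, and `FakeTarget` are hypothetical names, `pickle` stands in for the binary staging format, and a per-target status dictionary stands in for recovery state.

```python
import pickle
import tempfile
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def stage_rows(rows, staging_path):
    """Write the transformed rows once to a binary staging file."""
    with open(staging_path, "wb") as f:
        pickle.dump(list(rows), f)


def load_target(staging_path, target):
    """Replay the staged rows into one target; return True on success."""
    with open(staging_path, "rb") as f:
        rows = pickle.load(f)
    try:
        target.load(rows)
        return True
    except ConnectionError:
        return False


def dual_load(staging_path, targets, status):
    """Load, in parallel, every target not yet marked done; update status."""
    pending = [t for t in targets if not status.get(t.name)]
    with ThreadPoolExecutor(max_workers=max(1, len(pending))) as pool:
        results = list(pool.map(lambda t: load_target(staging_path, t), pending))
    for target, ok in zip(pending, results):
        status[target.name] = ok
    return status


class FakeTarget:
    """Stand-in for a target system; not a real Teradata client."""

    def __init__(self, name, up=True):
        self.name = name
        self.up = up
        self.rows = []

    def load(self, rows):
        if not self.up:
            raise ConnectionError(f"{self.name} is unavailable")
        self.rows = list(rows)  # overwrite so that a retry stays idempotent


staging = Path(tempfile.mkdtemp()) / "batch.bin"
stage_rows([(1, "a"), (2, "b")], staging)

system_a = FakeTarget("system_a")
system_b = FakeTarget("system_b", up=False)

status = dual_load(staging, [system_a, system_b], {})
print(status)  # system_a is loaded; system_b is still pending

system_b.up = True  # the second system comes back
status = dual_load(staging, [system_a, system_b], status)
print(status)  # only system_b is retried, from the same staged file
```

Because the rows are read back from the staging file rather than extracted again, a restart touches only the target that failed, which is the property the staging adapter bullets describe.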
Conclusion
Increasing business availability of information is a key concern for many organizations seeking
to become data driven. They are re-examining their IT architecture to confirm that it supports
a layered strategy in which data availability, recoverability, and protection are fundamental
tenets of business continuity. In response, an increasing number
of IT departments are taking a multisystem approach to the data warehousing architecture
and adopting a more systematic, streamlined approach to access, integrate, and deliver the
freshest, most relevant data to the business. To support this tiered approach, Informatica,
along with Teradata, has introduced the only dual load solution in the market that relies on a
comprehensive, product-based data integration platform. The Informatica Dual Load Solution
for Teradata helps organizations access, integrate, and load the freshest, business-critical data
to construct and maintain the multiple Teradata data warehouses for business continuity and
disaster recovery. This pioneering dual load solution extracts and transforms the data once and
simultaneously loads it into multiple Teradata systems. With Informatica’s dual load solution, an IT
organization can minimize the costs, time, and risks associated with availability and protection of
information, empowering business to tap the mission-critical information infrastructure for superior
performance and continuous availability.
Learn More
Learn more about Informatica’s EDW solutions at
http://www.informatica.com/solutions/enterprise_data_warehouse. Visit us at
http://www.informatica.com or call (800) 653-3871 to learn more about Informatica and the
entire Informatica Platform.
About Informatica
Informatica Corporation (NASDAQ: INFA) is the world’s number one independent provider of data
integration software. Organizations around the world gain a competitive advantage in today’s
global information economy with timely, relevant and trustworthy data for their top business
imperatives. More than 4,100 enterprises worldwide rely on Informatica to access, integrate and
trust their information assets held in the traditional enterprise, off premise and in the cloud.
© 2010 Informatica Corporation. All rights reserved. Printed in the U.S.A. Informatica, the Informatica logo, and The Data Integration Company are trademarks or registered trademarks of Informatica Corporation in the United States and in
jurisdictions throughout the world. All other company and product names may be trade names or trademarks of their respective owners. First Published: August 2010 7188 (09/20/2010)