Professional Documents
Culture Documents
Table of Contents
Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Informatica Resources. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Informatica My Support Portal. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Informatica Documentation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Informatica Product Availability Matrixes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Informatica Web Site. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Informatica How-To Library. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Informatica Knowledge Base. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Informatica Support YouTube Channel. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Informatica Marketplace. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Informatica Velocity. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Informatica Global Customer Support. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Table of Contents
Table of Contents
Preface
The Informatica Data Quality Repository Migration Guide is written for data quality developers. This guide
assumes that you have an understanding of data quality concepts, flat file and relational database concepts,
and the database engines in your environment. This guide also assumes that you are familiar with the
concepts presented in the Informatica Developer User Guide.
Informatica Resources
Informatica My Support Portal
As an Informatica customer, you can access the Informatica My Support Portal at
http://mysupport.informatica.com.
The site contains product information, user group information, newsletters, access to the Informatica
customer support case management system (ATLAS), the Informatica How-To Library, the Informatica
Knowledge Base, Informatica Product Documentation, and access to the Informatica user community.
Informatica Documentation
The Informatica Documentation team makes every effort to create accurate, usable documentation. If you
have questions, comments, or ideas about this documentation, contact the Informatica Documentation team
through email at infa_documentation@informatica.com. We will use your feedback to improve our
documentation. Let us know if we can contact you regarding your comments.
The Documentation team updates documentation as needed. To get the latest documentation for your
product, navigate to Product Documentation from http://mysupport.informatica.com.
Informatica Marketplace
The Informatica Marketplace is a forum where developers and partners can share solutions that augment,
extend, or enhance data integration implementations. By leveraging any of the hundreds of solutions
available on the Marketplace, you can improve your productivity and speed up time to implementation on
your projects. You can access Informatica Marketplace at http://www.informaticamarketplace.com.
Informatica Velocity
You can access Informatica Velocity at http://mysupport.informatica.com. Developed from the real-world
experience of hundreds of data management projects, Informatica Velocity represents the collective
knowledge of our consultants who have worked with organizations from around the world to plan, develop,
deploy, and maintain successful data management solutions. If you have questions, comments, or ideas
about Informatica Velocity, contact Informatica Professional Services at ips@informatica.com.
Preface
CHAPTER 1
Export Data Quality 8.6.2 repository objects and reference data to the file system in XML format.
Import the reference data as reference tables to the 9.0.1 Model repository and staging area
You complete the migration in the Developer tool by importing the XML package containing the
transformation, mapplet, and mapping XML to the Model repository.
Note: If a Data Quality 8.6.2 object reads a database source, the migration process preserves the database
connection information. You do not need to re-create the database connection in Data Quality 9.0.1.
To migrate from Data Quality 8.6.2 to Data Quality 9.0.1 HotFix 1, run the migration files associated with Data
Quality 9.0.1 HotFix 1.
To migrate from Data Quality 8.6.2 to Data Quality 9.0.1, run the migration files associated with the Data
Quality 9.0.1.
The Informatica Data Quality 8.6.2 repository contains two types of object: projects and plans. The 8.6.2
repository does not store transformation or data source definitions as separate objects. The 8.6.2
repository stores all metadata as XML.
An 8.6.2 repository project is similar to a 9.0.1 Model repository project. Both display user-defined folders
in the repository structure.
An 8.6.2 plan equates to a mapping in the 9.0.1 Model repository. A plan contains a data source and data
target connected by zero or more transformations. It runs in the same manner as a mapping.
The Informatica Data Quality 8.6.2 user creates and runs plans in a client application called Data Quality
Workbench. The application installs with a local repository. Informatica Data Quality 8.6.2 enables remote
clients to connect to an 8.6.2 repository in a client-server manner, but all Informatica Data Quality 8.6.2
repositories are identical.
The 8.6.2 transformation has a direct counterpart in 9.0.1. Informatica Data Quality 9.0.1 includes
transformations that are effectively copies of 8.6.2 transformations. For example, the Merge, ToUpper,
and Rule-Based Analyzer transformations in Informatica Data Quality 8.6.2 become Merge, Case, and
Decision transformations in Informatica Data Quality 9.0.1.
9.0.1 transformations provide equivalent functionality to or have evolved from 8.6.2 transformations. For
example, the 9.0.1 Comparison transformation combines the functionality of the Bigram, Jaro, Hamming
Distance, and Edit Distance transformations. These 8.6.2 transformations convert seamlessly to a
Comparison transformation.
The 8.6.2 transformation does not have a direct counterpart in 9.0.1 but the transformation functionality is
maintained in other transformations. In such cases, the 8.6.2 transformation metadata transfers to other
transformations. For example, the Word Manager transformation does not migrate to 9.0.1, but its
metadata transfers to the Standardizer transformation, which enables the same functionality.
The 8.6.2 transformation is not supported in 9.0.1 and the transformation functionality does not transfer to
other transformations. In such cases, the 8.6.2 transformation input and output metadata is applied to
another transformation, for example an Expression transformation.
Sources and targets used in grouping data records before duplicate analysis.
10
8.6.2 Component
9.0.1 Component
Aggregation
Aggregator transformation
Association transformation
Bigram
Comparison transformation
Character Labeler
Labeler transformations
Consolidation [for
PowerCenter]
Consolidation transformation
Context Parser
Labeler and Parser transformations. The Parser transformation is set to patternbased parsing mode.
Count
8.6.2 Component
9.0.1 Component
CSV Target
CSV Source
DB Match Source
DB Report Target
DB Target
DB Source
Edit Distance
Comparison transformation
Global AV [SDK]
Not supported
Group Target
Group Source
Hamming Distance
Comparison transformation
Identity Match
Match transformation
Jaro Distance
Comparison transformation
11
8.6.2 Component
9.0.1 Component
Merge
Merge transformation
MinAvgMax
Missing Values
Not supported
Normalization [SDK]
Not supported
NYSIIS
Parsing [SDK]
Not supported
Profile Standardizer
Range Counter
12
Realtime Target
Realtime Source
Report Target
Decision transformation
SAP Target
Not supported
SAP Source
Not supported
Scripting
Not supported
Search Replace
Standardizer transformation
Similarity [SDK]
Not supported
Soundex
Splitter
Labeler, Parser, and Expression transformations. The Parser is set to patternbased parsing mode.
Sum
To Upper
Token Labeler
Labeler transformation
Token Parser
Parser transformation
8.6.2 Component
9.0.1 Component
Word Manager
Standardizer transformation
Reference data that you created in file or database form. If you created database dictionaries in Data
Quality 8.6.2, the export process converts these to file. The import process reads reference data files into
the 9.0.1 Model repository and staging area.
Informatica dictionary files that the process does not recognize as part of the Data Quality 9.0.1 Content
Installer file set. The process exports Country Pack and Region Pack files.
The migration process does not export the following types of reference data:
Informatica reference data shipped by default with the Data Quality 9.0.1 Content Installer
Note: Each version of Informatica 9.0.1 performs reference data migration in a different way. You must run
migration files that are compatible with your version of Informatica 9.0.1.
The following table describes the differences between each release:
Informatica Release
Cross-Version Compatibility
13
The migration process recognizes Informatica reference data even if the reference data file name has
changed between versions 8.6.2 and 9.0.1. If an 8.6.2 plan reads a reference data file that is represented by
a reference table in 9.0.1, the migration process updates the imported mapping to read the new reference
table.
The migration process requires that Data Quality 8.6.2 dictionaries use UTF-8 encoding. If your Data Quality
8.6.2 dictionaries use encodings other than UTF-8, convert the dictionaries to UTF-8 before migration.
14
CHAPTER 2
Migration Prerequisites, 17
ClientPackage. Exports the 8.6.2 repository contents and copies reference dictionary data to the file
system. The batch processes compresses and save the files in a format legible to the ServerImport batch
file.
You can append parameters to the ClientPackage batch file to read plan metadata from the file system
and not from the 8.6.2 repository. You must use these parameters when migrating metadata from a Data
Quality Server repository.
ServerImport. Extracts and writes reference metadata to the 9.01 Model repository. Extracts and writes
reference data to the 9.0.1 staging database. The file also save plan metadata in a format legible to the
9.0.1 Model repository. It does not write the plan metadata to the Model repository.
Note: You must manually import the plan metadata to the 9.0.1 Model repository.
15
16
Migration Prerequisites
You must verify that the client batch file can access all Informatica Data Quality 8.6.2 objects and data. You
must also understand the changes that migrated objects can undergo during the migration process.
Before you begin the migration process, answer the following questions:
Do the plans read reference data provided by Informatica?
Informatica Data Quality 8.6.2 uses dictionary files as reference data. If you migrate plans that read
dictionary files, you must verify that the dictionaries are accessible on the Data Quality 8.6.2 machine.
The migration process reads the location of the dictionary files from the Data Quality config.xml file.
Default location for configuration file: [install_dir]\config.xml
Example: C:\Program Files\Informatica Data Quality\config.xml
Default location for dictionaries: [install_dir]\Dictionaries
Example: C:\Program Files\Informatica Data Quality\Dictionaries
Note: The migration process ignores most Informatica dictionary files when it exports items from Data
Quality 8.6.2. Use the Data Quality Content Installer to add Informatica reference data to Informatica
Data Quality 9.0.1. Ensure that you include Country Pack dictionaries and premium address reference
data files read by the 8.6.2 plans when you run the Content Installer.
Run the Server and Client Content Installers before you perform any migration tasks on an Informatica
Data Quality 9.0.1 machine.
Do the plans read from or write to database tables?
If the plans read from or write to a database, take note of the database connection details. Verify that the
9.0.1 Data Integration Service can access the database host machines.
If the plans read from or write to files, copy these files to a location accessible to the 9.0.1 Data
Integration Service. You can set the location of source and target files in the migration.properties file.
Is Informatica Data Quality 9.0.1 installed, and are the required services running?
The following 9.0.1 services must be running before you import migrated files:
Analyst Service
Have you created a project in the Model repository for the data you want to import?
Create this project before you import migrated files. Create a folder in the project to store the reference
tables created from the 8.6.2 dictionary files.
Have you reviewed the migration.properties file?
Before you run the ServerImport process on the Data Quality 9.0.1 system, must review the
migration.properties file and verify that the property settings are correct for your environment and the
migration objects.
Migration Prerequisites
17
18
Property
Description
DSO.DefaultSourceFolder
The path to the folder that you want to contain flat file data sources in Data
Quality 9.0.1. Set this property if you want all flat file data objects to read
data from a single location. This location must be accessible in the Data
Quality 9.0.1 server environment.
DSO.DefaultTargetFolder
The path to the folder that you want to contain flat file data targets in Data
Quality 9.0.1. Used by the ServerImport process reads this property. Set
this property if you want all flat file data objects to write data to a single
location. This location must be accessible in the Data Quality 9.0.1 server
environment.
EDR.Host
EDR.Port
The port number that the ServerImport process uses to communicate with
Informatica 9.0.1 services. This port number must match the Service
Manager port used during Data Quality 9.0.1 installation process. Default is
6006.
Locale.Client
Migration.Formatter
Migration.LogLevel
Report.Format
The file format of report files. Set the property to HTML or XML. Default is
HTML.
Report.Generate
RTM.AtService
Property
Description
RTM.ContentProject
The Model repository project that contains reference data installed by the
Content Installer. If the plans you export from Data Quality 8.6.2 read
Informatica dictionaries, the migration process can link the imported
transformations to the Informatica 9.0.1 reference data.
Set RTM.MapRTM to Yes to enable imported objects to read Informatica
reference data.
RTM.ContentRootDirectory
RTM.Host
RTM.ImportSet
Determines the dictionary files that are written as reference tables during
the import process. If the ClientPackage process identifies a large quantity
of unused dictionaries, set this to UsedOnly. Default is All.
RTM.MapRTM
RTM.Repository
RTM.UserProject
The Model repository project that reference data and mappings import to.
Create the project before you run ServerImport.
RTM.UserRootDirectory
The folder within RTM.UserProject to contain the reference data read by the
imported objects. Create the folder before you run ServerImport.
Server.Password
Password for Data Quality 9.0.1. You must have read and write permissions
on the project folders that you import to.
Server.UserName
User name for Data Quality 9.0.1. You must have read and write
permissions on the project folders that you import to.
Stage.Oracle
The staging database type. If you have configured a staging database and
schema of a particular type, update the property with the name of the
connection that uses the database and schema. Default for each property is
blank.
Stage.SqlServer
Stage.ODBC
Stage.MySQL
Find nodemeta.xml.
The default location of this file is
<Informatica_services_installation_directory>/isp/config
Migration Prerequisites
19
2.
3.
4.
Database Considerations
If Data Quality 8.6.2 uses multiple staging database types, you must ensure that a database or schema and a
connection object exist for each type. Add the name of each connection object to the Stage.<databasetype>
property in the migration.properties file.
Informatica connects to Microsoft SQL Server and MySQL databases through ODBC. The migration process
creates connection objects for these databases, but you must ensure that the ODBC Data Source has been
created in the Informatica 9.0.1 server environment.
For example, if Data Quality 8.6.2 connects to a Microsoft SQL Server connection through ODBC DSN
'MS_SQL_CONNECTION,' the connection object created also uses this name. If the ODBC DSN on the
server has a different name, edit the name of the DSN or the connection object so that they are consistent.
If you are using MySQL as a staging database in Data Quality 8.6.2, complete the following tasks before
migrating to Data Quality 9.0.1:
1.
Create a MySQL 5.0 database that is accessible to the Data Quality 9.0.1 server system.
2.
3.
In the Data Quality 9.0.1 connection manager, create a new ODBC connection object that uses this
ODBC DSN.
4.
Open the Migration.properties file and set the value of the Stage.MySQL property to the name of the
ODBC connection object.
When you run the server import process, plans that refer to the Data Quality 8.6.2 staging database will not
be configured to use the connection object specified by Stage.MySQL.
20
UsedOnly
Imports reference data from the migration package if the data is used by an imported mapping object.
You may want to prevent the import of unused reference data if your Data Quality 8.6.2 installation
contains many unused dictionaries.
None
Does not import any dictionaries.
21
Server installations
Use Workbench to export plan metadata to the file system on a Server repository machine. Run
ClientPackage.bat on the Server repository machine to create a compressed migration file that contains
the plan metadata and the Server reference data.
2.
3.
Description
-d
-f
Path to a folder that contains plans already exported from the Data Quality
repository. Use this parameter if you have used Workbench to export repository
contents to file. Do not use with the -r parameter.
-o
-r
Server repository export only. Specifies that ClientPackage.bat will run on a remote
Data Quality repository and extract plan and reference data to the Workbench file
system.
-s
The batch process creates the compressed migration file that contains the exported repository and
reference data files.
4.
22
Use Data Quality Workbench to export plans from the Data Quality Server repository.
Create the exported XML files on the Server repository machine.
2.
3.
4.
Description
-d
Optional. Path to the directory where the batch file creates the
MigrationPackage.zip.
-f
Path to the directory that contains plans exported from the Data Quality Server
repository. Do not use with the -r parameter.
-o
-r
-s
The batch process creates a MigrationPackage.zip file that contains the exported repository and data files.
Copy the compressed file that contains the exported repository objects and reference data to the Model
repository host machine.
The default name of the compressed repository objects file is MigrationPackage.zip.
2.
3.
Open the migration.properties file from the extracted files. Update migration.properties with the following
information:
23
Name of the project and folder to contain the user-defined reference data.
Name of the project and folder that contains the Informatica reference data.
The locale setting on the Data Quality Workbench that last edited the plans. If required, use the
Locale.Client property to set the locale.
4.
5.
6.
Description
-f
Required. Path to the folder that contains the compressed repository objects file.
-d
Optional. Specify an alternative Output folder for the mapping XML file.
-o
Optional. Specify a new name for the exported objects XML file.
-p
-s
The ServerImport batch process creates the XML that you import to the 9.0.1 Model repository and
staging area. The process creates the XML in a subfolder named Output in the folder that contains the
serverImport batch file.
7.
Review the ServerMigrationReport.html file and any other report files created by the ServerImport
process. Address any issues that arose during the process.
8.
9.
Open the Developer tool and import the mapping XML to the Model repository.
If the number of 8.6.2 plans is too large for a single XML file, ServerImport creates multiple files. In this
case, you must import the files in numerical order by file name, starting with the lowest-numbered file.
When the plans are imported, they appear as mappings in a folder in the Model repository. Any 8.6.2
transformations that convert to mapplets are also saved to a separate folder.
Note: The migration process creates connection objects for the databases that are used by the migrated
plans. Update the JDBC string information in the database connection objects.
24
CHAPTER 3
Overview, 25
Overview
Because some transformations function differently in Data Quality 9.0.1 than in Data Quality 8.6.2, you may
observe differences in mapping configuration and data output following migration. For example, the 9.0.1
Labeler transformation outputs some tokens differently than the 8.6.2 Token Labeler component.
Review the ServerMigrationReport.html file and associated report files to troubleshoot the effects of
migration on Data Quality 8.6.2 plans.
25
In Data Quality 9.0.1, some token label names have changed. The following table lists the changes:
Data Quality 8.6.2
codesymbol
code
num
number
text
word
A Data Quality 9.0.1 mapping does not preserve delimiters between tokens in the token stream. The
migration process replaces all delimiters with a space character.
Data Quality 9.0.1 does not support case-sensitive token labels. The following table shows how different
versions of Data Quality label the string seattle Seattle SEATTLE:
Data Quality 8.6.2
The migration process converts Token Labeler word outputs and Token Parser text outputs into word
tokens in the 9.0.1 Labeler and Parser transformations. However, the definition of a 9.0.1 word token is
more restrictive than 8.6.2 text output settings and word tokens. A mapping that uses the 9.0.1 word token
may have different output than the original plan.
26
Informatica reference data was not installed to the Data Quality 9.0.1 system.
The ServerImport process assumes that you have run the Content Installer to add Informatica reference
data to Data Quality 9.0.1. The ClientPackage process does not migrate an Informatica dictionary if the
dictionary data is present in the Content Installer file set or in an accelerator pack.
Run the Content Installer on the Data Integration Service machine, or on a machine that the Data
Integration Service can access, to install the reference data you need. Then run the ServerImport process.
A customized dictionary file was not present on the Data Quality 8.6.2 machine when the ClientPackage
process ran.
The ClientPackage process migrates any dictionary file that is not included in an Informatica reference
data set or accelerator pack. In this case, find the missing dictionary, add it to the dictionary folder on the
Data Quality 8.6.2 machine, and rerun the ClientPackage process.
27
Edit the Address Validator transformation and delete the bad links.
Edit the Address Validator transformation to use different ports and recreate the port links.
Create another Address Validator transformation in Data Quality 9.0.1, configure it in parse-only mode,
and connect the required data ports to the new transformation.
The following table lists the affected ports in Data Quality 8.6.2 and alternative ports you can use on a parseonly Address Validation transformation:
Data Quality 8.6.2 Port Name
GlobalAV_ParsedSuiteName
SubBuildingNumber1
GlobalAV_ParsedSuiteRange
SubBuildingName1
GlobalAV_ParsedPre_Direction
StreetPreDirectional1
GlobalAV_ParsedSuffix
StreetPostDescriptor1
GlobalAV_ParsedPost_Direction
StreetPostDirectional1
28
transformations. Although these transformations can merge repeated tokens, input that produces more than
five repeated tokens may result in output data that differs from Data Quality 8.6.2 plan output.
29