You are on page 1of 10

4-15, 4-62 in pdf(my documents)

http://www.scribd.com/doc/20121/!"t"#t"$e-!esi$ner
http://pub%ib.bou%der.ibm.com/infocenter/iisinfs&/&'r0/inde(.)sp*
topic+/com.ibm.sw$.im.iis.producti,"tion.iisinfs&.o&er&iew.doc/topics/cisodsc%oser.ht
m%
http://et%-too%s.info/en/d"t"st"$e-tutori"%--001.d"t"st"$e-modu%es.htm
/erms
DataStage Administrator A tool used to configure DataStage projects and users.
DataStage Designer A graphical design tool used by the developer to design and develop a
DataStage job.
DataStage Director A tool used by the operator to run and monitor DataStage server jobs.
DataStage Manager A tool used to view and edit definitions in the Repository.
DataStage Package Installer A tool used to install packaged DataStage jobs and plug-ins.
job A collection of linked stages, data elements,and transforms that define how to etract,
cleanse, transform, integrate, and load data into a target database. !obs can either be server
jobs or mainframe jobs. data-flow (the job).
Join stage A mainframe processing stage or parallel job active stage that joins two input
sources.
Look! stage A mainframe processing stage and "arallel active stage that performs table
lookups
Data Set stage A parallel job stage. Stores a set of data.
"racle stage A parallel job stage that allows you to read and write an #racle database.
!arallel e#tender $he DataStage option that allows you to run parallel jobs.
!arallel job A type of DataStage job that allows you to take advantage of parallel processing on
S%", %"",and cluster systems.
!lg-in A definition for a plug-in stage.
!lg-in stage A stage that performs specific processing that is not supported by the standard
server job stages.
$)%e!ositor& A DataStage area where projects and jobs are stored as well as definitions for all
standard and user-defined data elements, transforms, and stages.
a)meta data Data about data, for eample, a table definition describing columns in
which data is
structured.
b)shared container A container which eists as a separate item in the Repository and
can be used by any server job in the project.
'ransform fnction A function that takes one value and computes another value from
it.
'ransformer (ditor A graphical interface for editing $ransformer stages.
'ransformer stage A stage where data is transformed &converted'
using transform functions.
/he !"t"#t"$e components :
Designer - used to create DataStage jobs which are compiled into executable
programs.It is a graphical, user-friendly application which applies visual data flow
method to develop job flows for extracting, cleansing, transforming, integrating
and loading data. Its a module mainly used by Datastage developers.
Director - manages running, validating, scheduling and monitoring DataStage
obs. Its mainly used by operators and testers.
Manager - it!s a main interface to the Datastage "epository, allows its browsing
and editing. It displays tables and files layouts, routines, transforms and
jobs defined in the project. It is mainly used to store and manage reusable
metadata.
Administrator - #dministers DataStage projects, manages global settings and
interacts with the system. #dministrator is used to specify general server
defaults, add and delete projects, set up project properties and provides
a command interface to the datastage repository.
$ith Datastage #dministrator users can set job monitoring limits, user
privileges, job scheduling options and parallel jobs default.
DIFFEE!"E SE#E A!D $AA%%E% &'(S
)http*++,,,.gee-intervie,.com+tal-+./.01datastage1parallel1jobs1vs1
datastage1server.html
%& 'he basic difference between server and parallel jobs is the degree of parallelism
Server job stages do not have in built partitoning and parallelism mechanism for
extracting and loading data between different stages.
23 Datastage server executed by datastage server environment
Datastage parallel executed (nder control of datastage runtime environment
)& Datastage Server compiled in to *#SI+,interpreted pseudo code&
Datastage -arallel compiled to .S/,.rchestrate Scripting 0anguage&.
1& 2ore -rocessing stages are not included in Server example, oin, +D+,
0oo3up etc4..
5& In 6ile stages, /ash file available only in Server and
+omplex falat file , dataset , loo3up file set avail in parallel only.
7& Server 'ransformer supports basic transforms only
parallel both basic parallel transforms.
8& Server transformer is basic language compatability
parallel transformer is c99 language compatabillity
:& 0oo3 up of se;untial file is possible in parallel jobs
$AA%%E% E45E!DE,-x& ta3es advantage of both pipeline parallelism and
partitoning paralellism.
$ipeline parallelism means that as soon as data is available between stages, in
pipes or lin3s&, it can be exchanged between them without waiting for the entire
record set to be read.
$artitioning parallelism means that entire record set is partitioned into small sets
and processed on different nodes,logical processors&. 6or example if there are %<<
records, then if there are 1 logical nodes then each node would process =5 records
each. 'his enhances the speed at which loading ta3es place to an ama>ing degree.
Imagine situations where billions of records have to be loaded daily. 'his is where
datastage -? comes as a boon for @'0 process and surpasses all other @'0 tools in
the mar3et.
httpABBwww.gee3interview.comB;uestionCdetailsB%=8):
$hat are outines and whereBhow are they written and have you
written any routines beforeD
Routines are stored in the Routines branch of the DataStage Repository,
where you can create, view or edit. The following are different types of
routines:
1) Transform functions
2) Before-after job subroutines
3) Job Control routines
'rchestrate #s Datastage $arallel E6tender7
Orchestrate itself is an ETL tool with etensive parallel processing
capabilities and running on !"#$ platfor%. Datastage used Orchestrate
with Datastage $E &'eta version of (.)* to incorporate the parallel
processing capabilities. "ow Datastage has purchased Orchestrate and
integrated it with Datastage $E and released a new version Datastage (.)
i.e +arallel Etender.
httpABBwww.s;uidoo.comBlearningdatastage
!ifference between 0outines "nd /r"nsformer is that both are same to pronounce but
0outines describes the 1usiness %o$ic and
/r"nsformer specifies that transform the data from one location to another by applyi
ng the changes by using transformation rules .
'y using Routines we can return values but by transfor%ers we cannot
return values.
httpABBwww.bidevs.comBarticle-%8-datastage-oracle-stage---extract-your-data-
faster.html
Data Sets and 6ile Sets
Epalin difference between Data Sets and ,ile Sets- where each one is
used and how they are used-
.hat is the si/e li%it-
'he dataset and fileset both are file stage.
Dataset can have single input lin3 or single output lin3 whereas 6ileset can have single input
lin3 or single output lin3 and a single reject lin3.
6ile set can executes only in parallel mode whereas dataset can be configured to execute
either in parallel or in se;uential mode.'he default si>e for both are 28(.
S'#E@ IF-(' 0IFG .('-(' 0IFG "@@+' 0IFG "@6@"@F 0FG
FI%E
S@H(@F'I#0 % I %I %
6I0@ S@' % I % I %
D#'# S@' % I % I
$'"ESSI!8
S."' % %
.IF F %
"@2 D(-0 % %
#EE"@E#'." % %
+.-J % F
+/#FE +#-' = %
+/#FE #--0J = %
0..G (- % % %,.-'I.F#0& %
DE#+DE(98
/@#D % %
'#I0 % %
-@@G % F ,+/G&
".$ E@F % ,ch3 n&
+.0 E@F % %
S#2-0@ % F
I - @ither
muslim female teachers jobs +hennai,goog india&
%&
'live International School "hennai * M9S%IM %AD: 5AI!ED 8AD9A5E
5EA";ES+M'!5.+D$$55 5AI!ED
<ebsite* http*++,,,.oliveinternationalschool
httpABBwww.indianjobtal3s.comBforumBshowthread.phpDtKL5)8
=& httpABBwww.jeevansathi.comBprofileBmatrimonial-=151):7M1.htm
httpABBwww.tamilmatrimony.comBmatrimonialsB"eligionB=B2uslim.html
;"% )21= :S ";E!!AI3
ob Description
S3illsA Strong wor3ing 3nowledge in Datastage N preferably Datastage -?O
Eood 3nowledge on @'0 +oncepts
$or3ing 3nowledge in (nix
Eood written and oral communication s3ills.

You might also like