DATA STAGE ONLINE TRAINING

0
DATA STAGE COURSE CONTENT
Unit -1: Data Warehouse Fundamentals


An introduction to Data Warehousing
Purpose of Data Warehouse
Data Warehouse Architecture
Operational Data Store
OLTP Vs Warehouse Applications
Data Marts
Data marts Vs Data Warehouses
Data Warehouse Life cycle.

Unit -2: Data Modeling


Introduction to Data Modeling
Entity Relationship model (E-R model)
Data Modeling Data WarehouseNormalization process
Dimensions and fact tables  
Star Schema and Snowflake Schemas.

Unit -3: ETL Design Process


Introduction to Extraction,Transformation & Loading
Types of ETL Tools
Key tools in the market.

Unit – 4: Introduction to Data stage Version 7.5x2 & 8.1& 8.5


Data stage introduction
IBM information Server architecture
Data stage components
Data Stage main functions
Client components- Adding different .
Servers to our workspace
Unit – 5:  Data stage Administrator


Data stage project Administration
Editing projects and Adding Projects
Deleting projects Cleansing up project files
Environmental Variables
Environment management  
Auto purging
Runtime Column Propagation(RCP)
Add checkpoints for sequencer
NLS configuration
Generated OSH (Orchestra Engine)
System formats like data, timestamp
Projects protect – Version details.

Unit – 6:  Data stage Director


Introduction to Data stage Director
Validating Data stage Jobs
Executing Data stage jobs  
Job execution status
Monitoring a job
Job log view  
Job scheduling
Creating Batches
Scheduling batches.

Unit – 7:  Data stage Designer


Introduction to Data stage Designer
Importance of Parallelism
Pipeline Parallelism
Partition Parallelism
Partitioning and collecting(In depth coverage of
partitioning and collective techniques)
Symmetric Multi Processing (SMP)
Massively Parallel Processing (MPP)
Introduction to Configuration file
Editing a Configuration file
Partition techniques  
Data stage Repository Palette  
Passive and Active stages  
Job design overview
Designer work area
Annotations
Creating jobs
Importing flat file definitions
Managing the Metadata environment
Dataset management
Deletion of Dataset  
Routines

Unit – 8:  Working with Parallel Job Stages


Database Stages
Oracle
ODBC
 Dynamic RDBMS
File Stages
Sequential file
Dataset
File set  
Lookup file se
Processing Stages
Copy
Filter
Funnel
Sort
Remove duplicate
Aggregator
Switch
Pivot stage
Lookup
Join
Merge
Difference between look up, join and merge
Change capture
External Filter
Surrogate key generator
Transformer
Real time scenarios using different Processing Stages -
 Implementing different logics using Transformer
Debug Stages
Head
Tail
Peek
Column generator
Row generator
Write Range Map Stage
Real Time Stages
XML input 
XML output
Local and Shared containers
Routines creation
Extensive usage of Job parameters, Parameter Sets,
Environmental variables in jobs
Introduction to predefined Environmental variables
creating user defined Environmental variables and
implementing the same in parallel jobs
Unit – 9:  Advanced Stages in Parallel Jobs (Version 8.1)


Explanation of Type1 and Type 2 processes
Implementation of Type1 and Type2 logics using
Change Capture stage and SCD Stage
Range Look process  
Surrogate key generator stage
FTP stage  
Job performance analysis
Resource estimation
Performance tuning

Unit – 10: Job Sequencers


Arrange job activities in Sequencer
Triggers in Sequencer
Restablity
 Recoverability
Notification activity  
Terminator activity  
Wait for file activity
Start Loop activity
Execute Command activity
Nested Condition activity
Exception handling activity  
User Variable activity
End Loop activity
Adding Checkpoints
Jobs used in different real time scenarios.
Explanation of Sequence Job stages through
different Jobs
Unit – 11: IBM Information Server Administration Guide


IBM Web Sphere Data stage administration
Opening the IBM Information Server Web console –
Setting up a project ion the console
Customizing the project dashboard
Setting up security  
Creating users in the console
Assigning security roles to users and groups
Managing licenses
Managing active sessions
Managing logs
Managing schedules
Backing up and restoring IBM Information Server.
Additional Features
Performance Tuning of Parallel Jobs.
Data stage Installation process and setup.
Project Explanation.


0 comments: