Course Overview
This course is designed for beginners in DataStage as well as for those who have worked int DataStage thick client. It begins with introducing the new interface for DataStage: the DataStage Flow. The course covers commonly used stages, cloud connecters and advanced connectors available in DataStage on IBM Cloud Pak for Data (CP4D).
The course has Lab Exercises for the Units.
Audience
DataStage beginner or DataStage users who have used the thick client and want to use the Cloud Pak for Data for DataStage.
Prerequisites
- Understanding of Databases and ETL
- Basic Familiarity with Cloud services.
Topics
Day 1
- Unit 1: Overview of Cloud Pak for Data
- Introduction of CP4D
- Login
- Home Page
- Profile and Settings
- Services Catalog
- CP4D Architecture
- Introduction of CP4D
- Unit 2: Transforming Data using DataStage on CP4D
- Create DataStage Project
- Create DataStage Parallel Job
- Save and Compile DataStage Job
- Run and monitor DataStage job
- Configuration Files for DataStage on CP4D
- Import and Export DataStage job
- Add Environment Variables
- Job run settings.
- Asset browser
- Export and Import Project
- Clone a job
- Unit 3: Processing Stages
- Merge Stage
- Join Stage
- Sort Stage
- Transformer Stage
- Remove Duplicates Stage
Day 2
- Unit 4: Amazon S3 Connector
- Unit 5: Amazon RedShift Connector
- Unit 6: Microsoft Azure Connector
Day 3
- Unit 7: Google Big Query Connector
- Unit 8: Snowflake Connector
- Unit 9: Checksum Stage
Day 4
- Unit 10: Change Capture and Change Apply
- Unit 11: Dataset
- Unit 12: HTTP Connector
- Unit 13: ODBC Connector
- Unit 14: Compress
- Unit 15: Expand
- Unit 16: Decode
- Unit 17: Encode
Day 5
- Administration:
- Cluster administration
- Scaling services
- Add nodes to your Cloud Pak for Data cluster
- Backup and restore your deployment
- Backup and restore service list
- Backing up and restoring an entire deployment
- Backing up and restoring volumes
- Migrating Cloud Pak for Data metadata and clusters
- Cluster administration
- Cloud Pak for Data platform administration
- Managing users
- Connecting to your identity provider
- Predefined roles and permissions
- Managing roles and user groups
- Importing JDBC drivers for data sources
- Monitoring the platform
- Managing storage volumes
- Gathering diagnostic information
- Customizing the platform
- Managing users
Duration: 5 Days(5 hours a day)
Time: 8 AM EET
Delivery Method: Virtual
Language: English
Course ID: NFCP4D-DS4.0-EET09
Price: $2500
For a Group Training Contact Us
For further details and inquiries about training programs, please get in touch with us.