InfoSphere QualityStage Essentials v11.5 - KM213G

InfoSphere QualityStage Essentials v11.5 - KM213G

Audience

This course is intended for:

  • Data Analysts responsible for data quality using QualityStage
  • Data Quality Architects
  • Data Cleansing Developers

Prerequisites

Participants should have:

  • Familiarity with the Windows operating system
  • Familiarity with a text editor

Helpful, but not required, would be some understanding of elementary statistics principles such as weighted averages and probability.

Duration

4 days.

Course Objectives

This course teaches how to build QualityStage parallel jobs that investigate, standardize, match, and consolidate data records. Students will gain experience by building an application that combines customer data from three source systems into a single master customer record.

Course objectives include:

  • List the common data quality contaminants
  • Describe each of the following processes:
    • Investigation
    • Standardization
    • Match
    • Survivorship
  • Describe QualityStage architecture
  • Describe QualityStage clients and their functions
  • Import metadata
  • Build and run DataStage/QualityStage jobs, review results
  • Build Investigate jobs
  • Use Character Discrete, Concatenate, and Word Investigations to analyze data fields
  • Describe the Standardize stage
  • Identify Rule Sets
  • Build jobs using the Standardize stage
  • Interpret standardization results
  • Investigate unhandled data and patterns
  • Build a QualityStage job to identify matching records
  • Apply multiple Match passes to increase efficiency
  • Interpret and improve match results
  • Build a QualityStage Survive job that will consolidate matched records into a single master record
  • Build a single job to match data using a Two-Source match

Course Content

Data Quality Issues
Listing the common data quality contaminants
Describing data quality processes

QualityStage Overview
Describing QualityStage architecture
Describing QualityStage clients and their functions

Developing with QualityStage
Importing metadata
Building DataStage/QualityStage Jobs
Running jobs
Reviewing results

Investigate
Building Investigate jobs
Using Character Discrete, Concatenate, and Word Investigations to analyze data fields
Reviewing results

Standardize
Describing the Standardize stage
Identifying Rule Sets
Building jobs using the Standardize stage
Interpreting standardize results
Investigating unhandled data and patterns

Match
Building a QualityStage job to identify matching records
Applying multiple Match passes to increase efficiency
Interpreting and improving Match results

Survive
Building a QualityStage survive job that will consolidate matched records into a single master record

Two-Source Match
Building a QualityStage job to match data using a reference match

Virtual Courses

ALL of our courses can be delivered virtually. And our Bath public schedule of courses are now available as live virtual sessions, using the popular Zoom Virtual Classroom and remote labs. Delegates can test their access at: www.zoom.us/test

On-Site Courses

Can't attend one of our public classes? Booking for multiple people?

All our courses are available on your site! Delivered for your staff, at your premises.

Contact us to find out more...