Control-M for IBM DataStage

IBM DataStage enables you to construct data handling jobs that automate both ETL (extract, transform and load) and ELT (extract, load and transform) processes.

Control-M for IBM DataStage enables you to do the following:

  • Connect to any supported IBM DataStage server from a single Control-M for IBM DataStage installation.

  • Connect to any supported database from a single host with secure login, which eliminates the need to provide authentication.

  • Manage IBM DataStage credentials in a secure connection profile.

  • Introduce all Control-M capabilities to Control-M for IBM DataStage, including advanced scheduling criteria, complex dependencies, resource pools, lock resources, and variables.

  • Integrate IBM DataStage jobs with other Control-M jobs into a single scheduling environment.

  • Monitor the status, results, and output of IBM DataStage jobs.

  • Attach an SLA job to the IBM DataStage jobs.

Preparing a Linux Host for IBM DataStage Job Execution

This procedure describes how to prepare a Linux host so that Control-M/Agent can run IBM DataStage jobs through Application Integrator.

For this procedure, use a dedicated service account with the minimum required permissions.

Before You Begin

  • Verify that IBM DataStage is installed on a Linux host.

  • Verify that the Control-M/Agent package is available.

  • Verify that you have Admin access to configure an IBM DataStage user.

Begin

  1. Create a dedicated operating system user for Control-M/Agent.

  2. Assign the user to the DataStage group and ensure that the user has permission to run the dsjob command.

  3. Install Control-M/Agent under that user.

  4. Before you run a job, initialize the IBM DataStage environment with the ../dsenv command.

  5. Validate the dsjob command as follows:
    Path_To_File/dsjob

  6. Run an IBM DataStage job as follows:
    Path_To_File/dsjob -run -jobstatus <project_name> <job_name>

  7. If command execution fails, verify missing library dependencies and update LD_LIBRARY_PATH, as needed.

Setting up Control-M for IBM DataStage

This procedure describes how to deploy the IBM DataStage plug-in, create a connection profile, and define an IBM DataStage job in Control-M SaaS and Automation API.

Before You Begin

  • Verify that Java is installed, as described in Control-M External Java Installation.

  • Verify that Automation API is installed, as described in Setting Up the API.

  • Verify that Agent version 9.0.21.080 or higher is installed.

Begin

  1. Do one of the following:

    • Install: Run one of the following provision image commands:

      • Linux: ctm provision image datastage_linux_plugin.Linux

      • Windows: ctm provision image datastage_windows_plugin.Windows

    • Upgrade: Run the following command:

      ctm provision agent::update

  2. Create an IBM DataStage connection profile in Control-M SaaS or Automation API, as follows:

  3. Define an IBM DataStage job in Control-M SaaS or Automation API, as follows:

To remove this plug-in from an Agent, see Removing a Plug-in. The plug-in ID for Linux is DSL122024 and the plug-in ID for Windows is DSW122024.

Change Log

The following table provides details about changes that were introduced in new versions of this plug-in:

Plug-in Version

Details

1.0.01

Added return code handling of codes 21 and 255 (on Linux) or -2 (on Windows) in the Run RESET execution step.

1.0.00

Initial version