Control-M for Dataiku

Dataiku is a centralized data science and machine learning platform that enables teams to build, execute, and maintain data pipelines and machine learning models.

Control-M for Dataiku enables you to do the following:

  • Trigger and monitor Dataiku jobs, automation scenarios, and dataset rule computations.

  • Manage Dataiku credentials in a secure connection profile.

  • Connect to any Dataiku endpoint.

  • Apply all Control-M capabilities to Dataiku jobs, including advanced scheduling criteria, complex dependencies, resource pools, lock resources, and variables.

  • Integrate Dataiku jobs with other Control-M jobs into a single scheduling environment.

  • Monitor the status, results, and output of Dataiku jobs.

  • Attach an SLA job to your Dataiku jobs.

Setting up Control-M for Dataiku

This procedure describes how to deploy the Dataiku plug-in, create a connection profile, and define a Dataiku job in Control-M SaaS or Automation API.

Before You Begin

  • Verify that Java is installed, as described in Control-M External Java Installation.

  • Verify that Automation API is installed, as described in Setting Up the API.

  • Verify that Agent version 9.0.21.080 or higher is installed.
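The Agent version prerequisite can be checked with a simple comparison. The following Python sketch is illustrative only: the helper names parse_version and meets_minimum are hypothetical, and it assumes that Agent versions compare numerically component by component (for example, 9.0.21.200 is higher than 9.0.21.080). How you obtain the installed version string is outside the scope of the sketch.

```python
from typing import Tuple

# Minimum Agent version taken from the prerequisite list above.
MINIMUM_AGENT_VERSION = "9.0.21.080"


def parse_version(version: str) -> Tuple[int, ...]:
    """Split a dotted version string into integer components (hypothetical helper)."""
    return tuple(int(part) for part in version.strip().split("."))


def meets_minimum(installed: str, minimum: str = MINIMUM_AGENT_VERSION) -> bool:
    """Return True if the installed version is at or above the minimum.

    Assumption: versions are ordered numerically, component by component.
    """
    return parse_version(installed) >= parse_version(minimum)


if __name__ == "__main__":
    # Sample version strings for illustration only.
    for candidate in ("9.0.21.080", "9.0.21.200", "9.0.20.200"):
        print(candidate, meets_minimum(candidate))
```

Comparing integer tuples rather than raw strings avoids ordering mistakes when components have different digit counts.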

Begin

  1. Do one of the following:

    • Install: Run one of the following provision image commands:

      • Linux: ctm provision image Dataiku_plugin.Linux

      • Windows: ctm provision image Dataiku_plugin.Windows

    • Upgrade: Run the following command:

      ctm provision agent::update

  2. Create a Dataiku connection profile in Control-M SaaS or Automation API.

  3. Define a Dataiku job in Control-M SaaS or Automation API.

To remove this plug-in from an Agent, see Removing a Plug-in. The plug-in ID is DKU052026.
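If you script the installation in step 1, the choice between the Linux and Windows image can be made from the host operating system. The following Python sketch is a minimal illustration, not part of the product: the function name provision_command is hypothetical, the image names are copied from step 1, and the sketch only builds the command line. Running it (for example, with subprocess.run) assumes that the ctm CLI is installed and configured on the host.

```python
import platform
from typing import List, Optional

# Image names copied from step 1 of the procedure.
PLUGIN_IMAGES = {
    "Linux": "Dataiku_plugin.Linux",
    "Windows": "Dataiku_plugin.Windows",
}


def provision_command(os_name: Optional[str] = None) -> List[str]:
    """Build the argument list for the install command (hypothetical helper).

    If os_name is omitted, the current host OS is detected with platform.system().
    """
    os_name = os_name or platform.system()
    if os_name not in PLUGIN_IMAGES:
        raise ValueError(f"No Dataiku plug-in image is listed for {os_name!r}")
    return ["ctm", "provision", "image", PLUGIN_IMAGES[os_name]]


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch has no side effects.
    print(" ".join(provision_command("Linux")))
```

Returning an argument list instead of a single string lets the caller pass it directly to a process launcher without shell quoting.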

Change Log

The following table describes changes that were introduced in new versions of this plug-in.

Plug-in Version    Details
1.0.00             Initial version