Control-M for GCP Dataplex
GCP Dataplex is an extract, transform, and load (ETL) service that enables you to visualize and manage data in GCP BigQuery and the cloud.
Control-M for GCP Dataplex enables you to do the following:
-
Execute any of the following job actions:
-
Data Quality Task: Executes a predefined data quality task in GCP BigQuery or Google Cloud Storage locations, and defines data controls in BigQuery environments.
-
Custom Spark Task: Executes a predefined, scheduled Apache Spark task to analyze and process your data.
-
Data Profiling Scan: Executes a predefined data scan to identify shared statistical characteristics between BigQuery tables.
-
Data Quality Scan: Executes a predefined data quality scan that validates your data and logs alerts when the data fails validation.
-
-
Manage GCP Dataplex credentials in a secure connection profile.
-
Connect to any GCP Dataplex endpoint.
-
Introduce all Control-M capabilities to Control-M for GCP Dataplex, including advanced scheduling criteria, complex dependencies, Resource Pools, Lock Resources, and variables.
-
Integrate GCP Dataplex jobs with other Control-M jobs into a single scheduling environment.
-
Monitor the status, results, and output of GCP Dataplex jobs.
-
Attach an SLA job to the GCP Dataplex jobs.
-
Run 50 GCP Dataplex jobs simultaneously per Agent.
Setting up Control-M for GCP DataplexLink copied to clipboard
This procedure describes how to deploy the GCP Dataplex plug-in, create a connection profile, and define a GCP Dataplex job in
Before You Begin
-
Verify that Automation API is installed, as described in Setting Up the API.
-
Verify that Agent version 9.0.21.080 or later is installed.
-
On the Agent host, run one of the following commands to set the Java environment variable:
-
Linux:
-
Bourne shell/bash: export BMC_INST_JAVA_HOME=<java_11_directory>
-
csh/tcsh: setenv BMC_INST_JAVA_HOME <java_11_directory>
-
-
Windows: set BMC_INST_JAVA_HOME="<java_11_directory>"
-
-
Run one of the following API commands:
-
To install, type one of the following provision image commands:
-
Linux: ctm provision image GCP_Dataplex_plugin.Linux
-
Windows: ctm provision image GCP_Dataplex_plugin.Windows
-
-
To upgrade, type the following command:
ctm provision agent::update
-
-
Create a GCP Dataplex connection profile in Control-M SaaS or Automation API, as follows:
-
Control-M SaaS: Create a Centralized Connection Profile with GCP Dataplex Connection Profile Parameters
-
Automation API: ConnectionProfile:GCP Dataplex
-
-
Define a GCP Dataplex job in Control-M SaaS or Automation API, as follows:
-
Control-M SaaS: Create a Job with GCP Dataplex Job parameters
-
Automation API: Job:GCP Dataplex
-
To remove this plug-in from an Agent, see Removing a Plug-in. The plug-in ID is GDQ112023.
Change LogLink copied to clipboard
The following table provides details about changes that were introduced in new versions of this plug-in:
Plug-in Version |
Details |
---|---|
1.0.00 |
Initial version |