Run Data Insights using Airflow SDK
1. Define the YAML Config
This is a sample config for Data Insights:
Source Configuration - Source Config
- To send the metadata to OpenMetadata, it needs to be specified as type: MetadataToElasticSearch.
Processor Configuration
- To send the metadata to OpenMetadata, it needs to be specified as type: data-insight-processor.
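As a rough sketch, the source and processor sections described above might sit in the YAML file as follows. Only the two type values come from the text above. The surrounding keys (the top-level source type, serviceName, and the empty processor config) are assumptions based on the usual layout of OpenMetadata ingestion configs, so check them against the version you have installed. The other sections of the file are omitted here.

```yaml
source:
  type: dataInsight            # assumed top-level source type
  serviceName: OpenMetadata    # assumed service name
  sourceConfig:
    config:
      type: MetadataToElasticSearch
processor:
  type: data-insight-processor
  config: {}                   # assumed: no extra processor options
```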
Workflow Configuration
The main property here is the openMetadataServerConfig, where you can define the host and security provider of your OpenMetadata installation.
For a simple, local installation using our docker containers, this looks like:
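A minimal sketch of such a workflowConfig section, assuming the default local endpoint and the built-in JWT-based provider. The hostPort value, the authProvider name, and the securityConfig layout are assumptions, not values taken from this page, and the token is a placeholder you need to replace with your own bot token.

```yaml
workflowConfig:
  openMetadataServerConfig:
    hostPort: http://localhost:8585/api   # assumed default for the local Docker setup
    authProvider: openmetadata            # assumed security provider
    securityConfig:
      jwtToken: "<bot JWT token>"         # placeholder, not a real token
```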
We support different security providers. You can find their definitions here. The different implementations of the ingestion are shown below.
2. Prepare the Data Insights DAG
Create a Python file in your Airflow DAGs directory with the following contents:
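The following is a minimal sketch of what such a DAG file could look like, not a definitive implementation. It assumes Airflow 2.4 or later, PyYAML, and an openmetadata-ingestion release that exposes DataInsightWorkflow under metadata.workflow.data_insight (older releases use a different import path). The DAG id, schedule, retry settings, and every config key other than the two type values from step 1 are illustrative assumptions.

```python
"""Airflow DAG that runs the OpenMetadata Data Insights workflow (sketch)."""
from datetime import datetime, timedelta

# Guarded so the module can also be imported outside Airflow,
# for example to unit-test the callable below.
try:
    from airflow import DAG
    from airflow.operators.python import PythonOperator
except ImportError:
    DAG = None

# YAML config from step 1. Keys other than the two `type` values under
# sourceConfig and processor are assumptions; adjust them to your setup.
CONFIG_YAML = """
source:
  type: dataInsight
  serviceName: OpenMetadata
  sourceConfig:
    config:
      type: MetadataToElasticSearch
processor:
  type: data-insight-processor
  config: {}
sink:
  type: elasticsearch        # assumed sink section
  config:
    es_host: localhost
    es_port: 9200
workflowConfig:
  openMetadataServerConfig:
    hostPort: http://localhost:8585/api
    authProvider: openmetadata
    securityConfig:
      jwtToken: "<bot JWT token>"
"""


def run_data_insights():
    """Parse the YAML config and run the Data Insights workflow once."""
    # Imported lazily so that DAG parsing stays cheap.
    import yaml  # PyYAML

    # Assumed import path; it differs between ingestion releases.
    from metadata.workflow.data_insight import DataInsightWorkflow

    workflow = DataInsightWorkflow.create(yaml.safe_load(CONFIG_YAML))
    try:
        workflow.execute()
        workflow.raise_from_status()  # fail the task if the workflow failed
    finally:
        workflow.stop()


if DAG is not None:
    with DAG(
        dag_id="data_insights_dag",  # hypothetical name
        description="OpenMetadata Data Insights workflow",
        default_args={"retries": 1, "retry_delay": timedelta(minutes=5)},
        start_date=datetime(2024, 1, 1),
        schedule="@daily",  # Airflow < 2.4 uses schedule_interval instead
        catchup=False,
    ) as dag:
        PythonOperator(
            task_id="run_data_insights",
            python_callable=run_data_insights,
        )
```

Keeping the workflow import inside the callable means the scheduler does not load the ingestion package every time it parses the file. Calling raise_from_status() marks the Airflow task as failed when the workflow reports errors.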