
KafkaConnect
PRODIn this section, we provide guides and references to use the KafkaConnect connector.
Configure and schedule KafkaConnect metadata and profiler workflows from the OpenMetadata UI:
Requirements
Python Requirements
To run the KafkaConnect ingestion, you will need to install:
Metadata Ingestion
All connectors are defined as JSON Schemas. Here you can find the structure to create a connection to KafkaConnect.
In order to create and run a Metadata Ingestion workflow, we will follow the steps to create a YAML configuration able to connect to the source, process the Entities if needed, and reach the OpenMetadata server.
The workflow is modeled around the following JSON Schema
1. Define the YAML Config
This is a sample config for KafkaConnect:
Source Configuration - Service Connection
hostPort: The hostname or IP address of the Kafka Connect worker with the REST API enabled
verifySSL: Whether SSL verification should be perform when authenticating.
Kafka Connect Config: OpenMetadata supports username/password or no Authentication.
Basic Authentication - Username: Username to connect to Kafka Connect. This user should be able to send request to the Kafka Connect API and access the Rest API GET endpoints. - Password: Password to connect to Kafka Connect.
messagingServiceName: Name of the Kafka Messaging Service associated with this KafkaConnect Pipeline Service. e.g. local_kafka.