how-to-guides

No menu items for this category
OpenMetadata Documentation

Advanced Usage

This guide covers advanced patterns and configurations for Data Quality as Code, including loading tests from YAML files, customizing workflow configurations, and integrating with production systems.

You can load test definitions from YAML workflow files, enabling version-controlled test configurations:

By default, from_yaml() uses the connection configured via configure(). To use the connection from the YAML file:

A complete YAML configuration includes:

Inspect configured tests before running:

Results can be published back to OpenMetadata for tracking, alerting, and visualization:

  • Historical tracking: View trends over time
  • Alerting: Trigger notifications on failures
  • Dashboards: Centralized data quality monitoring
  • Collaboration: Share results across teams
  • Compliance: Maintain audit trails

Implement robust error handling:

Generate tests programmatically based on metadata:

Validate multiple tables in a workflow:

  1. Version control test configurations: Store YAML configs in git
  2. Use environment variables: Never hardcode credentials
  3. Implement retries: Handle transient failures gracefully
  4. Publish results: Enable tracking and alerting in OpenMetadata
  5. Monitor execution: Track metrics for test runs
  6. Handle errors explicitly: Don't silently swallow failures
  7. Document tests: Use descriptive names and descriptions
  8. Validate incrementally: Test early and often in pipelines
  9. Separate concerns: Let data stewards define tests, engineers execute them
  10. Test your tests: Ensure test definitions are correct