Skip to main content

Codebase Deep Dives

These guides explain how key OpenMetadata modules work under the hood. They are written for contributors who want to understand the call chains, class hierarchies, and design patterns before diving into the code.
  • Data Quality — Test definitions, validators, execution pipeline, and how to add new tests
  • Profiler — Metrics system, computation engine, threading model, and how to add new metrics
  • Sampler — Sampling strategies, database-specific implementations, and integration with profiler and data quality
  • Metadata Ingestion — Topology-based execution, source hierarchy, and how connectors extract metadata
  • Usage Ingestion — Query log collection, SQL parsing, aggregation, and publishing
  • Lineage Ingestion — SQL parsing, graph analysis, column-level lineage, and cross-database lineage