Codebase Deep Dives

These guides explain how key OpenMetadata modules work under the hood. They are written for contributors who want to understand the call chains, class hierarchies, and design patterns before diving into the code.

Data Quality — Test definitions, validators, execution pipeline, and how to add new tests
Profiler — Metrics system, computation engine, threading model, and how to add new metrics
Sampler — Sampling strategies, database-specific implementations, and integration with profiler and data quality
Metadata Ingestion — Topology-based execution, source hierarchy, and how connectors extract metadata
Usage Ingestion — Query log collection, SQL parsing, aggregation, and publishing
Lineage Ingestion — SQL parsing, graph analysis, column-level lineage, and cross-database lineage
