> ## Documentation Index
> Fetch the complete documentation index at: https://docs.open-metadata.org/llms.txt
> Use this file to discover all available pages before exploring further.

# Data Profiler | OpenMetadata Data Profiling Guide

> Explore profiling workflows including histogram metrics, null counts, and field-level health.

# Overview of Data Profiler

The profiler in OpenMetadata helps to understand the shape of your data and to quickly validate assumptions. The data profiler helps to capture table usage statistics over a period of time. This happens as part of profiler ingestion. Data profiles enable you to check for null values in non-null columns, for duplicates in a unique column, etc. You can gain a better understanding of column data distributions through the descriptive statistics provided.

Watch the video to understand OpenMetadata's native Data Profiler and Data Quality tests.

<iframe width="800" height="450" src="https://www.youtube.com/embed/gLdTOF81YpI?start=0&end=4090" frameBorder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowFullScreen />

<CardGroup cols={2}>
  <Card title="Profiler Tab" href="/v1.12.x/how-to-guides/data-quality-observability/profiler/tab">
    Get a complete picture of the Table Profile and Column Profile details.
  </Card>

  <Card title="Profiler Workflow" href="/v1.12.x/how-to-guides/data-quality-observability/profiler/profiler-workflow">
    Configure and run the Profiler Workflow to extract Profiler data.
  </Card>

  <Card title="Metrics" href="/v1.12.x/how-to-guides/data-quality-observability/profiler/metrics">
    Learn about the supported profiler metrics.
  </Card>

  <Card title="Auto Pii Tagging" href="/v1.12.x/how-to-guides/data-quality-observability/profiler/auto-pii-tagging">
    Automatically detect and tag columns containing sensitive PII data during profiling.
  </Card>

  <Card title="Spark Engine" href="/v1.12.x/how-to-guides/data-quality-observability/profiler/spark-engine">
    Use distributed processing with Apache Spark for large-scale data profiling.
  </Card>
</CardGroup>
