main-concepts

No menu items for this category

Container

This schema defines the Container entity. A Container is an abstraction for any path(including the top level eg. bucket in S3) storing data in an Object store such as S3, GCP, Azure. It maps a tree-like structure, where each Container can have a parent and a list of sub-folders, and it can be structured - where it contains structured data, or unstructured where no schema for its data is defined.

  • containerDataModel (object): This captures information about how the container's data is modeled, if it has a schema. . Cannot contain additional properties.
    • isPartitioned (boolean): Whether the data under this container is partitioned by some property, eg. eventTime=yyyy-mm-dd. Default: false.
    • columns (array, required): Columns belonging to this container's schema.
  • fileFormat (string): This schema defines the file formats for the object/files within a container. Must be one of: ["zip", "gz", "zstd", "csv", "tsv", "json", "parquet", "avro"].

Documentation file automatically generated at 2025-01-15 09:05:25.266839+00:00.