
Unity Catalog

An open, multimodal catalog for data and AI that provides unified governance, metadata management, and access control.

Overview

Unity Catalog centralizes metadata and governance for data and AI assets, so that policies are enforced and assets are discovered consistently across compute engines. It integrates with multiple table formats and query engines and targets enterprise-scale metadata management.
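The idea of one catalog enforcing the same policy for every engine can be illustrated with a toy model. This is a conceptual sketch only: `ToyCatalog`, `run_query`, the principal names, and the asset name are hypothetical and are not part of the Unity Catalog API.

```python
from dataclasses import dataclass, field
from typing import Dict, Set, Tuple


@dataclass
class ToyCatalog:
    """Hypothetical in-memory stand-in for a central catalog (not the real API)."""

    # Maps (principal, asset) to the set of privileges granted on that asset.
    grants: Dict[Tuple[str, str], Set[str]] = field(default_factory=dict)

    def grant(self, principal: str, asset: str, privilege: str) -> None:
        self.grants.setdefault((principal, asset), set()).add(privilege)

    def is_allowed(self, principal: str, asset: str, privilege: str) -> bool:
        return privilege in self.grants.get((principal, asset), set())


def run_query(engine: str, catalog: ToyCatalog, principal: str, asset: str) -> str:
    """Every engine asks the same catalog, so the decision never differs by engine."""
    if not catalog.is_allowed(principal, asset, "SELECT"):
        return f"{engine}: access denied for {principal} on {asset}"
    return f"{engine}: {principal} reads {asset}"


catalog = ToyCatalog()
catalog.grant("alice", "sales.orders", "SELECT")

# The same grant is honored by three different (simulated) engines.
results = [
    run_query(engine, catalog, "alice", "sales.orders")
    for engine in ("spark", "duckdb", "trino")
]
```

Because the grant lives in the catalog rather than in any one engine, revoking it in one place would change the outcome for all three engines at once.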

Key features

  • Support for multiple table formats (Delta, Iceberg, Hudi) and compute engines (Spark, DuckDB, Trino).
  • Unified governance, audit and access control capabilities.
  • CLI, SDKs and REST APIs for integration and automation.
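As a rough illustration of automation over the REST API, the sketch below lists table names in a schema. It rests on assumptions to check against the project's API reference: a server reachable at `http://localhost:8080`, the `/api/2.1/unity-catalog/` path prefix, a `tables` resource filtered by `catalog_name` and `schema_name`, and a JSON response with a `tables` array. The helper names `build_url`, `http_get`, and `list_table_names` are hypothetical.

```python
import json
from typing import Callable, List
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: a locally running server exposing the API under this prefix.
BASE_URL = "http://localhost:8080/api/2.1/unity-catalog/"


def build_url(resource: str, **params: str) -> str:
    """Join the base URL, a resource name, and optional query parameters."""
    query = f"?{urlencode(params)}" if params else ""
    return f"{BASE_URL}{resource}{query}"


def http_get(url: str) -> str:
    """Fetch a URL and return the response body as text."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8")


def list_table_names(
    catalog: str,
    schema: str,
    fetch: Callable[[str], str] = http_get,
) -> List[str]:
    """Return table names in catalog.schema; `fetch` is injectable for offline tests."""
    url = build_url("tables", catalog_name=catalog, schema_name=schema)
    payload = json.loads(fetch(url))
    # Assumption: the response carries a "tables" array of objects with a "name" key.
    return [table["name"] for table in payload.get("tables", [])]
```

With a server running, `list_table_names("unity", "default")` would issue the request; the catalog and schema names here are placeholders. Passing a stub as `fetch` keeps the parsing logic testable without a network connection.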

Use cases

  • Enterprise metadata governance and cataloging for data and AI assets.
  • Providing a consistent access layer across heterogeneous compute environments.
  • Discovery and reproducibility for data science teams.

Technical notes

Unity Catalog is an open-source project with components written in multiple languages. Its focus is interoperability and integration with governance tooling.

Resource Info
🌱 Open Source 💾 Data 🔗 Connector