A guide to building long-term compounding knowledge infrastructure. See details on GitHub .

Dagster

A cloud-native orchestration and development platform for data assets, with strong observability and a developer-friendly programming model.

Overview

Dagster is an orchestration platform and development framework for data assets. It emphasizes testability, observability, and a modular development workflow, helping teams define, run, and monitor data assets with robust lineage and tooling.

Key features

  • Declarative assets and graph models for organizing data logic.
  • Built-in observability, logging and lineage visualization.
  • Large integration ecosystem and pluggable execution backends.

Use cases

  • Data engineering and ML workflow orchestration at scale.
  • Bringing data assets into standard software engineering practices.
  • Centralized scheduling, monitoring, and governance of data tasks.

Technical notes

Dagster is Python-native, supports containerized deployments, and integrates with CI/CD pipelines and third-party tools to provide an engineering-first data platform experience.

Comments

Dagster
Resource Info
🌱 Open Source 📱 Application 🛠️ Dev Tools