A guide to building long-term compounding knowledge infrastructure. See details on GitHub .

Airbyte

Open-source data integration platform for building ELT/ETL pipelines from APIs, databases, and files to warehouses and lakes.

Overview

Airbyte is an open-source data integration platform that simplifies building ELT/ETL pipelines from various sources to warehouses, lakes, or other destinations. It offers a large connector ecosystem, low-code connector development, and both self-hosted and cloud deployment options.

Key features

  • Extensive connector catalog for APIs, databases, and files.
  • Low-code connector builder and CDK for custom connectors.
  • Cloud and self-hosted deployment with monitoring and governance.

Use cases

  • Centralizing logs, events and business data from many sources.
  • Building continuous ingestion into data lakes and warehouses.
  • Quick proofs-of-concept and migrations for data teams.

Technical notes

Modular architecture with SDKs and integrations; Airbyte integrates with orchestration tools (Airflow, Dagster, Prefect) and fits into modern data platform pipelines.

Comments

Airbyte
Resource Info
🌱 Open Source 💾 Data 🔗 Connector