Components

Airbyte

The ingestion and ETL component in ProAI.

Agentic Friendly

Component Category

Data ingestion and ETL

Component Description

Airbyte is the ingestion and synchronization component used in ProAI for moving data between source systems and target platforms. It brings a connector-based approach to data movement, which makes it easier to ingest and synchronize data without building custom pipelines for every source.

Why It Is Used

It gives the platform a repeatable way to connect external data sources, synchronize datasets, and feed analytical destinations without building custom ingestion logic for every integration. This is especially useful when teams need broad connector coverage, operationally consistent ELT flows, and a path for extending integrations over time.

Learn More

Interacts With

  • ClickHouse as one of the analytical destinations used in the platform
  • MinIO for object-storage-based data paths and related configuration
  • PostgreSQL for Airbyte state and metadata
  • Temporal as part of the orchestration setup used by the deployed service
  • Keycloak and related access controls for protected access patterns

On this page