Integrate and transform all your data for ClickHouse

Open-Source
Apache Kafka
Kafka is a distributed event streaming platform designed for high-throughput, fault-tolerant data pipelines. It enables real-time data integration, messaging, and stream processing across systems.

Open-Source
Confluent
Confluent is a managed platform built around Kafka that simplifies operating and scaling streaming data infrastructure. It adds enterprise features like governance, connectors, and cloud-native deployment.

Enterprise
OpenTelemetry (OTEL)
OpenTelemetry is an open standard for collecting traces, metrics, and logs from distributed systems. It provides a unified way to instrument applications and export telemetry data to observability backends.

Enterprise
Webhook (HTTP)
A webhook is an HTTP-based event delivery mechanism where external systems send real-time data to a specified endpoint. It enables push-based integrations by triggering automated data ingestion whenever events occur, without the need to poll APIs.
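As a rough illustration of the push model, here is a minimal Python sketch of a webhook receiver. It uses only the standard library. The names `ingest`, `WebhookHandler`, `serve`, and `EVENTS`, and the port, are hypothetical choices made for this example and do not describe the actual connector. The in-memory list stands in for whatever buffer a real pipeline would flush to ClickHouse.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical in-memory buffer; a real pipeline would flush batches downstream.
EVENTS = []

def ingest(body: bytes, buffer: list) -> int:
    """Parse one webhook payload, buffer it, and return the HTTP status to send."""
    try:
        event = json.loads(body)
    except ValueError:  # covers malformed JSON and undecodable bytes
        return 400
    if not isinstance(event, dict):
        return 400  # this sketch assumes one JSON object per request
    buffer.append(event)
    return 200

class WebhookHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # The sender pushes the event; no polling happens on this side.
        length = int(self.headers.get("Content-Length", 0))
        status = ingest(self.rfile.read(length), EVENTS)
        self.send_response(status)
        self.end_headers()

def serve(host: str = "127.0.0.1", port: int = 8080) -> None:
    """Start the receiver (assumed host and port); blocks until interrupted."""
    HTTPServer((host, port), WebhookHandler).serve_forever()
```

Calling `serve()` starts the listener; an external system would then POST JSON events to that address as they occur.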

Open-Source
Redpanda
Redpanda is a Kafka-compatible streaming platform designed for low-latency and high-performance workloads without JVM overhead. It simplifies operations by eliminating ZooKeeper and reducing infrastructure complexity.

Open-Source
WarpStream
WarpStream is a Kafka-compatible streaming system that decouples storage from compute using object storage like S3. It reduces operational overhead while maintaining Kafka APIs.

Open-Source
AWS MSK
Amazon MSK (Managed Streaming for Apache Kafka) is a fully managed Kafka service on AWS. It simplifies deployment, scaling, and maintenance of Kafka clusters.

Enterprise
PostHog
PostHog is an open-source product analytics platform that captures user events, sessions, and feature usage in real time. It enables teams to analyze behavior, run experiments, and build data-driven product insights without relying on third-party SaaS tools.

Enterprise
Apache Iceberg
Iceberg is a high-performance table format for data lakes that brings reliability and SQL-like consistency to large-scale analytics. It supports schema evolution, time travel, and efficient querying over massive datasets stored in object storage.

Enterprise
Google Pub/Sub
Pub/Sub is a fully managed messaging service by Google Cloud for real-time event ingestion and delivery. It enables scalable asynchronous communication between services using publish–subscribe patterns.

Enterprise
AWS Kinesis
Kinesis is a real-time data streaming service that collects, processes, and analyzes large volumes of data. It is commonly used for log ingestion, metrics, and event pipelines at scale.

Enterprise
Kinesis Firehose
Firehose is a fully managed delivery service that streams data directly into destinations like S3, Redshift, or OpenSearch. It simplifies ingestion pipelines by handling buffering, batching, and scaling automatically.

Enterprise
AWS CloudWatch
CloudWatch is AWS's monitoring and observability service that collects logs, metrics, and events from cloud resources. It provides alerting, dashboards, and insights into system performance and health.

Enterprise
PostgreSQL
PostgreSQL is an advanced open-source relational database known for reliability and extensibility. It supports complex queries, ACID transactions, and modern features like JSON and vector extensions.

Enterprise
MySQL
MySQL is a widely used open-source relational database optimized for web applications and transactional workloads. It is known for its simplicity, performance, and broad ecosystem support.

Enterprise
Amazon S3
S3 is a highly durable and scalable object storage service used as the backbone for data lakes. It stores structured and unstructured data and integrates with analytics and processing systems.

Enterprise
Azure Event Hubs
Event Hubs is a large-scale event ingestion service by Azure designed for streaming millions of events per second. It is commonly used for telemetry pipelines, logging, and real-time analytics.

Enterprise
MariaDB
MariaDB is an open-source relational database forked from MySQL and designed for performance and scalability. It offers enhanced storage engines and compatibility with the MySQL ecosystem.

Enterprise
Apache Cassandra
Cassandra is a distributed NoSQL database designed for high availability and scalability across multiple nodes. It is optimized for large-scale, write-heavy workloads with no single point of failure.

Enterprise
Airbyte
Airbyte is an open-source data integration platform that syncs data from various sources into warehouses and lakes. It supports ELT pipelines with a large library of connectors.

Enterprise
MongoDB
MongoDB is a NoSQL document-oriented database that stores data in flexible JSON-like documents. It enables rapid development and horizontal scaling for modern applications.

Enterprise
Supabase
Supabase is an open-source backend platform built on PostgreSQL, offering APIs, authentication, and real-time capabilities. It provides a Firebase-like experience with SQL-based data storage.

Enterprise
Neon
Neon is a serverless PostgreSQL platform that separates storage and compute for efficient scaling. It enables instant branching, autoscaling, and cost-efficient database operations.

Enterprise
Segment
Segment is a customer data platform that collects, unifies, and routes user event data to downstream systems. It simplifies analytics, marketing, and data infrastructure pipelines.

Enterprise
Google Cloud Storage (GCS)
GCS is a scalable object storage service for storing and retrieving large amounts of data. It is commonly used as a data lake backend for analytics and machine learning.

Enterprise
Delta Lake
Delta Lake is an open-source storage layer that adds ACID transactions and reliability to data lakes. It enables batch and streaming data processing with schema enforcement and versioning.

Enterprise
DynamoDB
DynamoDB is a fully managed NoSQL key-value and document database by AWS. It provides low-latency performance at any scale with automatic scaling and high availability.

Enterprise
Fluent Bit
Fluent Bit is a lightweight, high-performance log and metrics collector designed for cloud-native environments. It collects, processes, and forwards telemetry data from sources like containers and systems to destinations such as Kafka or ClickHouse.

Enterprise
Fluentd
Fluentd is an open-source data collector that aggregates logs and events from multiple sources into a unified pipeline. It provides flexible routing, transformation, and integration with hundreds of data backends.

Enterprise
Logstash
Logstash is a data processing pipeline that ingests, transforms, and forwards logs and events. It is commonly used in the Elastic ecosystem for centralized log management.

Enterprise
Grafana Alloy (Agent)
Grafana Alloy is a telemetry collector built on OpenTelemetry that gathers and forwards logs, metrics, and traces. It integrates tightly with the Grafana ecosystem for observability pipelines.

Request access to our connectors
Get instant access to our existing integrations or request a custom build.
If it’s not in our connector library yet, we’ll make it available for you within 10 days.

