Event-driven analytics pipeline
ETL with Kafka, Debezium, Apache Flink, and Python. Real-time streaming into ClickHouse and MongoDB.
- Kafka
- Flink
- ClickHouse
- Microservices
Problem
Conicle needed real-time analytics from operational databases without blocking production or building one-off ETL jobs. Data had to flow into analytics storage (ClickHouse, MongoDB) with low latency and clear ownership.
System design
- CDC: Debezium captured row-level changes from the source databases and published them to Kafka topics.
- Stream processing: Apache Flink jobs consumed the CDC topics, transformed and enriched events, and wrote them to ClickHouse and MongoDB (see the sketch after this list).
- Python: Supporting services and glue code, chosen for flexibility and collaboration with the data team.
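
A minimal PyFlink Table API sketch of the Flink step, assuming a Debezium-formatted `conicle.public.orders` topic, illustrative column names, and a ClickHouse sink reached through Flink's JDBC connector. The topic, schema, and connection settings are placeholders rather than the production configuration, and the sink assumes a ClickHouse-capable JDBC driver and dialect (or a dedicated ClickHouse connector) is available on the job classpath.

```python
# Sketch: consume a Debezium CDC topic, project/enrich, and upsert into ClickHouse.
# Topic, table, and connection names are illustrative, not the production values.
from pyflink.table import EnvironmentSettings, TableEnvironment

t_env = TableEnvironment.create(EnvironmentSettings.in_streaming_mode())

# Source: Debezium change events from Kafka, decoded with the debezium-json format.
t_env.execute_sql("""
    CREATE TABLE orders_cdc (
        order_id BIGINT,
        tenant_id STRING,
        amount DECIMAL(18, 2),
        updated_at TIMESTAMP(3)
    ) WITH (
        'connector' = 'kafka',
        'topic' = 'conicle.public.orders',
        'properties.bootstrap.servers' = 'kafka:9092',
        'properties.group.id' = 'analytics-pipeline',
        'scan.startup.mode' = 'earliest-offset',
        'format' = 'debezium-json'
    )
""")

# Sink: ClickHouse via JDBC; the primary key lets the changelog stream be applied as upserts.
t_env.execute_sql("""
    CREATE TABLE orders_analytics (
        order_id BIGINT,
        tenant_id STRING,
        amount DECIMAL(18, 2),
        updated_at TIMESTAMP(3),
        PRIMARY KEY (order_id) NOT ENFORCED
    ) WITH (
        'connector' = 'jdbc',
        'url' = 'jdbc:clickhouse://clickhouse:8123/analytics',
        'table-name' = 'orders_analytics'
    )
""")

# Route the transformed stream into the analytics sink.
t_env.execute_sql("""
    INSERT INTO orders_analytics
    SELECT order_id, tenant_id, amount, updated_at
    FROM orders_cdc
""").wait()
```

A second `INSERT` against a MongoDB sink table would follow the same pattern for the document-oriented consumers.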
Architecture
- Microservices owned their events; a central pipeline consumed them and routed them by use case.
- Multi-tenant isolation via tenant IDs in every payload and tenant-partitioned sinks (see the producer sketch after this list).
- Monitoring and alerting on lag, throughput, and error rates.
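
To illustrate the tenant-isolation point, a hedged sketch of how a producing service can key events by tenant ID so each tenant's events land in a consistent Kafka partition and downstream sinks can partition on the same field. The `analytics.events` topic, the event shape, and the use of confluent-kafka are assumptions for the example.

```python
# Sketch: key enriched events by tenant_id so Kafka partitioning (and
# downstream sink partitioning) stays consistent per tenant.
# Topic name and event schema are illustrative.
import json
from confluent_kafka import Producer

producer = Producer({"bootstrap.servers": "kafka:9092"})

def publish_event(event: dict) -> None:
    """Publish an analytics event keyed by its tenant ID."""
    tenant_id = event["tenant_id"]  # every payload carries the tenant ID
    producer.produce(
        topic="analytics.events",
        key=tenant_id,                       # same tenant -> same partition
        value=json.dumps(event).encode(),
    )

publish_event({"tenant_id": "acme", "type": "course_completed", "user_id": 42})
producer.flush()
```

Keying by tenant also preserves per-tenant ordering within a partition, which simplifies deduplication and late-event handling downstream.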
Impact
- Single pipeline for multiple analytics consumers; reduced duplicate ETL and ad-hoc scripts.
- Near real-time dashboards and reporting.
- Clear ownership and scalability per topic and sink.