Skip to content
Modern Lakehouse Concepts & Interoperability Last updated: May 29, 2026

Debezium CDC Engines

The standard open-source framework built on Apache Kafka Connect that monitors database transaction logs and translates change events into streaming messages.

debeziumdebezium enginechange data capture debeziumkafka connect debezium

Debezium CDC Engines

Debezium CDC Engines refer to the connector instances within the Debezium framework, an open-source, distributed platform for change data capture. Built primarily on top of Apache Kafka Connect, Debezium monitors source databases, captures inserts, updates, and deletes, and streams those changes as event messages to Kafka topics. This framework is a primary data source for real-time lakehouse ingestion pipelines.

How Debezium Connectors Work

Debezium deploys specialized connectors for different database systems (such as PostgreSQL, MySQL, SQL Server, MongoDB, and Oracle).

Ingestion Integration

Downstream stream engines (like Flink, Spark, or PyIceberg-based streaming services) ingest Debezium events from Kafka. Many ingestion libraries include native Debezium format parsers, which automatically translate Debezium’s before/after schema messages into Iceberg INSERT, UPDATE, and DELETE operations, simplifying the implementation of real-time transactional data lakes.

📚 Go Deeper on Apache Iceberg

Alex Merced has authored three hands-on books covering Apache Iceberg, the Agentic Lakehouse, and modern data architecture. Pick up a copy to master the full ecosystem.

← Back to Iceberg Knowledge Base