Kafka Connect architecture

The following diagram represents the Kafka Connect architecture:

The Kafka cluster consists of Kafka brokers; the diagram shows three. Sources can be of any type, for example, relational databases, NoSQL stores, or Twitter. Between the sources and the Kafka cluster sits a Kafka Connect cluster, which is made up of workers. Kafka Connect works in the following steps:

  1. Workers, based on their configuration, pull data from the sources
  2. After getting the data, the connector pushes it to the Kafka cluster
  3. If data needs to be transformed, filtered, joined, or aggregated using stream applications such as Spark, ...
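
The configuration that drives the first two steps can be sketched as the JSON body a client would submit to a Kafka Connect worker to register a source connector. This is a minimal illustration, not the book's own example: it assumes the FileStreamSourceConnector that is distributed with Apache Kafka for demonstrations (recent Kafka releases may require adding it to the worker's plugin path), and the connector name, file path, topic name, and the helper function build_source_connector_request are hypothetical.

```python
import json


def build_source_connector_request(name, file_path, topic, tasks_max=1):
    """Build the JSON body for registering a source connector.

    The returned dict has the shape expected by the Kafka Connect REST
    API (POST /connectors): a connector name plus a flat config map.
    """
    return {
        "name": name,
        "config": {
            # Demo connector distributed with Apache Kafka (assumed here).
            "connector.class": "org.apache.kafka.connect.file.FileStreamSourceConnector",
            # Upper bound on the tasks the workers run for this connector.
            "tasks.max": str(tasks_max),
            # Source the workers pull data from (hypothetical path).
            "file": file_path,
            # Kafka topic the connector pushes records to (hypothetical name).
            "topic": topic,
        },
    }


request_body = build_source_connector_request(
    name="demo-file-source",
    file_path="/tmp/input.txt",
    topic="connect-demo",
)

# Print the payload; no request is sent in this sketch.
print(json.dumps(request_body, indent=2))
```

In a running deployment, this body would typically be sent with an HTTP POST to the /connectors endpoint of any worker (port 8083 unless the worker configuration changes it). The workers then start tasks that read from the configured source and write to the configured topic.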
