helm-charts

Kafka Connect

A Helm chart for Confluent Kafka Connect on Kubernetes

Introduction

This chart bootstraps a Kafka Connect using the Confluent stable version.

Kafka Connect is an open source framework, part of Apache Kafka, to stream data into and out of Apache Kafka.

Developing Environment

component version
Podman v4.3.1
Minikube v1.28.0
Kubernetes v1.25.3
Helm v3.10.2
Confluent Platform v7.3.0

Installing the Chart

Add the chart repository, if not done before:

helm repo add rhcharts https://ricardo-aires.github.io/helm-charts/

Kafka Connect lives outside of and separately from your Kafka brokers. Although Schema Registry is not a required service for Kafka Connect, it enables you to easily use Avro, Protobuf, and JSON Schema as common data formats for the Kafka records that connectors read from and write to.

By default this chart is set to use the umbrella chart kstack, but can be run against an external Kafka and Schema Registry by passing:

helm install --set kafka.enabled=false --set kafka.bootstrapServers=PLAINTEXT://kstack-kafka-headless.default:9092 --set schema-registry.enabled=false --set schema-registry.url=kstack-schema-registry.default:8081 ktool rhcharts/schema-registry
NAME: ktool
LAST DEPLOYED: Tue Mar 23 18:35:37 2021
NAMESPACE: default
STATUS: deployed
REVISION: 1
NOTES:
** Please be patient while the kafka-connect chart is being deployed in release ktool **

This chart bootstraps a Confluent Schema Registry that can be accessed from within your cluster:

    ktool-kafka-connect.default:8083

$

These commands deploy Kafka Connect on the Kubernetes cluster in the default configuration. The Parameters section lists the parameters that can be configured during installation.

One can run the:

To uninstall the ktool deployment run:

helm uninstall ktool

The command removes all the Kubernetes components associated with the chart and deletes the release.

Parameters

You can specify each parameter using the --set key=value[,key=value] argument to helm install.

Alternatively, a YAML file that specifies the values for the parameters can be provided while installing the chart. For example,

helm install ktool -f my-values.yaml rhcharts/kafka-connect

A default values.yaml is available and should be checked for more advanced usage.

Image

By default the confluentinc/cp-kafka-connect is in use.

Parameter Description Default
image.registry Registry used to distribute the Docker Image. docker.io
image.repository Docker Image of Confluent Kafka Connect. confluentinc/cp-kafka-connect
image.tag Docker Image Tag of Confluent Kafka Connect . 7.3.0

One can easily change the image.tag to use another version. When using a local/proxy docker registry we must change image.registry as well.

Standalone vs Distributed Mode

Kafka Connect currently supports two modes of execution:

The default is to only have one replica, thus standalone, but we can change the number of replicas using replicaCount.

Confluent Kafka Connect Configuration

The next configuration related to Kafka Connect are available:

Parameter Description Default
keyConverter Converter class for key Connect data. io.confluent.connect.avro.AvroConverter
valueConverter Converter class for value Connect data. io.confluent.connect.avro.AvroConverter
storageReplicatorFactor The replication factor used when Kafka Connects creates internals config, offset and status topics 3
pluginPath The comma-separated list of paths to directories that contain Kafka Connect plugins. /usr/share/java,/usr/share/confluent-hub-components

More information can be found in the Confluent Documentation.

Ports used by Kafka Connect

By default the Service will expose the pods in the port 8083, port.

Kerberos authentication

This chart is prepared to enable Kerberos authentication in Zookeeper

Parameter Description Default
kerberos.enabled Boolean to control if Kerberos is enabled. false
kerberos.krb5Conf Name of the ConfigMap that stores the krb5.conf, Kerberos Configuration file nil¹
kerberos.keyTabSecret Name of the Secret that stores the Keytab nil¹
kerberos.jaasConf Name of the ConfigMap that stores the JAAS configuration files per host. nil¹
kafka.kerberos.serviceName Primary of the Principal (user, service, host) used to connect to Kafka Brokers nil
kafka.kerberos.domain REALM of the Principal used to connect to Kafka Brokers nil

¹ When kerberos.enabled these parameters are required, and the ConfigMap and Secret need to exist before.

Resources for Containers

Regarding the management of Resources for Containers the next defaults regarding requests and limits are set:

Parameter Description Default
resources.limits.cpu a container cannot use more CPU than the configured limit 1
resources.limits.memory a container cannot use more Memory than the configured limit 1400Mi
resources.requests.cpu a container is guaranteed to be allocated as much CPU as it requests 250m
resources.requests.memory a container is guaranteed to be allocated as much Memory as it requests 512Mi

In terms of the JVM the next default is set:

Parameter Description Default
heapOpts The JVM Heap Options for Kafka Connect. "-XX:MaxRAMPercentage=75.0 -XX:InitialRAMPercentage=50.0"

Advance Configuration

Check the values.yaml for more advance configuration such as: