in_kafka: boost throughput #9800
base: master
Conversation
Force-pushed from da84b9d to 9bcdcbc
Force-pushed from b9a3d68 to b46caea
I found another possible typo in the comments. Could you take a look at it?
I'd recommend that timeount should be written as timeout. This could be a possible typo.
plugins/in_kafka/in_kafka.c
Outdated
dsize = sizeof(conf_val);
res = rd_kafka_conf_get(kafka_conf, "fetch.wait.max.ms", conf_val, &dsize);
if (res == RD_KAFKA_CONF_OK && dsize <= sizeof(conf_val)) {
    /* add 50ms so kafa triggers timeout */
kafa -> kafka
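For context, a minimal standalone sketch of the pattern this hunk points at: read the consumer's fetch.wait.max.ms and use it plus a 50ms margin as the poll timeout. The function name and the fallback value are illustrative, not taken from the plugin:

#include <stdlib.h>
#include <librdkafka/rdkafka.h>

/* Derive the consumer poll timeout from fetch.wait.max.ms plus a
 * 50ms margin, so the broker-side fetch wait expires before our
 * poll timeout does. */
static int derive_poll_timeout_ms(rd_kafka_conf_t *kafka_conf)
{
    char conf_val[32];
    size_t dsize = sizeof(conf_val);
    rd_kafka_conf_res_t res;

    res = rd_kafka_conf_get(kafka_conf, "fetch.wait.max.ms",
                            conf_val, &dsize);
    if (res == RD_KAFKA_CONF_OK && dsize <= sizeof(conf_val)) {
        /* add 50ms so kafka triggers the timeout first */
        return atoi(conf_val) + 50;
    }

    return 500; /* assumed fallback, not taken from the PR */
}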
Polling every 1ms and committing each message individually results in rather poor performance in high-volume Kafka clusters. Committing in batches (relying on Kafka's auto-commit) drastically improves performance. Signed-off-by: CoreidCC <[email protected]>
Having a 1ms timeout might make sense if the input plugin is running in the main thread (not introducing delay for others), but if we run in our very own thread then we should not override the fetch.wait.max.ms configuration value from the Kafka consumer. This, in conjunction with using auto-commit, again boosts the throughput significantly. Signed-off-by: CoreidCC <[email protected]>
Signed-off-by: CoreidCC <[email protected]>
Signed-off-by: CoreidCC <[email protected]>
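As a rough illustration of what the commit messages above describe (not the plugin's actual code): a consume loop that relies on enable.auto.commit and polls with the consumer's own wait time instead of a fixed 1ms. process_message() and poll_timeout_ms are placeholders:

#include <stddef.h>
#include <librdkafka/rdkafka.h>

/* Placeholder for handing the record to the rest of the pipeline. */
static void process_message(const rd_kafka_message_t *msg)
{
    (void) msg;
}

/* Let librdkafka commit offsets periodically instead of committing
 * each message individually. */
static int set_auto_commit(rd_kafka_conf_t *conf)
{
    char errstr[256];

    if (rd_kafka_conf_set(conf, "enable.auto.commit", "true",
                          errstr, sizeof(errstr)) != RD_KAFKA_CONF_OK) {
        return -1;
    }
    return 0;
}

/* Poll with the (larger) derived timeout; no explicit
 * rd_kafka_commit_message() call per record. */
static void consume_loop(rd_kafka_t *rk, int poll_timeout_ms)
{
    rd_kafka_message_t *msg;

    for (;;) {
        msg = rd_kafka_consumer_poll(rk, poll_timeout_ms);
        if (!msg) {
            continue;            /* poll timed out, nothing fetched */
        }
        if (!msg->err) {
            process_message(msg);
        }
        rd_kafka_message_destroy(msg);
    }
}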
Basically, this patch sounds good. Would you mind adding a unit test to confirm the newly introduced parameter, similar to
https://github.com/fluent/fluent-bit/blob/master/tests/runtime/out_kafka.c
Just confirming that the newly introduced enable_auto_commit is handled is enough for now.
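One possible shape for such a test, assuming the usual flb_lib-based runtime test structure used under tests/runtime/; the broker address, topic, and test name are placeholders, and the exact checks may need adjusting:

#include <fluent-bit.h>
#include "flb_tests_runtime.h"

/* Only checks that the kafka input accepts the new
 * enable_auto_commit property and that the pipeline starts. */
void flb_test_in_kafka_enable_auto_commit(void)
{
    int ret;
    int in_ffd;
    int out_ffd;
    flb_ctx_t *ctx;

    ctx = flb_create();
    TEST_CHECK(ctx != NULL);

    in_ffd = flb_input(ctx, (char *) "kafka", NULL);
    TEST_CHECK(in_ffd >= 0);
    ret = flb_input_set(ctx, in_ffd,
                        "brokers", "127.0.0.1:9092",
                        "topics", "test",
                        "enable_auto_commit", "true",
                        NULL);
    TEST_CHECK(ret == 0);

    out_ffd = flb_output(ctx, (char *) "null", NULL);
    TEST_CHECK(out_ffd >= 0);
    ret = flb_output_set(ctx, out_ffd, "match", "*", NULL);
    TEST_CHECK(ret == 0);

    ret = flb_start(ctx);
    TEST_CHECK(ret == 0);

    flb_stop(ctx);
    flb_destroy(ctx);
}

TEST_LIST = {
    {"enable_auto_commit", flb_test_in_kafka_enable_auto_commit},
    {NULL, NULL}
};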
This PR looks good to me. It would be nice to have a test for the newly introduced parameter, but it's not mandatory for now, I believe.
We have a Kafka cluster that ingests about 40k messages (about 60MB) of data per second. Fluent-bit in its current state stands no chance of keeping up with this load. Even Logstash is faster, and Vector consumes all these messages with ease.
Causes:
a) it commits each message individually
b) a poll timeout of just 1ms (this completely overrides fetch.wait.max.ms from Kafka)
probably related to "Batch processing is required in in_kafka. #8030"
Testing: To activate the changes one needs to set
[INPUT]
    Name kafka
    threaded true -> sets the poll timeout to fetch.wait.max.ms + 50ms (aligns our timeout with Kafka's and ensures Kafka triggers the timeout)
    enable_auto_commit true -> disables the explicit commit call
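For completeness, a fuller configuration sketch along those lines; the broker address and topic are placeholders and the output is arbitrary:

[INPUT]
    Name               kafka
    Brokers            127.0.0.1:9092
    Topics             my-topic
    threaded           true
    enable_auto_commit true

[OUTPUT]
    Name  stdout
    Match *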
-> The change doesn't do any dynamic allocations at all and therefore can't introduce any memory leaks
-> The change has no impact on packaging
Throughput increased by more than an order of magnitude.