-
Notifications
You must be signed in to change notification settings - Fork 602
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
kafka/server: add metrics and config for consumer lag reporting #24977
base: dev
Are you sure you want to change the base?
kafka/server: add metrics and config for consumer lag reporting #24977
Conversation
Note that this commit contains only the metric infrastructure, i.e. the probe and the mechanism to dynamically enable/disable these metrics. A subsequent commit will implement the logic to populate the consumer lag metrics data.
CI test resultstest results on build#61359
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nice!
@@ -965,6 +967,7 @@ class group final : public ss::enable_lw_shared_from_this<group> { | |||
chunked_hash_map<model::topic_partition, offset_metadata> | |||
_pending_offset_commits; | |||
enable_group_metrics _enable_group_metrics; | |||
config::binding<bool> _enable_consumer_lag_metrics; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: you aren't using or assigning this in this commit so it should be moved to a different commit
# wait for some messages | ||
wait_until( | ||
lambda: ConsumerGroupTest.group_consumed_at_least( | ||
consumers, 50 * len(consumers)), 30, 2) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: add an error message in wait_until
Implements: https://redpandadata.atlassian.net/browse/CORE-8914
Introduce "enable_consumer_group_lag_metrics" which controls whether the consumer lag metrics are active. This can be changed without needing a restart.
Introduce the metrics scaffolding needed to have metrics that can be enabled/disabled at runtime.
redpanda_kafka_consumer_group_lag_max
group
,shard
redpanda_kafka_consumer_group_lag_sum
group
,shard
Backports Required
Release Notes