-
Notifications
You must be signed in to change notification settings - Fork 3.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve logging and add OpenMetrics docs #21089
base: master
Are you sure you want to change the base?
Conversation
5d58530
to
3f80d5e
Compare
The slack channel recently swiftly educated me that for some metrics I need pull metrics from all nodes including workers and not just the coordinator, Prometheus supports k8s service discovery as standard and most implementations have a standard "prometheus-anotations" job configured, however in general that's unauthenticated. I created two copies of that, one for the coordinator and one for the workers because in my case:
@mattstep requested I share some Prometheus configuration required to sour Firstly in the helm for trino add the annotations
Then in the prometheus config
|
This pull request has gone a while without any activity. Tagging the Trino developer relations team: @bitsondatadev @colebow @mosabua |
This looks good to me, and I think would benefit from the more complex example from @lozbrown , this gives a more realistic configuration for a production usecase. |
I have a whole bunch of further info and details from @lozbrown and others that I will add .. just have to get back to working on this. |
I think it would be good to include a list of the things you typically monitor in your dashboard as experts. Possibly including promql for these |
@mosabua can we move along some sort of first attempt here on the basis something is better than completely undocumented features |
Its on my backlog but I am flat out busy .. I will try to get a minimal PR merged first and expand later at this stage. |
8108ac0
to
fd61470
Compare
@lozbrown I have updated this PR quite a bit now, but still work some of your material into it. However you can already review now Regarding #21089 (comment) .. what do you think @nineinchnick .. is there enough docs info about this in the Helm chart docs or do we want to add more and then maybe link back to the new page from this PR? Also @mattstep .. can you take a peek at the JSON and TCP logging stuff? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The Trino Helm chart doesn't really describe how to configure Prometheus. It can create a service monitor for the Prometheus Operator, but it also doesn't include any configuration.
IMO, it's better to expand this in the Trino docs, since they're way more discoverable.
5ddc73c
to
8ec0980
Compare
I have raised a new issue here to request openmetrics endpoint become (configurably) anonymous This complexity is the main reason many are still using the jmx_exporter to the point its been introduces into the helm chart as it allows anonymous scrape That would significantly reduce the complexity of scraping the metrics in autoscaling and kubernetes environments. I still feel that the running of prometheus is outside the scope of trino documentation but possibly a link to the annotation to cause metrics to be scraped would be useful. In many cases the tooling will be using things like grafana-agent grafana alloy opentelemetry agent to push data to a more modern stack like cortex / mimir as running prometheus in the above way is somewhat uncommon now. Other points:
|
Description
Additional context and related issues
Related to TCB 57
Fixes #20637
Release notes
(x) This is not user-visible or is docs only, and no release notes are required.