
Delete dry-run and get manager image free #1234

Closed
a-thaler opened this issue Jul 2, 2024 · 0 comments
Labels: area/logs LogPipeline kind/feature

a-thaler commented Jul 2, 2024

Description
As part of #767, the manager should be freed of the validating webhook for the LogPipeline, which performs some basic validation and also executes the pipeline validation using the fluentbit dry-run mode.

The dry-run mode was initially introduced to provide advanced validation for the unsupported mode, that is, when users provide free-style fluentbit filters and outputs. In that case, the dry-run mode adds value because it checks the free-style text for syntax and semantic problems.
The price for that feature is a hard dependency on the fluentbit binary inside the manager container, including the use of a Debian base image.
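
For context, here is a minimal sketch of what the dry-run check amounts to, assuming the manager shells out to a bundled fluent-bit binary using its `--dry-run` and `--config` flags; the package name and paths are illustrative, not the actual manager code:

```go
package dryrun

import (
	"context"
	"fmt"
	"os/exec"
)

// ValidateConfig runs the bundled fluent-bit binary in dry-run mode against a
// rendered pipeline config and surfaces its exit status. This is roughly the
// capability proposed for removal, since it forces the fluent-bit binary (and
// a Debian base image) into the manager container.
func ValidateConfig(ctx context.Context, fluentBitPath, configPath string) error {
	cmd := exec.CommandContext(ctx, fluentBitPath, "--dry-run", "--config", configPath)
	out, err := cmd.CombinedOutput()
	if err != nil {
		// fluent-bit exits non-zero on syntax or semantic problems in the config.
		return fmt.Errorf("fluent-bit dry-run failed: %w: %s", err, string(out))
	}
	return nil
}
```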

When the dry-run mode was introduced, no e2e tests were written, so it is hard to say whether the functionality still delivers the expected outcome; manual tests have been positive so far. However, the feedback it gives is very poor (`error: logpipelines.telemetry.kyma-project.io "cls" is invalid`): there is no detailed message to figure out what is wrong, and there is no place to look it up.

Also, when leveraging the otel-collector in the future, there will no longer be an unsupported mode offering free-text possibilities.

Instead of moving this logic into a validation phase inside the reconciler, we should remove the feature entirely, which greatly reduces complexity and runtime dependencies. In return, we should improve the agent health status to reflect startup problems in a meaningful way. If a pod exits with code 255, for example, we could use a dedicated reason with a message indicating to check the pod logs, where a more detailed message can be found.
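
As a rough illustration of the proposed status handling (not existing code), a hedged sketch that derives an AgentHealthy condition from the agent pod's container statuses; the reason and message strings are placeholders:

```go
package status

import (
	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// agentHealthyCondition derives an AgentHealthy condition from the agent pod.
// If a container terminated with a non-zero exit code (for example 255 on a
// fluentbit config error), the condition points the user to the pod logs,
// where the detailed error message can be found.
func agentHealthyCondition(pod *corev1.Pod) metav1.Condition {
	for _, cs := range pod.Status.ContainerStatuses {
		term := cs.State.Terminated
		if term == nil {
			term = cs.LastTerminationState.Terminated
		}
		if term != nil && term.ExitCode != 0 {
			return metav1.Condition{
				Type:    "AgentHealthy",
				Status:  metav1.ConditionFalse,
				Reason:  "AgentContainerCrashed", // placeholder reason
				Message: "Agent container " + cs.Name + " exited with a non-zero code; check the agent pod logs for details",
			}
		}
	}
	return metav1.Condition{
		Type:   "AgentHealthy",
		Status: metav1.ConditionTrue,
		Reason: "AgentReady",
	}
}
```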

Criteria

  • The dry-run functionality is removed
  • The manager base image is "from scratch"
  • If a configuration mistake happens in a custom component, the AgentHealthy condition indicates a problem with the agent, and its message points the user to the agent logs