-
Notifications
You must be signed in to change notification settings - Fork 917
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Notifications not firing for Analysis Run fails when Analysis Run is part of an Experiment #4009
Labels
bug
Something isn't working
Comments
More rough notes from a conversation with @zachaller: Rollouts uses the notification engine in 2 ways
Path to explore Create a new notification controller / deployment - this is similar to what Argo CD does.
|
meeech
added a commit
to CircleCI-Public/argo-rollouts
that referenced
this issue
Feb 9, 2025
…he Experiment Addresses argoproj#4009. This change will fire Analysis Run events bound to the parent Rollout object when the Experiment is a Step in the Rollout.
meeech
added a commit
to CircleCI-Public/argo-rollouts
that referenced
this issue
Feb 9, 2025
…he Experiment Addresses argoproj#4009. This change will fire Analysis Run events bound to the parent Rollout object when the Experiment is a Step in the Rollout. Signed-off-by: mitchell amihod <[email protected]>
6 tasks
meeech
added a commit
to CircleCI-Public/argo-rollouts
that referenced
this issue
Feb 11, 2025
…he Experiment Addresses argoproj#4009. This change will fire Analysis Run events bound to the parent Rollout object when the Experiment is a Step in the Rollout. Signed-off-by: mitchell amihod <[email protected]>
meeech
added a commit
to CircleCI-Public/argo-rollouts
that referenced
this issue
Feb 11, 2025
…he Experiment Addresses argoproj#4009. This change will fire Analysis Run events bound to the parent Rollout object when the Experiment is a Step in the Rollout. Signed-off-by: mitchell amihod <[email protected]>
zachaller
pushed a commit
that referenced
this issue
Feb 14, 2025
* chore: ignore all debug_bin* Signed-off-by: mitchell amihod <[email protected]> * feat(experiments): Add a utility to check if an experiment belongs to a Rollout. We can then identify when an experiment is a Step in a Rollout. Signed-off-by: mitchell amihod <[email protected]> * chore: typo Signed-off-by: mitchell amihod <[email protected]> * feat(experiments): Fire k8s Event bound to the Rollout when it owns the Experiment Addresses #4009. This change will fire Analysis Run events bound to the parent Rollout object when the Experiment is a Step in the Rollout. Signed-off-by: mitchell amihod <[email protected]> * Loop through ownerReferences to find the rollout reference. If we pass belongs to rollout check, we know there is a rollout to find. Signed-off-by: mitchell amihod <[email protected]> * Tighten things up - don't need a bool - fetch the ref or nil Signed-off-by: mitchell amihod <[email protected]> --------- Signed-off-by: mitchell amihod <[email protected]>
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
I am starting this ticket to capture the information as I investigate and try to resolve this issue.
If anyone has any pointers or thoughts, please add them.
When an analysis run fail happens and that analysis run is part of an inline experiment step, we don't get the on-analysis-run-error or on-analysis-run-fail notification.
Analysis Run Error
✅ Background Analysis Run: event: AnalysisRunError object: rollout/basic-rollout
❌ Inline Step Analysis Run: event: AnalysisRunError object: experiment/basic-rollout-exp-steps-b66774df5-3-0
We get the RolloutAborted notification for both, because the event that fires belongs to the rollout/* object in both cases
Analysis Run Fail
✅ Background Analysis Run: event: AnalysisRunFailed object: rollout/basic-rollout
❌ Inline Step Analysis Run: event: AnalysisRunFailed object: experiment/basic-rollout-exp-steps-bd7bdfcc8-4-0
We get the RolloutAborted notification for both, because the event that fires belongs to the rollout/* object in both cases
So this has me thinking theres a few possible options:
I'll keep digging. Unsure what the ideal would be:
would we like something like
on-experiment-analysis-run-failed
,on-experiment-analysis-run-error
... or would things be better served with them using the already existing triggers? I think when its a step it would make sense to use the existing triggers, and have the rollout object available for the templates, but what about stand alone experiments?Version
1.7.2 (but this has existed as a problem as long as I've been using experiment step, so at least 1.5/1.6
Message from the maintainers:
Impacted by this bug? Give it a 👍. We prioritize the issues with the most 👍.
The text was updated successfully, but these errors were encountered: