chore: cancel context on signals #3042

Open
wants to merge 2 commits into
base: main

Conversation

alecthomas
Collaborator

No description provided.

This was referenced Oct 8, 2024
// process group, and exits.
func RunWithSignalHandler(ctx context.Context, run func(ctx context.Context) error) error {
	logger := log.FromContext(ctx)
	ctx, stop := signal.NotifyContext(ctx, syscall.SIGINT, syscall.SIGTERM)
Collaborator

Doesn't the work triggered by context cancellation happen asynchronously? Given that we immediately send SIGTERM, won't it be non-deterministic which goroutines actually see the context cancellation?

Collaborator Author

@alecthomas alecthomas Oct 9, 2024

"run" won't return until it and all its child goroutines are complete, at which point there shouldn't be anything still running. This is actually an improvement on the previous behaviour, which listened for a signal and immediately ran kill + exit.

Collaborator

What makes 'run' not return until child goroutines complete?

Collaborator Author

The design of the function: it uses errgroups to track sub-tasks, etc., so it only returns once they have finished.

Collaborator

Ah, so this needs follow-up work to make sure this happens, because this is definitely not true at the moment.

If we do this properly, we probably also want a timeout with some kind of thread-dump functionality to diagnose shutdown hangs.

Collaborator Author

Correct, this is just the introduction of the basic cancellation.

Actually I may not be able to merge this. Turns out the only way the BuildEngine will terminate currently is by being literally killed, so yep.

@alecthomas alecthomas force-pushed the aat/signal-handlers branch 2 times, most recently from d65ce2d to a734fe4 Compare October 9, 2024 21:34
@alecthomas alecthomas marked this pull request as ready for review October 9, 2024 21:35
@alecthomas alecthomas requested review from a team as code owners October 9, 2024 21:35
@alecthomas alecthomas requested review from a team, stuartwdouglas, worstell and deniseli and removed request for a team October 9, 2024 21:35
2 participants