Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

thermalctld: Ignore exception when deleting chassisdb entry fails #585

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

patrickmacarthur
Copy link
Contributor

Description

After sonic-mgmt tests, it does a config consistency check and if the config consistency check fails, it runs recover_chassis(). That function simultaneously runs config_reload on the supervisor and all linecards (in parallel). This means that the supervisor networking can be in the process of restarting when thermalctld exits and goes through the cleanup process of deleting entries from chassisdb.

Motivation and Context

This fixes a thermalctld crash when the supervisor and linecards simultaneously do a config reload (fixes sonic-net/sonic-buildimage#21058). This causes a failure during sonic-mgmt testing.

How Has This Been Tested?

This change has been applied in Arista internal test environment and the thermalctl crash was not seen with this change applied.

Additional Information (Optional)

After sonic-mgmt tests, it does a config consistency check and if the
config consistency check fails, it runs recover_chassis(). That
function simultaneously runs config_reload on the supervisor and all
linecards (in parallel). This means that the supervisor networking can
be in the process of restarting when thermalctld exits and goes
through the cleanup process of deleting entries from chassisdb.
@mssonicbld
Copy link
Collaborator

/azp run

Copy link

Azure Pipelines successfully started running 1 pipeline(s).

@rlhui rlhui requested a review from judyjoseph January 29, 2025 21:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: No status
Development

Successfully merging this pull request may close these issues.

thermalctld crash on exit if connection to chassisdb is broken
2 participants