Fix crash in replicationCacheMaster() expecting a nullptr cached_master #896

guillemj · 2025-02-11T12:48:54Z

The replicationCacheMaster() function expects the master pointer to be non-nullptr and cached_master to be nullptr. But if we get disconnected multiple times, and thus go through replicationCreateMasterClient() multiple times, both master and cached_master end up non-nullptr. That trips the assertion in replicationCacheMaster() and crashes the server.

When recreating the master struct, mark any existing cached_master for asynchronous freeing and reset its pointer to nullptr.
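To illustrate the state transitions involved, here is a minimal standalone sketch. It is not the actual patch: `Client`, `MasterInfo`, `pending_free` and the `*Sketch` functions are hypothetical stand-ins for the real structures and for the asynchronous free queue, and only the `master` / `cached_master` invariant is taken from the description above.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <vector>

// Hypothetical stand-ins; the real client and master-info structs are far larger.
struct Client { int id; };

struct MasterInfo {
    Client* master = nullptr;         // live connection to the master
    Client* cached_master = nullptr;  // previous master kept for a later resync
};

// Assumption: models the asynchronous free queue as a plain vector that
// takes ownership of the clients handed to it.
std::vector<std::unique_ptr<Client>> pending_free;

void freeClientAsyncSketch(Client* c) {
    pending_free.emplace_back(c);
}

// Models recreating the master struct, including the fix: a leftover
// cached_master is queued for freeing and its pointer is reset.
void createMasterClientSketch(MasterInfo& mi, int id) {
    if (mi.cached_master != nullptr) {
        freeClientAsyncSketch(mi.cached_master);
        mi.cached_master = nullptr;
    }
    mi.master = new Client{id};
}

// Models the caching step on disconnect, with the invariant that crashed:
// master must be set and cached_master must be empty.
void cacheMasterSketch(MasterInfo& mi) {
    assert(mi.master != nullptr && mi.cached_master == nullptr);
    mi.cached_master = mi.master;
    mi.master = nullptr;
}

// Two connect/disconnect cycles in a row; returns how many stale cached
// masters were queued for freeing along the way.
std::size_t simulateTwoDisconnects() {
    MasterInfo mi;
    createMasterClientSketch(mi, 1);
    cacheMasterSketch(mi);            // first disconnect
    createMasterClientSketch(mi, 2);  // reconnect: stale cache is queued and reset
    cacheMasterSketch(mi);            // second disconnect: invariant holds
    delete mi.cached_master;          // tidy up the sketch's own allocation
    return pending_free.size();
}
```

Without the `if` block in `createMasterClientSketch()`, the second call to `cacheMasterSketch()` would find both pointers set and fail its assertion, which is the crash this change addresses.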

Fixes: #849

This fixes the crash for us, but a little later we hit another, apparently unrelated crash. That one might be caused by the database being damaged (which was the trigger in our case) or by something else. Given the number of crash reports filed, this does not seem surprising.

