
BUGFIX: Add exponential backoff when reading from Redis #3344

Open · kdambekalns wants to merge 2 commits into base: 8.3

Conversation

kdambekalns (Member):

When Redis is not (yet) ready while Flow tries to read from the cache, this change makes Flow wait and retry up to 8 times, with an exponentially growing back-off time between attempts.

Fixes #3284
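
For illustration, here is a minimal sketch of what such a read with exponential back-off can look like (the base delay of 100 ms and the exact loop shape are assumptions for illustration, not necessarily the exact code in this PR):

```php
for ($attempt = 0; $attempt <= 8; $attempt++) {
    try {
        return $this->uncompress($this->redis->get($this->getPrefixedIdentifier('entry:' . $entryIdentifier)));
    } catch (\RedisException $exception) {
        if ($attempt === 8) {
            // all retries exhausted, let the exception surface
            throw $exception;
        }
        // back off exponentially: 0.1s, 0.2s, 0.4s, ... between attempts
        usleep((2 ** $attempt) * 100000);
    }
}
```

With a base delay of b and 8 retries, the worst-case wait before the exception surfaces is (2^8 − 1) · b, roughly 25.5 s for an assumed b = 100 ms; this is the order of magnitude behind the "20s+ delay" concern raised below.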

Review instructions

This might be hard to check in real life, sorry…

Checklist

  • Code follows the PSR-2 coding style
  • Tests have been created, run and adjusted as needed
  • The PR is created against the lowest maintained branch
  • Reviewer - PR Title is brief but complete and starts with FEATURE|TASK|BUGFIX
  • Reviewer - The first section explains the change briefly for change-logs
  • Reviewer - Breaking Changes are marked with !!! and have upgrade-instructions

```php
do {
    try {
        return $this->uncompress($this->redis->get($this->getPrefixedIdentifier('entry:' . $entryIdentifier)));
    } catch (\RedisException $exception) {
```
Member:

Seems dangerous to me… Isn't there at least a more specific exception that is thrown when "Redis is not ready (yet)"?

Member Author:

No, you can only inspect the error message.

But then again, even if it's something else, why not try again? What could be dangerous?

Member:

I'm not sure, but I've seen so many errors in the past caused by silently caught exceptions that they make me suspicious.
Also, we have seen performance issues with exponential back-offs in the past (and are replacing those in Neos 9). For example: some error in Redis will now lead to a 20s+ delay (if I counted correctly) until it is displayed. A wrong configuration could easily build up and kill the server upon deployment.

What about moving this logic to the getStatus() implementation and making sure that it is called upon deployment?
And/or implementing the WithSetupInterface and putting it there?
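
For context, a rough sketch of that getStatus() alternative, assuming the backend implements Flow's WithStatusInterface and uses a simple ping as the readiness probe (both assumptions for illustration):

```php
public function getStatus(): \Neos\Error\Messages\Result
{
    $result = new \Neos\Error\Messages\Result();
    try {
        // ping() throws a \RedisException if the server is unreachable
        $this->redis->ping();
    } catch (\RedisException $exception) {
        $result->addError(new \Neos\Error\Messages\Error('Redis is not reachable: ' . $exception->getMessage()));
    }
    return $result;
}
```

A deployment pipeline could then poll this status until it reports no errors before switching traffic, instead of retrying inside every cache read.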

Member Author:

Yeah, maybe logging the exceptions would be good – after all, at that point nothing should go wrong.

About killing something on a deployment: The use case for me here is exactly a deployment failure. 🙃

Adding something else that has to be called explicitly won't happen (in "my" current project), as we already have that workaround: call doctrine:compileproxies after deployment until no error appears… 🙈

Member:

> About killing something on a deployment: The use case for me here is exactly a deployment failure.

Right, it fixes the error you described in the issue, but it might create a new one for misconfigured backends.
I won't block this, but I'm not a big fan, as you might be able to tell :)
What about turning this into a composition, i.e. introducing some RetryBackend that can be wrapped around any other backend (similar to the MultiBackend)?
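
A sketch of what such a composition could look like; the RetryBackend name, its constructor options and the \Throwable catch are assumptions here, and only get() is shown (the remaining BackendInterface methods would delegate the same way):

```php
use Neos\Cache\Backend\BackendInterface;

class RetryBackend
{
    public function __construct(
        protected BackendInterface $backend,
        protected int $maxRetries = 8,
        protected int $baseDelayMicroseconds = 100000
    ) {
    }

    public function get(string $entryIdentifier)
    {
        for ($attempt = 0; ; $attempt++) {
            try {
                return $this->backend->get($entryIdentifier);
            } catch (\Throwable $throwable) {
                if ($attempt >= $this->maxRetries) {
                    throw $throwable;
                }
                // exponential back-off between attempts
                usleep((2 ** $attempt) * $this->baseDelayMicroseconds);
            }
        }
    }

    // set(), has(), remove(), flush(), ... would delegate to $this->backend likewise
}
```

Catching \Throwable is the price of being backend-agnostic: a generic wrapper cannot know the concrete backend's exception types, which circles back to the "silently caught exceptions" concern above.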

Member Author:

That is also an option…

Member:

I think that the PDO and Redis backends should both have an automatic retry implementation like this by default. In modern hosting environments there will be movement, and it can happen that Redis is gone for a second or two once in a while. Therefore, it would be nice if a Flow dev wouldn't have to think about this and her application would just work.

I see @bwaidelich's point that sometimes retries can worsen things. You need to apply the "circuit breaker" design pattern for those cases. However, it depends…

I wouldn't like to introduce another wrapper backend, since it makes the setup complicated, and I think that a reasonable retry is what most people want.

How about making this configurable (on/off and number of retries), using the same option names for RedisBackend, PDOBackend and any other backend that could benefit from this?
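
Since Flow's cache backends map each backendOptions entry to a setter method, such shared options could look like this sketch (the option names retryEnabled / maxRetries are assumptions, not an agreed-upon API):

```php
// In RedisBackend, PdoBackend, ... alike; would enable e.g.
//   backendOptions:
//     retryEnabled: true
//     maxRetries: 8
protected bool $retryEnabled = false;
protected int $maxRetries = 8;

public function setRetryEnabled(bool $retryEnabled): void
{
    $this->retryEnabled = $retryEnabled;
}

public function setMaxRetries(int $maxRetries): void
{
    $this->maxRetries = $maxRetries;
}
```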
