
GH-1246. Asynchronously initialize cache before reading #1247

Draft
wants to merge 3 commits into base: master

Conversation


@kotman12 kotman12 commented Feb 20, 2025

Linked issue #1246

The current CachedModeledFrameworkImpl doesn't manage cache initialization for you. A perfect example of the kind of code you can expect to see in the wild is in the CachedModeledFramework tests themselves, i.e. blocking on semaphores that pin the reading thread to keep it from performing cache-dependent operations (although, as far as I can tell, exactly which operations are cache-dependent is not really guaranteed, so this is arguably cumbersome). Either way, this is fine in a lot of cases...
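The blocking workaround described above might look something like this minimal sketch (class, method, and path names are illustrative stand-ins, not Curator's actual API):

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.CountDownLatch;

// Hypothetical stand-in for a cache-backed client to show the antipattern.
class BlockingInitDemo {
    static final CountDownLatch initialized = new CountDownLatch(1);
    static final Map<String, String> cache = new ConcurrentHashMap<>();

    // Simulates the cache's "initialized" listener firing after warm-up.
    static void onCacheInitialized() {
        cache.put("/example", "value");
        initialized.countDown();
    }

    // The awkward pattern: pin the reading thread until the latch opens,
    // defeating the non-blocking style of the surrounding API.
    static String blockingRead(String path) {
        try {
            initialized.await();      // reader is parked here until warm-up
            return cache.get(path);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted waiting for cache", e);
        }
    }
}
```

The issue is that `blockingRead` parks a whole thread inside an otherwise fully asynchronous API, and every caller has to know to do this.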

However, I propose an additional InitializedCachedModeledFramework implementation which asynchronously waits for the cache initialization trigger and only then proceeds to read from the cache. I implemented something similar for my personal use case, where I couldn't rely on, e.g., readThrough to handle the uninitialized case, because readThrough cannot disambiguate between a znode that is missing because it truly is absent from ZooKeeper and one that is missing because the cache hasn't initialized. In my case the znode wouldn't always exist in ZooKeeper, so using readThrough would result in a lot of wasted calls to ZooKeeper, greatly reducing the benefit of the cache in the first place.

To reiterate, InitializedCachedModeledFramework has a couple of benefits over the existing implementation:

  1. No more possibility of a misleading NoNodeException when reading from CachedModeledFramework before the cache has warmed. (I say misleading because the node may exist... just not in the cache)
  2. No more temptation to add blocking semaphores in front of this non-blocking interface.
  3. IMO generally less confusion about how to properly use this otherwise great(!) feature.
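A rough sketch of the proposed behavior (again with hypothetical names, not the actual patch): reads are chained off an initialization future instead of blocking a thread, so the returned future simply doesn't complete until the cache has warmed.

```java
import java.util.Map;
import java.util.Optional;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;

// Illustrative sketch of the idea behind InitializedCachedModeledFramework.
class AsyncInitDemo {
    final CompletableFuture<Void> initialized = new CompletableFuture<>();
    final Map<String, String> cache = new ConcurrentHashMap<>();

    // Simulates the cache's "initialized" event firing after warm-up.
    void onCacheInitialized(Map<String, String> warmed) {
        cache.putAll(warmed);
        initialized.complete(null);
    }

    // Non-blocking: the returned future completes only after warm-up, so an
    // empty result truly means "no such node", never "cache not warmed yet".
    CompletableFuture<Optional<String>> read(String path) {
        return initialized.thenApply(ignore -> Optional.ofNullable(cache.get(path)));
    }
}
```

Once the initialization future has completed, subsequent reads chain off an already-completed future and resolve immediately, so the steady-state overhead is small.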

@kotman12 kotman12 force-pushed the Asynchronously-Initialize-Cache-Before-Reading branch from 753ba51 to 9891901 on February 21, 2025 21:24
@kotman12 kotman12 changed the title Asynchronously initialize cache before reading [Cirator 1246] Asynchronously initialize cache before reading Feb 26, 2025
@kotman12 kotman12 changed the title [Cirator 1246] Asynchronously initialize cache before reading [Curator 1246] Asynchronously initialize cache before reading Feb 26, 2025
@kotman12 kotman12 changed the title [Curator 1246] Asynchronously initialize cache before reading [CURATOR 1246] Asynchronously initialize cache before reading Feb 26, 2025
@kotman12 kotman12 marked this pull request as ready for review February 26, 2025 23:52
@kotman12 kotman12 changed the title [CURATOR 1246] Asynchronously initialize cache before reading [CURATOR-1246] Asynchronously initialize cache before reading Feb 26, 2025
@tisonkun tisonkun changed the title [CURATOR-1246] Asynchronously initialize cache before reading GH-1246. Asynchronously initialize cache before reading Feb 27, 2025

@tisonkun tisonkun left a comment


Thanks for your contribution @kotman12!

My extra consideration is whether the original behavior has some benefits, or whether we can simply replace the old implementation with the new one. Compatibility is also a consideration, though we can bump a major version if needed and worthwhile.

Anyway, this patch itself LGTM.

@tisonkun tisonkun requested a review from kezhuw March 1, 2025 09:30

kotman12 commented Mar 1, 2025

Thanks for taking a look @tisonkun! I think I agree... IMO in the majority of cases the "initialized" variety is what you want, even though there may be some minimal overhead from going through/checking two futures instead of one. I don't know at what scale that matters, if at all, but I didn't benchmark it, so I didn't want to jump to conclusions.

I'd be happy to refactor CachedModeledFrameworkImpl itself instead of adding another implementation. A benefit of that approach is that the tests would be a lot simpler, though I'm not sure whether such a change would need to be benchmarked first. At any rate, it shouldn't introduce any backwards incompatibilities, so I'm not sure a major version bump is even needed to change CachedModeledFrameworkImpl to manage initialization.

A third option is to make InitializedCachedModeledFramework the default cached implementation returned by ModeledFramework::cached. This would make the user-friendlier implementation more discoverable while still giving hardcore users the option to use the original, uninitialized version.

Of course I'll defer to the more experienced community here.


kotman12 commented Mar 5, 2025

@tisonkun / @kezhuw please take a look at #1250 when you get a chance. I believe it is a less heavyweight implementation of the same thing we are doing here. Please don't merge this PR because while implementing #1250 I realized there are still gaps here. Rather than fix them I'll kindly first refer you to #1250 since if that is merged this one can be closed. I believe it should be backwards compatible and nearly identical from a performance standpoint.

@kotman12 kotman12 marked this pull request as draft March 5, 2025 19:49