Go API: Allow providing custom base cache #7329

anderseknert · 2025-01-30T09:32:22Z

The same way it's possible to provide a VirtualCache, it should be possible to bring your own BaseCache. In clients like Regal (when invoked via regal lint), the base data is only ever loaded once and doesn't change later. Yet currently, each evaluation (of which there are hundreds) will have a new cache instantiated, leading to unnecessary cache misses.

Since our inmem-store is AST-based, we could also try to have the base cache tap directly into the store for a 100% hit ratio, avoiding the more costly storage queries. But that's still untested at this point 🤓

(The tiny changes in resolver.go are unrelated to this feature but fix a perf issue I noticed last night while testing that functionality. Too small a fix to warrant a PR of its own.)

srenatus

This reminds me, have you ever tested the new AST-Values-in-inmem-storage feature that shipped some time ago?

v1/topdown/cache.go

v1/topdown/query.go

Signed-off-by: Anders Eknert <[email protected]>

anderseknert · 2025-01-30T11:36:26Z

> This reminds me, have you ever tested the new AST-Values-in-inmem-storage feature that shipped some time ago?

Yes! It's enabled in Regal and helps quite a bit! That's what I was alluding to here:

> Since our inmem-store is AST-based, we could also try to have the base cache tap directly into the store for a 100% hit ratio, avoiding the more costly storage queries. But that's still untested at this point 🤓

Meaning even with that enabled, going to the cache is much cheaper than going via the storage API, where a single Read request costs 1 + x allocations just to convert a ref to path and back (where x is the length of the ref). That used to be 1 + 2x but was made a little cheaper recently by pooling. And that's just for the path — there are also costs attached to boxing values to interface{}, etc. The cache OTOH is cheap. Only downside is of course that it's somewhat silly to store the same data in the same format in two different places and call one of them a cache. That's why I'm thinking we could have our custom base cache utilize the same backing AST object as the store does. It would mean we have only one copy of the data, and that we avoid the cost of the storage API.

(EDIT: For the regal lint case, I think we could even skip providing a store entirely and load all the data into the base cache directly. That won't work for the language server though, as in that case the data is updated and we need to make sure that's transactional. We can make it so I guess, but then we've reinvented the storage API, lol)

The better solution would be a V2 storage API, and I think @johanfylling wished for that too after he worked on the AST backed inmem store. But that's for another day :)
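The idea of a base cache that uses the same backing object as the store, as discussed in the comment above, could look roughly like the sketch below. This is not OPA code: `node`, `store`, `storeBackedCache`, and `buildStore` are hypothetical names made up for illustration, and the tree is a simplified stand-in for an AST object. The point is only that the cache holds a pointer to the store's tree and walks it directly, so there is one copy of the data and no ref-to-path conversion.

```go
package main

import "fmt"

// node is a hypothetical, simplified AST-like tree node: either a leaf with a
// value or an object with named children.
type node struct {
	value    string
	children map[string]*node
}

// store is a hypothetical in-memory store that owns the tree.
type store struct {
	root *node
}

// storeBackedCache is a hypothetical base cache that keeps no data of its
// own. It only points at the tree the store already holds.
type storeBackedCache struct {
	root *node
}

func newStoreBackedCache(s *store) storeBackedCache {
	return storeBackedCache{root: s.root}
}

// get walks the shared tree along ref and reports whether the node exists.
// No copy of the data is made and no intermediate path value is built.
func (c storeBackedCache) get(ref []string) (*node, bool) {
	cur := c.root
	for _, key := range ref {
		next, ok := cur.children[key]
		if !ok {
			return nil, false
		}
		cur = next
	}
	return cur, true
}

// buildStore creates a tiny store holding data.config.rules = "v1".
func buildStore() *store {
	return &store{root: &node{children: map[string]*node{
		"config": {children: map[string]*node{
			"rules": {value: "v1"},
		}},
	}}}
}

func main() {
	s := buildStore()
	c := newStoreBackedCache(s)

	if n, ok := c.get([]string{"config", "rules"}); ok {
		fmt.Println("lookup through the cache:", n.value)
	}

	// A write to the store's tree is immediately visible through the cache,
	// because both refer to the same object.
	s.root.children["config"].children["rules"].value = "v2"
	if n, ok := c.get([]string{"config", "rules"}); ok {
		fmt.Println("after store update:", n.value)
	}
}
```

Because writes show through immediately in this sketch, it also hints at why the language server case is harder: concurrent updates would need the transactional guarantees that the storage API provides today.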

anderseknert force-pushed the basecache branch from 270c91d to 4250404 Compare January 30, 2025 09:41

charlieegan3 approved these changes Jan 30, 2025

srenatus approved these changes Jan 30, 2025

anderseknert force-pushed the basecache branch from 4250404 to a7956af Compare January 30, 2025 11:21

anderseknert merged commit b751182 into open-policy-agent:main Jan 30, 2025
28 checks passed

anderseknert deleted the basecache branch January 30, 2025 12:03

anderseknert mentioned this pull request Feb 10, 2025: OPA v1.2.0 TODO StyraInc/regal#1405 (Open, 5 tasks)

BrewTestBot mentioned this pull request Feb 28, 2025: opa 1.2.0 Homebrew/homebrew-core#209271 (Merged)
