Suor / django-cacheops

A slick ORM cache with automatic granular event-driven invalidation.

Better memory limit support

Suor opened this issue

For now cacheops offers two imperfect strategies to handle memory limits. Both have flaws. I'm creating this issue to track the topic.

Alternative strategies available now:

  1. Switch off maxmemory. Use an external periodic job to run custom cleanup when memory usage exceeds the limit. Cons: clunky; the lag before eviction can cause arbitrary memory use.
  2. Use keyspace notifications and an external daemon to subscribe and manage the cache structure. Cons: clunky; can miss events upon disconnect; being async, eviction could delete more than needed.
  3. Store the set of conj keys in a cache key and check integrity on cache fetch. Cons: slower fetch, substantial code complication.

The ideal solution would be a custom eviction strategy, probably Lua-based - redis/redis#2319. Another good solution could be managing the cache structure with a Lua script subscribed to keyspace notifications - redis/redis#2540.

@Suor, what are the chances that you could provide a script (or guidance on what the script needs to do) for option 1? We need to put something in place until cacheops supports a solid solution natively (hopefully option 2 or 3).

I won't provide a script, but I can elaborate on the strategy:

  • use the INFO MEMORY command to find out whether usage is above the limit,
  • select some keys with RANDOMKEY and pick the conj:* ones among them,
  • for each conjunction key, fetch its members and delete those keys along with the conjunction key itself:
keys = redis_client.smembers(conj_key)
redis_client.delete(conj_key, *keys)

(the last step is better run in Lua for atomicity - see the sketch below)
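
Putting that together, a rough sketch of strategy 1 in Python (the memory limit, the sample count, and the helper names are mine, not cacheops API):

import redis

redis_client = redis.Redis()
MEMORY_LIMIT = 4 * 2**30  # example soft limit, in bytes

# Fetch a conj key's members and delete them together with the conj key
# itself; running this as a Lua script keeps the whole step atomic
delete_conj = redis_client.register_script("""
    local keys = redis.call('smembers', KEYS[1])
    table.insert(keys, KEYS[1])
    return redis.call('del', unpack(keys))
""")

def evict_some(samples=1000):
    # INFO MEMORY reports the current usage
    if redis_client.info('memory')['used_memory'] <= MEMORY_LIMIT:
        return
    for _ in range(samples):
        key = redis_client.randomkey()
        if key and key.startswith(b'conj:'):
            delete_conj(keys=[key])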

The alternative, let's call it strategy 1a, is probably better:

  • use CACHEOPS_LRU = True and maxmemory-policy volatile-lru (the second strategy from the README)
  • periodically SCAN for conjunction keys and remove them if they are orphaned:
for conj_key in redis_client.scan_iter(match='conj:*'):
    keys = redis_client.smembers(conj_key)
    # EXISTS with several keys returns how many of them exist
    exists = redis_client.execute_command('EXISTS', *keys)
    if exists == 0:
        redis_client.delete(conj_key)

(the innards of the loop should be done with Lua for atomicity - sketched below)

Edit: changed maxmemory-policy volatile-ttl to volatile-lru, which is the one used by the second README strategy.
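
Here is a sketch of how the loop above could push the check-and-delete into Lua (the script name is mine; for very large conj sets the EXISTS call would need chunking):

import redis

redis_client = redis.Redis()

# Delete a conj key only if none of its member cache keys still exist;
# doing the check and the delete server-side makes them atomic
drop_orphan = redis_client.register_script("""
    local keys = redis.call('smembers', KEYS[1])
    if #keys == 0 or redis.call('exists', unpack(keys)) == 0 then
        return redis.call('del', KEYS[1])
    end
    return 0
""")

for conj_key in redis_client.scan_iter(match='conj:*'):
    drop_orphan(keys=[conj_key])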

Hi all. I'm trying to understand what it means to use the eviction policy recommended in the README, which is CACHEOPS_LRU = True and maxmemory-policy volatile-lru. If I run my cache like this, do I lose the ability to expire cached views based on time? Is the only way to remove an item from the cache to let it get 'pushed out' by newer items?

What I want is to have my view cache expire after 24 hours like normal, BUT if I hit the max memory limit, the oldest items are pushed out to make room for the new ones.

No, you don't lose the ability to expire by time. The only downside is that invalidation structures can clutter your redis db over time; cache keys are still evicted by timeout.
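
For reference, that README strategy amounts to roughly this (a sketch; the 24-hour timeout and the memory values are just examples):

# settings.py
CACHEOPS_REDIS = "redis://localhost:6379/1"
CACHEOPS_LRU = True  # conj keys get no TTL, so volatile-lru only touches cache keys
CACHEOPS = {
    '*.*': {'ops': 'all', 'timeout': 60 * 60 * 24},  # cache keys still expire after 24h
}

# redis.conf
# maxmemory 512mb
# maxmemory-policy volatile-lru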

Oh ok. So if I understand this right, two keys are created for each item that is cached: one is the actual content and the other is the invalidation instructions. With the method I mentioned, the content keys will be removed, but the invalidation keys will remain? And if I run the conj_key function as a management command every so often, those invalidation keys will be cleared out?

Several conj_keys may refer to a single cache key; here is the description of how it works. When you use CACHEOPS_LRU = True, conj_keys are not evicted by time, so they may clutter up, referencing non-existing cache keys. They are still removed on the corresponding invalidation events, so this might not be an issue.
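
Schematically, the structure looks like this (a hypothetical example - the key names are illustrative, only the prefixes follow cacheops conventions):

conj:myapp_post:author_id=1     # a Redis SET acting as an invalidator
    q:6b29d3...                 # cache keys holding cached querysets
    q:f41a08...

Invalidating that conjunction means deleting the SET's members and then the SET itself; a single cache key may appear in several such sets.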

There is no such thing as a conj_key function. You basically need to go through the conj_keys, check whether they refer only to non-existing cache keys, and remove them if so - I wrote the draft above. It could be improved though: remove non-existing cache keys from the conj key one by one, instead of checking all of them and only ever removing the whole set:

for conj_key in r.scan_iter(match='conj:*'):
    for cache_key in r.smembers(conj_key):
        # These two lines should be done atomically
        if not r.exists(cache_key):
            r.srem(conj_key, cache_key) 

Redis automatically removes keys for empty sets, so that's it.
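
The atomic part could again be a small Lua script, along these lines (a sketch, names mine):

import redis

r = redis.Redis()

# SREM a cache key from the conj set only if that cache key no longer
# exists, so a concurrent cache write cannot be lost in between
prune_member = r.register_script("""
    if redis.call('exists', KEYS[2]) == 0 then
        return redis.call('srem', KEYS[1], KEYS[2])
    end
    return 0
""")

for conj_key in r.scan_iter(match='conj:*'):
    for cache_key in r.smembers(conj_key):
        prune_member(keys=[conj_key, cache_key])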

Thank you for the quick reply. I’m trying out this strategy and will report back.

I know it would take a large amount of effort to do right, but I think it would be beneficial if we could configure multiple cache backends. That would also make this memory limit issue easily solvable by running multiple redis instances (which many people do already, since redis is single-threaded).

This has nothing to do with other backends. BTW cacheops doesn't use other backends because it uses sets and set operations in redis, which other backends just don't provide.

You misunderstood. I'm talking about multiple redis servers so you can have memory limits through redis.

Multiple redises have nothing to do with the memory limit.

I don't see why not? You could have multiple redis servers and specify maxmemory for each server separately.

For example, assuming you have your sessions in redis, you want to be absolutely certain they will never reach an out-of-memory scenario. Whereas many cache layers don't have any real priority, so you can set that server to allkeys-lru and omit the need for SETEX or EXPIRE.

None of this matters from the cacheops implementation point of view; multiple-server support and memory limits are completely independent issues. There is no reason to talk about multiple servers or backends here.

Using CACHEOPS_INSIDEOUT = True is the blessed way to solve this now, see Using memory limit in the README.
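
A minimal sketch of that setup (the memory values are examples; the policy choice is an assumption - check the README's "Using memory limit" section for the recommended settings):

# settings.py
CACHEOPS_INSIDEOUT = True

# redis.conf
# maxmemory 512mb
# maxmemory-policy volatile-lru  # assumption - see the README for the advised policy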