[Request]: Expose KV Cache Reset functionality.

Question

Nick-infinity opened this issue 7 months ago · comments

I think the KV cache for LLM inference is maintained internally by MNN. Is it possible to reset the KV cache manually using an API call?
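To make the request concrete, here is a minimal sketch of the behavior being asked for. It is a toy, framework-independent illustration: `ToyKVCache` and its `append`, `seq_len`, and `reset` methods are hypothetical names and do not correspond to any existing MNN API or to how MNN stores its cache internally.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class ToyKVCache:
    """Hypothetical per-layer key/value store (not MNN's internal structure)."""

    num_layers: int
    # One list of (key, value) pairs per layer, one pair per cached token.
    layers: List[List[Tuple[list, list]]] = field(init=False)

    def __post_init__(self) -> None:
        self.layers = [[] for _ in range(self.num_layers)]

    def append(self, layer: int, key: list, value: list) -> None:
        """Cache the key/value vectors of one newly processed token."""
        self.layers[layer].append((key, value))

    def seq_len(self) -> int:
        """Number of tokens currently cached (read from the first layer)."""
        return len(self.layers[0]) if self.layers else 0

    def reset(self) -> None:
        """Drop all cached tokens so the next prompt starts from an empty context."""
        for entries in self.layers:
            entries.clear()


if __name__ == "__main__":
    cache = ToyKVCache(num_layers=2)
    cache.append(0, [1.0], [2.0])
    cache.append(1, [3.0], [4.0])
    print(cache.seq_len())  # tokens cached before the reset
    cache.reset()
    print(cache.seq_len())  # tokens cached after the reset
```

The request is for an exposed call that plays the role of `reset()` here, so that a new conversation can start without recreating the model.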

github-actions · Answer 1 · Thu Apr 18 2024 17:26:34 GMT+0800 (China Standard Time)

Marking as stale. No activity in 30 days.