llm-edge / hal-9100

Edge full-stack LLM platform. Written in Rust


[optimization] caching requests, etc.

louis030195 opened this issue · comments

https://github.com/zilliztech/GPTCache

GPTCache only caches the retrieval part.

In Assistants we could cache:

  • function calls
  • retrieval
  • actions
  • code interpreter

in Redis, for example
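A minimal sketch of the idea: key each expensive operation (function call, retrieval, code-interpreter run) by a hash of its request and return the stored response on a hit. Everything here is hypothetical, not hal-9100's actual API, and an in-memory `HashMap` stands in for Redis (in production the `get`/`insert` pair would be Redis `GET`/`SET`, typically with a TTL):

```rust
use std::collections::HashMap;
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

// Hypothetical cache keyed by a hash of the request payload.
struct ResponseCache {
    entries: HashMap<u64, String>,
}

impl ResponseCache {
    fn new() -> Self {
        Self { entries: HashMap::new() }
    }

    fn key(request: &str) -> u64 {
        let mut h = DefaultHasher::new();
        request.hash(&mut h);
        h.finish()
    }

    /// Return the cached response, or run `compute` (the expensive LLM,
    /// retrieval, or code-interpreter call) and store its result.
    fn get_or_compute(&mut self, request: &str, compute: impl FnOnce() -> String) -> String {
        let k = Self::key(request);
        if let Some(hit) = self.entries.get(&k) {
            return hit.clone();
        }
        let response = compute();
        self.entries.insert(k, response.clone());
        response
    }
}
```

One caveat this sketch ignores: exact-match keys only help for identical requests, whereas GPTCache does semantic (embedding-based) matching, so the two approaches are complementary.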

i mean there are thousands of ways to slash latency and cost, it's not a very difficult problem