codefuse-ai / ModelCache

A LLM semantic caching system aiming to enhance user experience by reducing response time via cached query-result pairs.

codefuse-ai/ModelCache Issues

[Feature: Ranking ability] Add ranking model to refine the order of data after embedding recall
Closed 2 months ago1
Can ModelChat be used in FastChat?
Updated 2 months ago1
Params not used in code
Closed 2 months ago1
cache是基于prompt的缓存？
Closed 8 months ago1
非常感谢蚂蚁开源code模型
Closed 10 months ago3