feat: Waiting for embedding & rerank models
hkwz2023 opened this issue
hkwz2023 commented
It would be great if the server could support loading embedding and rerank models and expose the corresponding APIs. Looking forward to it.
Brad commented
Seconding this. Projects like LocalAI and Xinference already have them, and Ollama + LiteLLM might integrate rerank models as well.