FR: Support local server for embeddings
ArtificialAmateur opened this issue
Jumping off of #302
Like the local server options for Smart Chat, similar support could be added for embeddings.
The OpenAI-format endpoint (which both LM Studio and Ollama support) is `/v1/embeddings`.
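For reference, a minimal sketch of what a request to that endpoint could look like. The base URL, port, and model name here are assumptions (LM Studio serves on port 1234 by default, Ollama on 11434), not the plugin's actual configuration:

```ts
// Sketch: OpenAI-compatible embeddings request against a local server.
// Base URL and model name are placeholders; adjust for LM Studio/Ollama.
async function embed(texts: string[]): Promise<number[][]> {
  const res = await fetch("http://localhost:1234/v1/embeddings", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      model: "nomic-embed-text", // hypothetical local embedding model
      input: texts,
    }),
  });
  if (!res.ok) throw new Error(`Embedding request failed: ${res.status}`);
  const data = await res.json();
  // OpenAI format: { data: [{ embedding: number[], index: number }, ...] }
  return data.data.map((d: { embedding: number[] }) => d.embedding);
}
```

Since the request/response shape matches the hosted OpenAI API, the existing embedding code path could presumably be reused with just a configurable base URL.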
I'd love this; the embedded WASM models don't seem to saturate the CPU/GPU, so embedding takes ages...
Makes sense. Thanks for the feature request 😊🌴