varunshenoy / super-json-mode

Low latency JSON generation using LLMs ⚡️

Repository from GitHub: https://github.com/varunshenoy/super-json-mode

Ollama integration processes only one request at a time

Namangarg110 opened this issue · comments

Dear @varunshenoy,

I modified the code to integrate Ollama into Super JSON Mode. However, it is unable to do batch processing. If you think it would be an acceptable placeholder until Ollama adds batch processing, I can open a PR.

Best,
Naman

This is a known issue. I'm going to wait until Ollama or llama-cpp-python natively supports batching before accepting any PRs.
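Until Ollama supports server-side batching, one common workaround is to emulate a batch by firing independent single-prompt requests concurrently from the client. The sketch below is hypothetical (it is not Super JSON Mode's actual API); `generate` stands in for whatever single-prompt Ollama call the integration uses:

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List


def pseudo_batch(
    prompts: List[str],
    generate: Callable[[str], str],
    max_workers: int = 4,
) -> List[str]:
    """Emulate batching by issuing independent single-prompt calls
    concurrently. Results come back in the same order as the inputs."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(generate, prompts))


# Stand-in for a single-prompt Ollama call (hypothetical):
def fake_generate(prompt: str) -> str:
    return f"completion for: {prompt}"


results = pseudo_batch(["name", "age", "city"], fake_generate, max_workers=2)
```

This only hides per-request latency behind concurrency; it does not give the throughput of true batched inference, which is why waiting for native support in Ollama or llama-cpp-python is the cleaner long-term fix.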