Error calling Chat RTX as localhost
manassm opened this issue · comments
I am reading the docs over at Chat RTX, and it doesn't seem to be compatible with the OpenAI-standard chat completions endpoints.
You might need to use another LLM server like Ollama or vLLM, or find a way to expose OpenAI-style chat endpoints for Chat RTX.
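For reference, Ollama does expose an OpenAI-compatible chat completions endpoint at `http://localhost:11434/v1` by default. A minimal sketch of calling it with only the standard library (the model name `llama3` is just an example; substitute whatever model you have pulled, and adjust the host/port if your server differs):

```python
import json

# Ollama's default OpenAI-compatible endpoint (assumes a local Ollama
# install on the default port; adjust for your setup).
OLLAMA_URL = "http://localhost:11434/v1/chat/completions"

def build_chat_request(model, prompt):
    """Build an OpenAI-style chat completions payload."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }

if __name__ == "__main__":
    from urllib.request import Request, urlopen

    payload = build_chat_request("llama3", "Hello!")
    req = Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urlopen(req) as resp:
        print(json.load(resp)["choices"][0]["message"]["content"])
```

Any client that speaks the OpenAI chat protocol (including the official `openai` Python package with `base_url` overridden) should work against this endpoint the same way.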
I am getting a similar error pattern using a local LLM with Ollama, and I also tried LM Studio. I can see that the servers are talking, but there is always an error about something not being defined, whether it is 'maths', 'sin', etc.; there is always an excuse.
I was looking over the Ollama server logs and noticed this:
level=WARN source=server.go:230 msg="multimodal models don't support parallel requests yet"
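If that warning is related, one thing worth trying is disabling parallel request handling when starting the server (`OLLAMA_NUM_PARALLEL` is a documented Ollama environment variable; whether it affects this particular error is an assumption):

```shell
# Restrict Ollama to one in-flight request at a time.
OLLAMA_NUM_PARALLEL=1 ollama serve
```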
What are your prompts?