h2oai / h2ogpt

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/

Home Page:http://h2o.ai

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

vLLM GROQ issue

chrissas opened this issue · comments

@pseudotensor

GROQ url mentioned in the documentation is wrong, I had confirmation from the groq team https://github.com/h2oai/h2ogpt/blob/main/docs/FAQ.md.

https://api.groq.com/openai:None:/v1: is deprecated and unused.

image

I tried their url but I still get this error
Exception: Error code: 404 - {'error': {'message': 'Unknown request URL: POST /openai/v1/completions:/completions. Please check the URL for typos, or see the docs at https://console.groq.com/docs/', 'type': 'invalid_request_error', 'code': 'unknown_url'}}
image

command :
python generate.py --model_lock="[{'inference_server':'vllm:https://api.groq.com/openai/v1/chat/completions:', 'base_model':'mixtral-8x7b-32768', 'max_seq_len': 31744, 'prompt_type':'plain'}]"

command :
python generate.py --model_lock="[{'inference_server':'vllm:https://api.groq.com/openai:None:/v1/chat:GroqAPIkey', 'base_model':'mixtral-8x7b-32768', 'max_seq_len': 31744, 'prompt_type':'plain'}]"

image

Use the other way mentioned for groq a bit lower in the FAQ, i.e.

image