nat / openplayground

An LLM playground you can run on your laptop

I added a Hugging Face read API key, but when I try to run a query I get a 503

ralyodio opened this issue

INFO:server.lib.inference:Requesting inference from databricks/dolly-v2-12b on huggingface
INFO:werkzeug:127.0.0.1 - - [24/Apr/2023 23:37:51] "POST /api/inference/text/stream HTTP/1.1" 200 -
ERROR:server.lib.inference:Error: Request failed: 503 Service Unavailable 

How do I fix this?

This looks like it is coming from the Hugging Face API rather than from openplayground itself. A 503 there usually means the model is still loading, so you'll have to wait a bit for it to "warm up" before you can call it for inference. https://huggingface.co/docs/api-inference/faq#:~:text=Rate%20limits&text=We%20try%20to%20balance%20the,errors%20saying%20models%20are%20loading.
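
If you want to check the warm-up behaviour outside the playground, here is a rough sketch of a retry loop in Python. It is not openplayground's code. The names `call_with_warmup_retry` and `hf_send` are made up for illustration, and the endpoint URL, the `{"inputs": ...}` payload, and the `estimated_time` field in the 503 body are assumptions about the hosted Hugging Face Inference API that you should check against the current docs.

```python
import json
import time
import urllib.error
import urllib.request


def call_with_warmup_retry(send, max_attempts=5, default_wait=10.0, sleep=time.sleep):
    """Call send() until it returns something other than 503, or attempts run out.

    send is any zero-argument callable returning (status_code, body).
    Returns the last (status_code, body) pair seen.
    """
    for attempt in range(1, max_attempts + 1):
        status, body = send()
        if status != 503:
            return status, body
        if attempt == max_attempts:
            break
        # Assumption: a "model is loading" 503 may carry an estimated_time hint
        # in seconds. Fall back to a fixed wait when it is missing.
        wait = default_wait
        if isinstance(body, dict) and isinstance(body.get("estimated_time"), (int, float)):
            wait = float(body["estimated_time"])
        sleep(wait)
    return status, body


def hf_send(model_id, token, prompt):
    """Hypothetical helper: one POST to the hosted inference endpoint.

    The URL and payload shape are assumptions, not taken from this thread.
    """
    url = f"https://api-inference.huggingface.co/models/{model_id}"
    request = urllib.request.Request(
        url,
        data=json.dumps({"inputs": prompt}).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    try:
        with urllib.request.urlopen(request, timeout=60) as response:
            return response.status, json.loads(response.read())
    except urllib.error.HTTPError as err:
        # urllib raises on 4xx/5xx, so turn the error back into (status, body).
        raw = err.read()
        try:
            return err.code, json.loads(raw)
        except ValueError:
            return err.code, raw.decode("utf-8", "replace")


# Example call (needs network access and a real token, so it is left commented out):
# status, body = call_with_warmup_retry(
#     lambda: hf_send("databricks/dolly-v2-12b", "hf_your_token_here", "Hello")
# )
```

The retry loop takes the request as a callable so you can try it with a fake response first. If it still returns 503 after several attempts, the model may simply be too large or too busy for the hosted API at that moment.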