simonw / llm-anyscale-endpoints

LLM plugin for models hosted by Anyscale Endpoints

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

llm-anyscale-endpoints

PyPI Changelog Tests License

LLM plugin for models hosted by Anyscale Endpoints

Installation

First, install the LLM command-line utility.

Now install this plugin in the same environment as LLM.

llm install llm-anyscale-endpoints

Configuration

You will need an API key from Anyscale Endpoints. You can obtain one here.

You can set that as an environment variable called LLM_ANYSCALE_ENDPOINTS_KEY, or add it to the llm set of saved keys using:

llm keys set anyscale-endpoints
Enter key: <paste key here>

Usage

To list available models, run:

llm models list

You should see a list that looks something like this:

AnyscaleEndpoints: meta-llama/Llama-2-7b-chat-hf
AnyscaleEndpoints: meta-llama/Llama-2-13b-chat-hf
AnyscaleEndpoints: meta-llama/Llama-2-70b-chat-hf
AnyscaleEndpoints: codellama/CodeLlama-34b-Instruct-hf
AnyscaleEndpoints: mistralai/Mistral-7B-Instruct-v0.1
AnyscaleEndpoints: mistralai/Mixtral-8x7B-Instruct-v0.1
AnyscaleEndpoints: Open-Orca/Mistral-7B-OpenOrca
AnyscaleEndpoints: HuggingFaceH4/zephyr-7b-beta

To run a prompt against a model, pass its full model ID to the -m option, like this:

llm -m meta-llama/Llama-2-70b-chat-hf \
  'Five strident names for a pet walrus' \
  --system 'You love coming up with creative names for pets'

You can set a shorter alias for a model using the llm aliases command like so:

llm aliases set llama70b meta-llama/Llama-2-70b-chat-hf

Now you can prompt Llama 2 70B using:

cat llm_anyscale_endpoints.py | \
  llm -m llama70b -s 'explain this code'

Development

To set up this plugin locally, first checkout the code. Then create a new virtual environment:

cd llm-anyscale-endpoints
python3 -m venv venv
source venv/bin/activate

Now install the dependencies and test dependencies:

pip install -e '.[test]'

To run the tests:

pytest

About

LLM plugin for models hosted by Anyscale Endpoints

License:Apache License 2.0


Languages

Language:Python 100.0%