viktor-ferenczi / vllm-client

vLLM client with minimal dependencies


vLLM Client

Overview

Client for the vLLM API with minimal dependencies.

Installation

pip install vllm-client

Examples

See example.py for the following:

  • Single generation
  • Streaming
  • Batch inference

The examples should work out of the box against a running vLLM API server.
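For orientation, here is a minimal sketch of a single generation request using only the Python standard library. It does not use this package's own API (see example.py for that). It assumes the server is vLLM's demo API server, which accepts a POST to /generate with a JSON body holding `prompt`, `stream` and sampling parameters, and returns the generated texts under a `text` key. The helper names `build_generate_request` and `generate` are hypothetical.

```python
import json
import urllib.request


def build_generate_request(prompt, stream=False, **sampling_params):
    """Build the JSON body for a /generate request.

    Assumption: the server accepts sampling parameters (for example
    max_tokens or temperature) as top-level fields next to the prompt.
    """
    payload = {"prompt": prompt, "stream": stream}
    payload.update(sampling_params)
    return payload


def generate(base_url, prompt, timeout=60.0, **sampling_params):
    """POST one prompt and return the list of generated texts.

    Assumption: the response body is JSON with a "text" list.
    """
    body = json.dumps(build_generate_request(prompt, **sampling_params))
    request = urllib.request.Request(
        base_url.rstrip("/") + "/generate",
        data=body.encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))["text"]
```

With a server listening locally, a call such as `generate("http://localhost:8000", "Hello, my name is", max_tokens=16)` would return the completions as a list of strings. The host and port are placeholders for wherever the server runs.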

Notes

  • sampling_params.py needs to be kept in sync with vLLM. It is a simplified version of their class, containing only the code required on client side.
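To illustrate what a client-side simplification might look like, here is a hedged sketch. It is not the contents of sampling_params.py. The class name `ClientSamplingParams` is hypothetical, it covers only a small subset of the fields found on vLLM's `SamplingParams`, and its defaults and validation rules are illustrative and would need to be checked against the vLLM version in use.

```python
from dataclasses import asdict, dataclass, field
from typing import List


@dataclass
class ClientSamplingParams:
    """Hypothetical client-side subset of sampling parameters."""

    n: int = 1                  # number of completions per prompt
    temperature: float = 1.0    # 0.0 means greedy decoding
    top_p: float = 1.0          # nucleus sampling threshold
    max_tokens: int = 16        # upper bound on generated tokens
    stop: List[str] = field(default_factory=list)  # stop strings

    def __post_init__(self):
        # Validate locally so mistakes surface before a request is sent.
        if self.n < 1:
            raise ValueError("n must be at least 1")
        if self.temperature < 0.0:
            raise ValueError("temperature must be non-negative")
        if not 0.0 < self.top_p <= 1.0:
            raise ValueError("top_p must be in (0, 1]")
        if self.max_tokens < 1:
            raise ValueError("max_tokens must be at least 1")

    def to_request_fields(self):
        """Return the parameters as a plain dict ready for JSON encoding."""
        return asdict(self)
```

Keeping only data and validation on the client side avoids importing vLLM itself, which is the reason such a copy has to be updated by hand when the server-side class changes.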

Other programming languages


License: Apache License 2.0


Languages

Python 67.7%, C# 32.3%