thangchung / spin-ai-models

A Spin plugin for installing LLMs into Spin applications

spin ai-models

A Spin plugin for downloading LLMs and installing them in the right place in your Spin app directory.

Installation

For now, build from source and install the plugin locally:

cargo build --release
spin pluginify --install

Usage

spin ai-models install

Options:

  • Name of the model (llama2-chat, codellama-instruct, or all-minilm-l6-v2). If omitted, the plugin prompts you to choose, and you can select more than one.
  • -f: the app to install the models into

Known issues

Honestly, most of the plugin is issues with just a thin thread of stuff that works. The big one, the really big one, is that it does not parallelise the download of these enormous great files. Another thing I would like to do is add the ai_models declarations to spin.toml, at least for the easy case where there's only one component.
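
Until the plugin writes the declaration for you, it has to be added by hand. A minimal sketch of what that looks like, assuming a version 2 Spin manifest with a single component (the component name and source path below are hypothetical placeholders, not something the plugin generates):

```toml
# spin.toml (fragment). "my-component" and the source path are placeholders.
[component.my-component]
source = "target/wasm32-wasi/release/my_component.wasm"
# Grant this component access to the model that was installed.
ai_models = ["llama2-chat"]
```

In a version 1 manifest the same ai_models key goes under the relevant [[component]] entry instead.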

If you liked this

The spin cloud-gpu plugin (https://github.com/fermyon/spin-cloud-gpu) lets you run supported models on Fermyon Cloud even while running your app locally. GPU acceleration, no downloading 7GB files, what's not to love?

Languages

Language: Rust 100.0%