egxn / boo

πŸ… Boo wraps tts, sst, llm on a simple REST API to make it easy to use them in local apps. 🐯

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

🐯 Boo

Boo is a local REST API to use ML models in local desktop apps.

Models

You need install each tool following the instructions in the model's repository.

  • 🐸 Coqui for text to speech
  • πŸ¦™ Llama for large language models
  • πŸ‘οΈβ€πŸ—¨οΈ PaddleOCR for OCR (in progress)
  • πŸ‘„ Whisper for speech to text

How it works

  • ⚑ FastAPI for the REST API and websockets.
  • πŸŸ₯ Redis for queueing.

Additional requirements are listed in the requirements.txt file.

Server Development

./run.sh

How to use it in your app

  1. Create a new websocket connection to ws://localhost:5000/ws/{client_id}
  2. Consume the API (using the same the client_id).
  3. The tasks are queued using the REST API and the outcome will be sent using the websocket.

Example clients list

Web addon for chapGPT

Adds a button to the right of the <p> tags with the p selected to generate audio from the selected text and a button on the bottom input to transcribe audio to text.

Web addon Parrot

Adds a parrot button at the right of to the <p> tags with the p selected to generate audio from the selected text.

About

πŸ… Boo wraps tts, sst, llm on a simple REST API to make it easy to use them in local apps. 🐯


Languages

Language:Python 45.4%Language:JavaScript 25.4%Language:TypeScript 21.1%Language:CSS 4.5%Language:HTML 3.1%Language:Shell 0.5%