pcdinh / LibreASR

:speech_balloon: An On-Premises, Streaming Speech Recognition System

Home Page: https://github.com/iceychris/LibreASR

LibreASR

An On-Premises, Streaming Speech Recognition System

Built with PyTorch and fastai

Example Apps

React Web App, ESP32-LyraT

Quickstart

docker run -it -p 8080:8080 iceychris/libreasr:latest

Then open http://localhost:8080/ in your browser.
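If the page does not load right away, the container may still be starting. The following is a minimal sketch for checking from a script whether something answers on that address. It assumes only that the container serves plain HTTP at the root path of port 8080, as the Quickstart implies; the helper name `is_reachable` is hypothetical and not part of LibreASR.

```python
import http.client
import urllib.error
import urllib.request


def is_reachable(url="http://localhost:8080/", timeout=3.0):
    """Return True if an HTTP GET on `url` succeeds with a 2xx/3xx status.

    Assumption: the LibreASR container answers plain HTTP at this address.
    Connection errors, timeouts and HTTP error statuses all yield False.
    """
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return 200 <= resp.status < 400
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        # URLError also covers HTTPError (4xx/5xx); OSError covers
        # refused connections and socket timeouts.
        return False
```

Calling `is_reachable()` after `docker run` should return `True` once the web app is up, and `False` while nothing is listening on the port.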

Features

Performance

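| Model   | Dataset | Network  | Params | CER (dev) | WER (dev) |
|---------|---------|----------|--------|-----------|-----------|
| english | 1400h   | 6-2-1024 | 70M    | 18.9      | 23.8      |
| german  | 800h    | 6-2-1024 | 70M    | 23.2      | 37.6      |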

These results are clearly not state of the art. Training the models for longer and on multiple GPUs (instead of a single RTX 2080 Ti) would likely yield better results.
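For readers unfamiliar with the two metrics: CER and WER are conventionally the edit distance between the reference transcript and the model output, divided by the reference length, counted in characters or words respectively. The sketch below shows that standard definition. Whether LibreASR applies exactly this computation (for example, how it normalizes text or treats spaces) is an assumption; the function names `edit_distance` and `error_rate` are illustrative only.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (each edit costs 1)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            curr.append(min(
                prev[j] + 1,             # delete r from the reference
                curr[j - 1] + 1,         # insert h into the reference
                prev[j - 1] + (r != h),  # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]


def error_rate(refs, hyps, unit="word"):
    """Corpus-level error rate in percent; unit is "word" (WER) or "char" (CER)."""
    if unit not in ("word", "char"):
        raise ValueError("unit must be 'word' or 'char'")
    edits = 0
    length = 0
    for ref, hyp in zip(refs, hyps):
        r = ref.split() if unit == "word" else list(ref)
        h = hyp.split() if unit == "word" else list(hyp)
        edits += edit_distance(r, h)
        length += len(r)
    if length == 0:
        raise ValueError("references are empty")
    return 100.0 * edits / length


refs = ["the cat sat on the mat"]
hyps = ["the cat sit on mat"]
wer = error_rate(refs, hyps, unit="word")  # 2 edits over 6 reference words
cer = error_rate(refs, hyps, unit="char")  # 5 edits over 22 reference characters
```

Note that spaces count as characters in this CER variant, and that both rates can exceed 100 when the output is much longer than the reference.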

See releases for pretrained models.

Training

Language Model

See this Colab notebook or use this notebook.

Contributing

Feel free to open an issue, create a pull request or join the Discord.

You may also contribute by training a large model for longer.

References & Credits

About

License: MIT License


Languages

Python 76.9%, Jupyter Notebook 8.0%, JavaScript 6.3%, C 5.6%, HTML 1.2%, Makefile 1.0%, CSS 0.4%, Shell 0.4%, Nix 0.2%, CMake 0.2%