yuntaozhu / atra

An open source ASR as a service Server focused on providing state of the art speech recognition systems with ease

Home Page:https://atra.ai

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ASR as a service

This project focuses on providing speech recognition across industries and languages.

The project is being developed on a private basis with support from A\Ware and Reibke. Currently I'm focusing on languages used in the German area, but I'm happy about any support for other languages.

With the exception of the audio dataset and training script (at the moment, coming in future), all code, used models and datasets are released under open source license. An example of training large ASR models on small hardware take a look at the simplepeft project https://github.com/flozi00/simplepeft

How to install

Install the ffmpeg package for loading the audio files and python3 to run the code

With both packages installed you can just run "pip install -r requirements.txt" to install the python librarys and then start the server with gradio UI using "python app.py" For faster deployment you can even use the docker container https://hub.docker.com/r/flozi00/asrasaservice

About

An open source ASR as a service Server focused on providing state of the art speech recognition systems with ease

https://atra.ai

License:MIT License


Languages

Language:JavaScript 95.3%Language:Python 1.7%Language:Dart 1.4%Language:C++ 0.7%Language:CMake 0.6%Language:HTML 0.1%Language:C 0.0%Language:Swift 0.0%Language:Dockerfile 0.0%Language:Kotlin 0.0%Language:Batchfile 0.0%Language:Objective-C 0.0%