mozilla / DeepSpeech

DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.

Project DeepSpeech

DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier.

Documentation for installation, usage, and model training is available on deepspeech.readthedocs.io.

Pre-trained models and checkpoints are published with the latest release on GitHub.
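As a rough illustration of what using a pre-trained model from Python can look like, here is a minimal sketch. It assumes the `deepspeech` and `numpy` packages are installed and that a model file from a release has been downloaded; the helper names `load_pcm16` and `transcribe` and all file paths are hypothetical and chosen only for this example. The official documentation remains the authoritative reference for the API.

```python
import array
import sys
import wave


def load_pcm16(path):
    """Read a mono 16-bit PCM WAV file and return (sample_rate, samples)."""
    with wave.open(path, "rb") as wav:
        if wav.getnchannels() != 1 or wav.getsampwidth() != 2:
            raise ValueError("expected mono 16-bit PCM audio")
        rate = wav.getframerate()
        frames = wav.readframes(wav.getnframes())
    samples = array.array("h")  # signed 16-bit integers
    samples.frombytes(frames)
    if sys.byteorder == "big":
        samples.byteswap()  # WAV data is little-endian
    return rate, samples


def transcribe(model_path, wav_path, scorer_path=None):
    """Hypothetical helper: run speech-to-text on one WAV file."""
    # Imported lazily so load_pcm16 works without these packages installed.
    import numpy as np
    from deepspeech import Model

    model = Model(model_path)
    if scorer_path:
        # The external scorer (language model) is optional.
        model.enableExternalScorer(scorer_path)

    rate, samples = load_pcm16(wav_path)
    if rate != model.sampleRate():
        raise ValueError(
            f"audio is {rate} Hz but the model expects {model.sampleRate()} Hz"
        )
    # The Python binding expects a NumPy array of 16-bit samples.
    return model.stt(np.array(samples, dtype=np.int16))
```

A call might look like `transcribe("model.pbmm", "audio.wav", "model.scorer")`, where the three file names are placeholders for files you supply yourself. The sketch rejects audio at the wrong sample rate instead of resampling it, which keeps the example short.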

For contribution guidelines, see CONTRIBUTING.rst.

For contact and support information, see SUPPORT.rst.

About

License: Mozilla Public License 2.0


Languages

C++ 46.9%
Python 21.4%
C 11.2%
Shell 10.8%
C# 2.8%
Swift 1.8%
Java 1.3%
Makefile 0.9%
CMake 0.9%
TypeScript 0.6%
SWIG 0.5%
JavaScript 0.4%
Starlark 0.4%
Awk 0.1%
Ruby 0.0%
Objective-C 0.0%