Beast code in Giters

Ciaran O'Reilly's repositories

vosk-browser

A speech recognition library running in the browser thanks to a WebAssembly build of Vosk

Language:JavaScriptApache-2.0361 19 63

LocalSTT

Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech

Language:JavaGPL-3.094 12 8

wav2vec2-service

Language:PythonMIT38 3 6

espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

Language:CGPL-3.01 20

NetflixEnCatala

Extensió pel Chrome que automàticament silencia l'àudio de Netflix i reprodueix el doblatge en català.

Language:JavaScriptMIT1 20

robust-wav2vec2-sprint

Language:Python1 30

captioner

Generate subtitles of videos in the browser

Language:TypeScript020

clapack-wasm

Language:C020

commonvoice-utils

Linguistic processing for Common Voice

Language:PythonAGPL-3.0020

conv_ssl

Language:Python010

coqui-ai-tensorflow

An Open Source Machine Learning Framework for Everyone

Language:C++Apache-2.0020

datapipe

An audio ETL pipeline for generating datasets from youtube sources

Language:PythonAGPL-3.0030

jocsdemots

Language:PythonApache-2.0030

kaldi

Language:ShellNOASSERTION030

QuickVC-VoiceConversion

QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion

Language:PythonMIT010

raspberry-pi-pwm-fan-control

raspberry pi pwm fan control

Language:PythonGPL-3.0020

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:PythonNOASSERTION010

speech-to-speech

000

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonApache-2.0010

spksrc

Cross compilation framework to create native packages for the Synology's NAS

Language:MakefileNOASSERTION010

streaming-source-separation

Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.

Language:Python010

STT

The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

Language:C++MPL-2.0020

telegram-deepspeech-bot

A Telegram bot that infers text from voice notes using DeepSpeech

Language:PythonMIT030

tevr-asr-tool

State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.

Language:CMIT020

text2lang

Language detection api based on ivanlau/language-detection-fine-tuned-on-xlm-roberta-base

Language:PythonMIT030

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.0020

VoiceActivityProjection

Voice Activity Projection Models: Self-supervised learning of Turn-taking Events

Language:PythonMIT010

vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Language:C++Apache-2.0020

vosk-server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

Language:PythonApache-2.0020

vscode-audio-preview

VS Code Extension to preview and play wav file.

Language:TypeScriptMIT020