Emmanuel Schmidbauer's repositories
websocket-audio-stream
pyaudio & websocket to stream real-time audio to speakers
voicefixer
General Speech Restoration
acoustic-model
Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
CoMoSpeech
One-step diffusion-based speech synthesis
FlexFlow
A distributed deep learning framework.
flutter_sherpa_onnx
Flutter plugin wrapping the Sherpa-ONNX runtime
freeswitch
FreeSWITCH is a Software Defined Telecom Stack enabling the digital transformation from proprietary telecom switches to a versatile software implementation that runs on any commodity hardware. From a Raspberry PI to a multi-core server, FreeSWITCH can unlock the telecommunications potential of any device.
greenswitch
Battle-proven FreeSWITCH Event Socket Protocol client implementation with Gevent
kamailio
Kamailio - The Open Source SIP Server
metaseq
Repo for external large-scale work
peerless
Peerless Animate API
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean.
mod_audio_stream
FreeSWITCH module to stream audio to websocket and receive response
mod_vad
A voice activity detection module for FreeSWITCH.
NeMo-text-processing
NeMo text processing for ASR and TTS
pkg-kamailio-docker
Docker files to easily build Kamailio on different Debian/Ubuntu releases
RAD-MMM
A TTS model that makes a speaker speak new languages
RVC_CLI
RVC CLI enables seamless interaction with Retrieval-based Voice Conversion through commands or HTTP requests.
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
whisper-cpp-server
whisper-cpp-server
X-E-Speech-code
X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion