gachaun's repositories
AndroidFFmpeg
android 读取摄像头和麦克风,使用rtmp推流
audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Bert-VITS2
vits2 backbone with multilingual-bert
Dive-into-DL-PyTorch
本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为PyTorch实现。
espnet
End-to-End Speech Processing Toolkit
forced-alignment-tools
A collection of links and notes on forced alignment tools
git-recipes
🥡 Git recipes in Chinese by Zhongyi Tong. 高质量的Git中文教程.
NativeSpeaker
make your Speaker talking as Native style with own voice!
NDK_OpenGLES_3_0
Android OpenGL ES 3.0 从入门到精通系统性学习教程
OpenGLCamera2
🔥 Android OpenGL Camera 2.0 实现 30 多种滤镜和抖音特效
PL-BERT
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
speechbrain
A PyTorch-based Speech Toolkit
vits_chinese
vits chinese, tts chinese, tts mandarin 史上训练最简单,音质最好的语音合成系统
vits_chinese-1
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support streaming out!
stable-ts
Transcription, forced alignment, and audio indexing with OpenAI's Whisper
Statistical-Learning-Method_Code
手写实现李航《统计学习方法》书中全部算法
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
vits-simple-api
A simple VITS HTTP API, developed by extending Moegoe with additional features.
voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).