hildazzz's starred repositories
Bert-VITS2-ext
Facial expression and animation testing based on Bert-VITS2.
chn_text_norm
Chinese text normalization.
python_speech_features
This library provides common speech features for ASR including MFCCs and filterbank energies.
flow-matching
Annotated Flow Matching paper
torch-stft
An STFT/iSTFT for PyTorch.
silero-models
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
python-soundfile
SoundFile is an audio library based on libsndfile, CFFI, and NumPy
adam-atan2-pytorch
Implementation of the proposed Adam-atan2 from Google DeepMind in PyTorch
CosyVoice_For_Windows
A version of CosyVoice for use on Windows
sap-voicebox
Speech Processing Toolbox for MATLAB
persona-hub
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
ScriptsForVoxBlink2
Official Repository For VoxBlink2
modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
best-rq-pytorch
Implementation of BEST-RQ, a model for self-supervised learning of speech signals using a random projection quantizer, in PyTorch.
ChatTTS_colab
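🚀 One-click deployment (offline bundle included)! Built on ChatTTS, with support for streaming output, random voice-timbre sampling, long-audio generation, and multi-role reading. Simple to use, with no complicated installation required.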
I_am_a_person
A GPT-powered digital human with real-time interaction
SenseVoice
Multilingual Voice Understanding Model