semicarryispig's starred repositories
ChatTTS_colab
🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
DeepFilterNet
Noise supression using deep filtering
xiaoyuzhoufmdownload
下载小宇宙播客中的音频
AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
chinese_speech_pretrain
chinese speech pretrained models
brouhaha-vad
Predicts the level of noise and reverberation on your audiofiles
parler-tts
Inference and training library for high-quality TTS models.
mlm-scoring
Python library & examples for Masked Language Model Scoring (ACL 2020)
hume-python-sdk
Python client for Hume AI APIs
Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
spear-tts-pytorch
Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch
MediaCrawler
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
metavoice-src
Foundational model for human-like, expressive TTS