我的AI世界 (My AI World)'s repositories

3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Language: Python | License: Apache-2.0 | 0 | 0 | 0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python | License: MIT | 0 | 0 | 0

bark

🔊 Text-Prompted Generative Audio Model

License: MIT | 0 | 0 | 0

Bert-VITS2

vits2 backbone with multilingual-bert

Language: Python | License: AGPL-3.0 | 0 | 0 | 0

ChatTTS

TTS

License: NOASSERTION | 0 | 0 | 0

Chinese-Names-Corpus

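A Chinese personal-name corpus and name generator. Covers Chinese full names, surnames, given names, forms of address, Japanese names, transliterated names, and English names. Useful for Chinese word segmentation and person-name entity recognition.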

License: Apache-2.0 | 0 | 0 | 0

CosyVoice

LLM-based TTS model providing full-stack inference, training, and deployment capabilities.

Language: Python | License: Apache-2.0 | 0 | 0 | 0

EasyBertVits2

Makes it easy to use Bert-VITS2, which generates emotionally expressive speech from text.

Language: Batchfile | License: MIT | 0 | 0 | 0

espeak-phonemizer

Uses ctypes and libespeak-ng to transform text into IPA phonemes

Language: Python | License: GPL-3.0 | 0 | 0 | 0

fish-speech

Brand new TTS solution

Language: Python | License: BSD-3-Clause | 0 | 0 | 0

FunASR

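A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. A speech recognition toolkit with a rich set of high-performing open-source pretrained models; it supports speech recognition, voice activity detection, text post-processing, and more, and can be deployed as a service.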

Language: Python | License: NOASSERTION | 0 | 0 | 0

GPT-SoVITS

As little as 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)

Language: Python | License: MIT | 0 | 0 | 0

HunyuanDiT

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language: Python | License: NOASSERTION | 0 | 0 | 0

leedl-tutorial

"Hung-yi Lee Deep Learning Tutorial" (《李宏毅深度学习教程》). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases

License: NOASSERTION | 0 | 0 | 0

MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

Language: Python | License: AGPL-3.0 | 0 | 0 | 0

MassTTS

A TTS demo for training new characters.

Language: Python | License: Apache-2.0 | 0 | 0 | 0

megatts2

Unofficial implementation of Megatts2

Language: Python | License: MIT | 0 | 0 | 0

mistral-finetune

License: Apache-2.0 | 0 | 0 | 0

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

License: Apache-2.0 | 0 | 0 | 0

jin1258804025

crf_torch_onnx

3D-Speaker

Amphion

bark

Bert-VITS2

ChatTTS

Chinese-Names-Corpus

CosyVoice

EasyBertVits2

espeak-phonemizer

fish-speech

FunASR

GPT-SoVITS

HunyuanDiT

leedl-tutorial

MARS5-TTS

MassTTS

megatts2

mistral-finetune

PaddleSpeech

parler-tts

polyphone

sherpa-onnx

spear-tts-pytorch

StyleTTS

StyleTTS2

tts-frontend-dataset

Viphoneme

vocos

wetts