aparcho

followers

0

following

stars

aparcho's starred repositories

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonApache-2.039731 395 1287

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookMIT33319 308 418

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.030584 266 1043

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonMIT26770 178 844

OpenVoice

Instant voice cloning by MyShell.

Language:PythonMIT26573 203 194

ios_rule_script

分流规则、重写写规则及脚本。

Language:JavaScriptGPL-2.015834 241 1072

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language:Jupyter NotebookApache-2.012161 168 492

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language:PythonApache-2.010364 185 1877

teslamate

A self-hosted data logger for your Tesla 🚘

Language:ElixirMIT5436 131 1303

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonMIT4072 54 116

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:PythonApache-2.03796 90 980

llm-foundry

LLM training code for Databricks foundation models

Language:PythonApache-2.03783 48 360

TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Language:PythonApache-2.03740 78 683

tdl

📥 A Telegram tookit written in Golang

Language:GoAGPL-3.03509 21 394

Senta

Baidu's open-source Sentiment Analysis System.

Language:PythonApache-2.01849 60 86

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonMIT1797 32 159

PPASR

基于PaddlePaddle实现端到端中文语音识别，从入门到实战，超简单的入门案例，超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型

Language:PythonApache-2.0779 11 173

diffusion_models

A series of tutorial notebooks on denoising diffusion probabilistic models in PyTorch

Language:Jupyter Notebook611 10 8

pytorch-softdtw-cuda

Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch using Numba

Language:PythonMIT595 10 28

neural_sp

End-to-end ASR/LM implementation with PyTorch

Language:PythonApache-2.0586 33 89

tacotronv2_wavernn_chinese

tacotronV2 + wavernn 实现中文语音合成(Tensorflow + pytorch)

Language:Python511 9 63

WaveGrad

Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.

Language:Jupyter NotebookBSD-3-Clause398 17 26

survae_flows

Code for paper "SurVAE Flows: Surjections to Bridge the Gap between VAEs and Flows"

Language:PythonMIT283 28 18

wavegrad

A fast, high-quality neural vocoder.

Language:PythonApache-2.0266 15 15

multi-speaker-tacotron

VCTK multi-speaker tacotron for ICASSP 2020

Language:PythonBSD-3-Clause264 18 11

OpenChineseLLaMA

Chinese large language model base generated through incremental pre-training on Chinese datasets

Language:PythonGPL-3.0231 5 9

VITS-BigVGAN-SpanPSP-Chinese

基于PyTorch的VITS-BigVGAN的tts中文模型，加入韵律预测模型。

Language:Python184 3 10

tacotron2-vae

Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"

Language:Jupyter NotebookBSD-3-Clause166 10 6

VectorQuantizedCPC

Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion

Language:PythonMIT139 4 7

Voice-conversion-evaluation

An evaluation toolkit for voice conversion models.

Language:Python39 2 3