aparcho

aparcho

Geek Repo

Github PK Tool:Github PK Tool

aparcho's starred repositories

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:39731Issues:395Issues:1287

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:33319Issues:308Issues:418

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:30584Issues:266Issues:1043

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:26770Issues:178Issues:844

OpenVoice

Instant voice cloning by MyShell.

Language:PythonLicense:MITStargazers:26573Issues:203Issues:194

ios_rule_script

分流规则、重写写规则及脚本。

Language:JavaScriptLicense:GPL-2.0Stargazers:15834Issues:241Issues:1072

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:12161Issues:168Issues:492

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language:PythonLicense:Apache-2.0Stargazers:10364Issues:185Issues:1877

teslamate

A self-hosted data logger for your Tesla 🚘

Language:ElixirLicense:MITStargazers:5436Issues:131Issues:1303

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonLicense:MITStargazers:4072Issues:54Issues:116

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:PythonLicense:Apache-2.0Stargazers:3796Issues:90Issues:980

llm-foundry

LLM training code for Databricks foundation models

Language:PythonLicense:Apache-2.0Stargazers:3783Issues:48Issues:360

TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Language:PythonLicense:Apache-2.0Stargazers:3740Issues:78Issues:683

tdl

📥 A Telegram tookit written in Golang

Language:GoLicense:AGPL-3.0Stargazers:3509Issues:21Issues:394

Senta

Baidu's open-source Sentiment Analysis System.

Language:PythonLicense:Apache-2.0Stargazers:1849Issues:60Issues:86

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonLicense:MITStargazers:1797Issues:32Issues:159

PPASR

基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型

Language:PythonLicense:Apache-2.0Stargazers:779Issues:11Issues:173

diffusion_models

A series of tutorial notebooks on denoising diffusion probabilistic models in PyTorch

Language:Jupyter NotebookStargazers:611Issues:10Issues:8

pytorch-softdtw-cuda

Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch using Numba

Language:PythonLicense:MITStargazers:595Issues:10Issues:28

neural_sp

End-to-end ASR/LM implementation with PyTorch

Language:PythonLicense:Apache-2.0Stargazers:586Issues:33Issues:89

tacotronv2_wavernn_chinese

tacotronV2 + wavernn 实现中文语音合成(Tensorflow + pytorch)

WaveGrad

Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:398Issues:17Issues:26

survae_flows

Code for paper "SurVAE Flows: Surjections to Bridge the Gap between VAEs and Flows"

Language:PythonLicense:MITStargazers:283Issues:28Issues:18

wavegrad

A fast, high-quality neural vocoder.

Language:PythonLicense:Apache-2.0Stargazers:266Issues:15Issues:15

multi-speaker-tacotron

VCTK multi-speaker tacotron for ICASSP 2020

Language:PythonLicense:BSD-3-ClauseStargazers:264Issues:18Issues:11

OpenChineseLLaMA

Chinese large language model base generated through incremental pre-training on Chinese datasets

Language:PythonLicense:GPL-3.0Stargazers:231Issues:5Issues:9

VITS-BigVGAN-SpanPSP-Chinese

基于PyTorch的VITS-BigVGAN的tts中文模型,加入韵律预测模型。

tacotron2-vae

Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:166Issues:10Issues:6

VectorQuantizedCPC

Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion

Language:PythonLicense:MITStargazers:139Issues:4Issues:7

Voice-conversion-evaluation

An evaluation toolkit for voice conversion models.