dragnDriver's starred repositories

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:40483Issues:394Issues:1293

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:35615Issues:328Issues:437

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language:PythonLicense:AGPL-3.0Stargazers:25567Issues:177Issues:130

VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Language:PythonLicense:Apache-2.0Stargazers:4709Issues:40Issues:566

basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Language:PythonLicense:Apache-2.0Stargazers:3386Issues:49Issues:78

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonLicense:MITStargazers:1922Issues:31Issues:162

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Language:PythonLicense:MITStargazers:1785Issues:28Issues:214

emotional-vits

无需情感标注的情感可控语音合成模型,基于VITS

Language:Jupyter NotebookLicense:MITStargazers:1316Issues:12Issues:34

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Language:PythonLicense:MITStargazers:1269Issues:53Issues:31

speech-synthesis-paper

List of speech synthesis papers.

conformer

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Language:PythonLicense:Apache-2.0Stargazers:946Issues:9Issues:37

BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

Language:PythonLicense:MITStargazers:856Issues:71Issues:0

mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:854Issues:30Issues:95

vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Language:PythonLicense:MITStargazers:774Issues:33Issues:46

nnsvs

Neural network-based singing voice synthesis library for research

Language:PythonLicense:MITStargazers:682Issues:38Issues:76

ChineseBert

Code for ACL 2021 paper "ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information"

Language:PythonLicense:MITStargazers:541Issues:7Issues:61

w2v2-how-to

How to use our public wav2vec2 dimensional emotion model

Language:Jupyter NotebookLicense:MITStargazers:442Issues:9Issues:16

Muskits

An opensource music processing toolkit

Language:PythonLicense:Apache-2.0Stargazers:310Issues:16Issues:31

hifi-gan-bwe

Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.

Language:PythonLicense:MITStargazers:203Issues:9Issues:9

Learn2Sing2.0

Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher

Language:PythonLicense:MITStargazers:95Issues:5Issues:6
Language:PythonLicense:MITStargazers:93Issues:5Issues:5

Subband_Kalman_AEC

Subband kalman filter for echo cancellation

Language:MATLABLicense:MITStargazers:40Issues:4Issues:1

unitypackages

Unity 组件工具合集

FastSpeech2-cwt

with alignment learning and continuous wavelet transform

Language:Jupyter NotebookLicense:MITStargazers:19Issues:2Issues:0

math-quaternion-slerp

Demonstration of Quaternion Spherical Linear Interpolation (Slerp)

Language:C++License:GPL-3.0Stargazers:9Issues:9Issues:0