Beast code in Giters

dragnDriver's starred repositories

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonApache-2.040483 394 1293

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookMIT35615 328 437

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

NOASSERTION26666 287 41

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language:PythonAGPL-3.025567 177 130

VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Language:PythonApache-2.04709 40 566

basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Language:PythonApache-2.03386 49 78

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonMIT1922 31 162

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Language:PythonMIT1785 28 214

NeuralSpeech

Language:PythonMIT1371 33 124

emotional-vits

无需情感标注的情感可控语音合成模型，基于VITS

Language:Jupyter NotebookMIT1316 12 34

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Language:PythonMIT1269 53 31

speech-synthesis-paper

List of speech synthesis papers.

MIT993 60 4

conformer

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Language:PythonApache-2.0946 9 37

BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

Language:PythonMIT856 710

mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Language:Jupyter NotebookBSD-3-Clause854 30 95

vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Language:PythonMIT774 33 46

nnsvs

Neural network-based singing voice synthesis library for research

Language:PythonMIT682 38 76

ChineseBert

Code for ACL 2021 paper "ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information"

Language:PythonMIT541 7 61

w2v2-how-to

How to use our public wav2vec2 dimensional emotion model

Language:Jupyter NotebookMIT442 9 16

Diffusion-SVC

Language:PythonMIT406 9 26

Muskits

An opensource music processing toolkit

Language:PythonApache-2.0310 16 31

hifi-gan-bwe

Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.

Language:PythonMIT203 9 9

Learn2Sing2.0

Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher

Language:JavaScript175 6 7

HiFiSinger

Language:PythonMIT107 6 8

FCPE

Language:PythonMIT95 5 6

HarmoF0

Language:PythonMIT93 5 5

Subband_Kalman_AEC

Subband kalman filter for echo cancellation

Language:MATLABMIT40 4 1

unitypackages

Unity 组件工具合集

MIT39 1 4

FastSpeech2-cwt

with alignment learning and continuous wavelet transform

Language:Jupyter NotebookMIT19 20

math-quaternion-slerp

Demonstration of Quaternion Spherical Linear Interpolation (Slerp)

Language:C++GPL-3.09 90