Haolin Chen's starred repositories
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
continual-learning
PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.
naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
PD-Runner-Revived
PD-Runner (Parallels Desktop) 补档
2024-Tech-OA
List of Tech Company OAs. Save your time from finding them all over the internet.
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
CharsiuG2P
Multilingual G2P in 100 languages
reserves-lib-tsinghua-downloader
Download pages from http://reserves.lib.tsinghua.edu.cn/
P.808
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).
pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper
nngeometry
{KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch
EKFAC-pytorch
Repository containing Pytorch code for EKFAC and K-FAC perconditioners.
UnitSpeech
An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"
listening-test
An open source platform for browser based speech and audio subjective quality tests.