chl17

Haolin Chen's starred repositories

bark

🔊 Text-Prompted Generative Audio Model

Language: Jupyter Notebook | License: MIT | 33840 315 422

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language: Jupyter Notebook | License: Apache-2.0 | 12473 166 502

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language: Python | License: Apache-2.0 | 10951 195 2143

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Language: Python | License: NOASSERTION | 9902 131 48

iRingo

Unlock full Apple features and integrated services

Language: JavaScript | License: GPL-3.0 | 8949 87 174

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language: Jupyter Notebook | License: Apache-2.0 | 6811 59 137

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language: Python | License: MIT | 4451 76 179

DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Language: Python | License: MIT | 4193 43 100

bark-with-voice-clone

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Language: Jupyter Notebook | License: NOASSERTION | 2973 47 77

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Language: Python | License: MIT | 2920 90 97

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch

Language: Python | License: MIT | 2328 60 167

continual-learning

PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.

Language: Jupyter Notebook | License: MIT | 1504 28 30

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in PyTorch

Language: Python | License: MIT | 1237 56 30

PD-Runner-Revived

PD-Runner (Parallels Desktop) re-upload

Language: Swift | 1204 46 35

ML_course

EPFL Machine Learning Course, Fall 2023

Language: Jupyter Notebook | 1182 92 21

NATSpeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

Language: Python | License: MIT | 959 20 26

2024-Tech-OA

List of Tech Company OAs. Save your time from finding them all over the internet.

888 78 8

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Language: Jupyter Notebook | 545 23 28

DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Language: Python | License: MIT | 303 9 27

CharsiuG2P

Multilingual G2P in 100 languages

Language: Jupyter Notebook | License: MIT | 266 10 10

reserves-lib-tsinghua-downloader

Download pages from http://reserves.lib.tsinghua.edu.cn/

Language: Python | License: GPL-3.0 | 217 3 7

This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).

Language: HTML | License: MIT | 199 23 24