Haolin Chen (chl17)

chl17

Geek Repo

Company:Idiap Research Institute

Location:Martigny, Switzerland

Home Page:https://hl-chen.com

Github PK Tool:Github PK Tool

Haolin Chen's starred repositories

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:33840Issues:315Issues:422

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:12473Issues:166Issues:502

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:10951Issues:195Issues:2143

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Language:PythonLicense:NOASSERTIONStargazers:9902Issues:131Issues:48

iRingo

解锁完整的 Apple功能和集成服务

Language:JavaScriptLicense:GPL-3.0Stargazers:8949Issues:87Issues:174

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6811Issues:59Issues:137

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language:PythonLicense:MITStargazers:4451Issues:76Issues:179

DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Language:PythonLicense:MITStargazers:4193Issues:43Issues:100

bark-with-voice-clone

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2973Issues:47Issues:77

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Language:PythonLicense:MITStargazers:2920Issues:90Issues:97

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonLicense:MITStargazers:2328Issues:60Issues:167

continual-learning

PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.

Language:Jupyter NotebookLicense:MITStargazers:1504Issues:28Issues:30

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Language:PythonLicense:MITStargazers:1237Issues:56Issues:30

PD-Runner-Revived

PD-Runner (Parallels Desktop) 补档

ML_course

EPFL Machine Learning Course, Fall 2023

Language:Jupyter NotebookStargazers:1182Issues:92Issues:21

NATSpeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

Language:PythonLicense:MITStargazers:959Issues:20Issues:26

2024-Tech-OA

List of Tech Company OAs. Save your time from finding them all over the internet.

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Language:Jupyter NotebookStargazers:545Issues:23Issues:28

DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Language:PythonLicense:MITStargazers:303Issues:9Issues:27

CharsiuG2P

Multilingual G2P in 100 languages

Language:Jupyter NotebookLicense:MITStargazers:266Issues:10Issues:10

reserves-lib-tsinghua-downloader

Download pages from http://reserves.lib.tsinghua.edu.cn/

Language:PythonLicense:GPL-3.0Stargazers:217Issues:3Issues:7

P.808

This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).

Language:HTMLLicense:MITStargazers:199Issues:23Issues:24

pflowtts_pytorch

Unofficial implementation of NVIDIA P-Flow TTS paper

Language:PythonLicense:MITStargazers:198Issues:15Issues:40

nngeometry

{KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch

Language:PythonLicense:MITStargazers:197Issues:7Issues:32

turkle

Django-based clone of Amazon's Mechanical Turk service running in your local environment.

Language:PythonLicense:NOASSERTIONStargazers:142Issues:17Issues:129

EKFAC-pytorch

Repository containing Pytorch code for EKFAC and K-FAC perconditioners.

Language:PythonLicense:MITStargazers:138Issues:7Issues:5

UnitSpeech

An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:125Issues:11Issues:8

beaqlejs

*BeaqleJS* provides a framework to create browser based listening tests and is purely based on open web standards like HTML5 and Javascript.

Language:JavaScriptLicense:GPL-3.0Stargazers:86Issues:18Issues:14

listening-test

An open source platform for browser based speech and audio subjective quality tests.

Language:TypeScriptLicense:MITStargazers:32Issues:4Issues:6