Kaizhi Qian's starred repositories
stable-audio-tools
Generative models for conditional audio generation
speechbrain
A PyTorch-based Speech Toolkit
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
textlesslib
Library for Textless Spoken Language Processing
Diffusion-LM
Diffusion-LM
VID-Sentence
This repository provides the dataset introduced by our WSSTG paper
zfchenUnique
My personal repository
DCL-Release
This repo contains the pytorch implementation for Dynamic Concept Learner (accepted by ICLR 2021).
GNS-PyTorch
A PyTorch implementation of the “Graph Network-based Simulators” (GNS) model from DeepMind for simulating particle-based dynamics using graph networks.
contentvec
speech self-supervised representations
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.