Jingdong Li's starred repositories
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Meta-voicebox
Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.
causal-conv1d
Causal depthwise conv1d in CUDA, with a PyTorch interface
torch-pesq
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
CausalityCheck
Causality Check in Frame-online Speech Separation
interspeech2023-moving-iva-samples
Repository containing samples produced by the method proposed in "Multi-channel separation of dynamic speech and sound events" and presented at Interspeech 2023.
SpeakerVerSim
Python-based simulation framework for different version control strategies of speaker recognition systems.