Yin Xinlei's starred repositories
open_flamingo
An open-source framework for training large multimodal models.
sigsep-mus-db
Python parser and tools for MUSDB18 Music Separation Dataset
WritingAIPaper
Writing AI Conference Papers: A Handbook for Beginners
audio-retrieval-benchmark
Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study".
versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
audio-flamingo
PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.
Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
mtg-jamendo-dataset
Metadata, scripts and baselines for the MTG-Jamendo dataset
Zero_Shot_Audio_Source_Separation
The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022
ustcthesis
LaTeX template for USTC thesis
AudioLDM-training-finetuning
AudioLDM training, finetuning, evaluation and inference.
melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis