Ruiqi Li's starred repositories
pytorchvideo
A deep learning library for video understanding research.
acad-homepage.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage
RectifiedFlow
Official Implementation of Rectified Flow (ICLR 2023 Spotlight)
speechmetrics
A wrapper around speech quality metrics: MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
Prompt-Singer
Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
parler-tts
Inference and training library for high-quality TTS models.
VideoMAEv2
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
InternVideo
[ECCV 2024] Video Foundation Models & Data for Multimodal Understanding
Lumina-T2X
Lumina-T2X is a unified framework for Text-to-Any-Modality Generation
audioldm_eval
This toolbox aims to unify audio generation model evaluation for easier comparison.
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.