gaetan-sony's starred repositories
LanguageBind
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
audio-flamingo
PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.
genmusic_demo_list
a list of demo websites for automatic music generation research
pyloudnorm
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
VQ-Diffusion
Official implementation of VQ-Diffusion
music-inpainting-ts
A collection of web interfaces for AI-assisted interactive music creation
taming-transformers
Taming Transformers for High-Resolution Image Synthesis
PerceptualSimilarity
LPIPS metric. pip install lpips