AndreJJXu's starred repositories
blended-latent-diffusion
Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]
SyncDiffusion
Official implementation of SyncDiffusion.
MultiDiffusion
Official PyTorch implementation of "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" (ICML 2023)
ModalBiasAVSR
Official implementation of the CVPR 2024 paper "A Study of Dropout-Induced Modality Bias on Robustness to Missing Video".
clotho-dataset
Python code for handling the Clotho dataset.
ClipClap-GZSL
Audio-Visual Generalized Zero-Shot Learning using Large Pre-Trained Models
lyrebird-wav2clip
Official implementation of the paper "Wav2CLIP: Learning Robust Audio Representations from CLIP"
PerceptualSimilarity
LPIPS metric. pip install lpips
Generating-Realistic-Images-from-In-the-wild-Sounds
Official Code Repository for the paper "Generating Realistic Images from In-the-wild Sounds", ICCV 2023
latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Shifted_Diffusion
Code for Shifted Diffusion for Text-to-image Generation (CVPR 2023)