SiyeolJung

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also support StyleGAN2, DFDNet.

Language:PythonApache-2.0659200

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookNOASSERTION6719400

sqvae

Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)

Language:PythonApache-2.017600

CVQ-VAE

[ICCV 2023] Online Clustered Codebook

Language:PythonMIT13100

Awesome-Image-Quality-Assessment

A comprehensive collection of IQA papers

Language:TeXMIT87500

MMSI

Code for "Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations" (CVPR 2024 Oral)

Language:PythonMIT800

OmniTokenizer

OmniTokenizer: one model and one weight for image-video joint tokenization.

Language:PythonMIT21100

VoxMM

Language:Python1400

IQA-PyTorch

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Language:PythonNOASSERTION173200

S2G-MDDiffusion

Language:PythonMIT5700

SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Language:PythonNOASSERTION117300

mixture-of-experts

A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models

Language:PythonMIT58700

mmvae

Multimodal Mixture-of-Experts VAE

Language:PythonGPL-3.018700

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookMIT1409700

DistgASR

[TPAMI 2022] DistgASR: Disentangling Mechanism for Light Field Angular Super-Resolution

Language:Python2900

Grid-Diffusion-Models-for-Text-to-Video-Generation

Official Code Repository for the paper "Grid Diffusion Models for Text-to-Video Generation", CVPR 2024

900

Generating-Realistic-Images-from-In-the-wild-Sounds

Official Code Repository for the paper "Generating Realistic Images from In-the-wild Sounds", ICCV 2023

Language:Jupyter Notebook900