Siyeol Jung's starred repositories
SoundStream
An implementation of the SoundStream paper (SoundStream: An End-to-End Neural Audio Codec): https://arxiv.org/pdf/2107.03312.pdf
MM-Diffusion
[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Resemblyzer
A Python package to analyze and compare voices with deep learning
Dyadic-Interaction-Modeling
[ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation
vector-quantize-pytorch
Vector (and Scalar) Quantization, in PyTorch
stable-diffusion
A latent text-to-image diffusion model
Awesome-Image-Quality-Assessment
A comprehensive collection of IQA papers
OmniTokenizer
OmniTokenizer: one model and one weight for image-video joint tokenization.
IQA-PyTorch
👁️ 🖼️ 🔥 PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM (Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
mixture-of-experts
A PyTorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models
Grid-Diffusion-Models-for-Text-to-Video-Generation
Official Code Repository for the paper "Grid Diffusion Models for Text-to-Video Generation", CVPR 2024
Generating-Realistic-Images-from-In-the-wild-Sounds
Official Code Repository for the paper "Generating Realistic Images from In-the-wild Sounds", ICCV 2023