Mehdi Cherti's repositories
feed_forward_vqgan_clip
Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt
DALLE_clip_score
Simple script to compute CLIP-based scores given a trained DALL-E model.
Auto-PyTorch
Automatic architecture search and hyperparameter optimization for PyTorch
ConvNeXt
Code release for ConvNeXt model
DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in PyTorch
deit
Official DeiT repository
denoising-diffusion-gan
Tackling the Generative Learning Trilemma with Denoising Diffusion GANs https://arxiv.org/abs/2112.07804
dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
embedding-reader
Efficiently read embeddings in streaming fashion from any filesystem
glide-text2im
GLIDE: a diffusion-based text-conditional image synthesis model
mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
open_clip
An open source implementation of CLIP.
pHash
pHash - the open source perceptual hash library
scaling-laws-openclip
Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)
SLIP
Code release for SLIP: Self-supervision meets Language-Image Pre-training
slurm-tracking-bot
Simple slurm tracking bot to check usage
stable-diffusion-webui
Stable Diffusion web UI
stylegan_xl
StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets