Mehdi Cherti's starred repositories
k-diffusion
Karras et al. (2022) diffusion models for PyTorch
ClassyVision
An end-to-end PyTorch framework for image and video classification
Video-Pre-Training
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Arrival-Movie-Live-Coding
Documents from a live coding session by Christopher Wolfram related to content from the 2016 film Arrival
RegionCLIP
[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"
parti-pytorch
Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch
dalle2-laion
Pretrained Dalle2 from laion
simulacra-aesthetic-captions
Dataset of prompts, synthetic AI generated images, and aesthetic ratings.
LAION-Face
The human face subset of LAION-400M for large-scale face pretraining.
Implicit-Language-Q-Learning
Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"
perceiver-ar-pytorch
Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch
Elevater_Toolkit_IC
Toolkit for Elevater Benchmark
temporal-embedding-aggregation
Aggregating embeddings over time
conditioned-prior
(wip) Use LAION-AI's CLIP "conditoned prior" to generate CLIP image embeds from CLIP text embeds.