Ross Wightman's starred repositories
stable-diffusion
A latent text-to-image diffusion model
dalle-mini
DALL·E Mini - Generate images from a text prompt
DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20 hours on one machine.
clip-retrieval
Easily compute CLIP embeddings and build a CLIP retrieval system with them
big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
EfficientFormer
EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPS 2022]
ResizeRight
The correct way to resize images or tensors. For NumPy or PyTorch (differentiable).
CLIP_benchmark
CLIP-like model evaluation
aesthetic-predictor
A linear estimator on top of CLIP to predict the aesthetic quality of pictures
vae-textures
Texture mapping with variational auto-encoders
detectron2_timm
A simple wrapper library for binding timm models as detectron2 backbones
open_clip_juwels
An open source implementation of CLIP.
clip_benchmark
CLIP retrieval benchmark