Jacob's starred repositories
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
ControlNet
Let us control diffusion models!
MidJourney-Styles-and-Keywords-Reference
A reference containing Styles and Keywords that you can use with MidJourney AI. There are also pages showing resolution comparison, image weights, and much more!
open_flamingo
An open-source framework for training large multimodal models.
T2I-Adapter
T2I-Adapter
pytorch-fid
Compute FID scores with PyTorch.
CrossAttentionControl
Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
improved-aesthetic-predictor
CLIP+MLP Aesthetic Score Predictor
Transformer-MM-Explainability
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.
long_stable_diffusion
Long-form text-to-images generation, using a pipeline of deep generative models (GPT-3 and Stable Diffusion)
semantic-diffusion-model
Official Implementation of Semantic Image Synthesis via Diffusion Models
sagemaker-distributed-training-workshop
Hands-on workshop for distributed training and hosting on SageMaker
dreambooth_depth2img
adaptation of huggingface's dreambooth training script to support depth2img
tise-toolbox
TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation (ECCV 2022)