jacobswan1

Jacob's starred repositories

llama

Inference code for LLaMA models

Language: Python · NOASSERTION · 50895 499 872

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python · Apache-2.0 · 29028 341 267

ControlNet

Let us control diffusion models!

Language: Python · Apache-2.0 · 28599 213 521

MidJourney-Styles-and-Keywords-Reference

A reference containing Styles and Keywords that you can use with MidJourney AI. There are also pages showing resolution comparison, image weights, and much more!

11741 280 16

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook · BSD-3-Clause · 9035 95 619

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language: Jupyter Notebook · Apache-2.0 · 6712 59 137

open_flamingo

An open-source framework for training large multimodal models.

Language: Python · MIT · 3519 47 170

pytorch-fid

Compute FID scores with PyTorch.

Language: Python · Apache-2.0 · 3166 14 84

prompt-to-prompt

Language: Jupyter Notebook · Apache-2.0 · 2915 24 74

composer

Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"

MIT · 1526 174 8

CrossAttentionControl

Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion

Language: Jupyter Notebook · MIT · 1253 22 26

mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Language: Python · MIT · 876 9 17

improved-aesthetic-predictor

CLIP+MLP Aesthetic Score Predictor

Language: Python · Apache-2.0 · 747 6 10

[ICCV 2021 - Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.

Language: Jupyter Notebook · MIT · 728 8 35

GroupViT

Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.

Language: Python · NOASSERTION · 702 11 63

long_stable_diffusion

Long-form text-to-images generation, using a pipeline of deep generative models (GPT-3 and Stable Diffusion)

Language: Python · 676 16 3

ELITE

ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)

Language: Python · Apache-2.0 · 483 45 19

OpenPSG

Benchmarking Panoptic Scene Graph Generation (PSG), ECCV'22

Language: Python · MIT · 399 6 90

alexa-teacher-models

Language: Python · Apache-2.0 · 362 36 7

webui-stability-api

Language: Python · AGPL-3.0 · 320 11 8

paco

This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts, and attributes prediction models, query evaluation scripts, and visualization notebooks.

Language: Python · MIT · 256 19 8

pytorch-unsupervised-segmentation-tip

Language: Python · MIT · 251 5 18

semantic-diffusion-model

Official Implementation of Semantic Image Synthesis via Diffusion Models

Language: Python · 206 7 26

pegbis

Python implementation of "Efficient Graph-Based Image Segmentation" paper

Language: Python · 141 4 7

sagemaker-distributed-training-workshop

Hands-on workshop for distributed training and hosting on SageMaker

Language: Jupyter Notebook · Apache-2.0 · 115 5 1

dreambooth_depth2img

Adaptation of Hugging Face's DreamBooth training script to support depth2img

Language: Python · MIT · 99 3 11

LAION-5B-WatermarkDetection

Language: Python · MIT · 92 4 3

tise-toolbox

TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation (ECCV 2022)

Language: Python · Apache-2.0 · 33 40