jacobswan1

Jacob's starred repositories

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonBSD-3-Clause25063 219 448

generative-models

Generative Models by Stability AI

Language:PythonMIT22943 248 270

shap-e

Generate 3D objects conditioned on text or images

Language:PythonMIT11396 239 109

IF

Language:PythonNOASSERTION7541 84 98

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language:PythonApache-2.07220 100 1424

VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Language:PythonNOASSERTION4214 68 65

Text2Video-Zero

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

Language:PythonNOASSERTION3868 65 69

sd-webui-roop

roop extension for StableDiffusion web-ui

Language:PythonAGPL-3.03271 25 278

VQGAN-CLIP

Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

Language:PythonNOASSERTION2581 54 120

TemporalKit

An all in one solution for adding Temporal Stability to a Stable Diffusion Render via an automatic1111 extension

Language:PythonGPL-3.01877 38 129

parti

Apache-2.01521 56 9

tapnet

Tracking Any Point (TAP)

Language:PythonApache-2.01122 29 91

synthetic-computer-vision

A list of synthetic dataset and tools for computer vision

Language:PythonMIT995 81 2

mdetr

Language:PythonApache-2.0954 19 96

mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Language:PythonMIT876 9 17

dreambooth

CC-BY-4.0773 14 4

ControlVideo

[ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"

Language:PythonMIT728 22 30

fastcomposer

FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention

Language:PythonMIT613 21 32

InST

Official implementation of the paper “Inversion-Based Style Transfer with Diffusion Models” (CVPR 2023)

Language:Jupyter NotebookApache-2.0480 8 54

ProFusion

Code for Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach

Language:Jupyter NotebookApache-2.0458 16 20

pytorch_ema

Tiny PyTorch library for maintaining a moving average of a collection of parameters.

Language:PythonMIT391 4 8

WaveDiff

Official Pytorch Implementation of the paper: Wavelet Diffusion Models are fast and scalable Image Generators (CVPR'23)

Language:PythonGPL-3.0339 12 13

clipscore

CLIPScore EMNLP code

Language:PythonMIT170 2 13

T5-Textual-Inversion

Textual Inversion for DeepFloyd IF

Language:Jupyter NotebookAGPL-3.038 3 1

null-text-inversion-colab

Colab implementation of Google's null-text inversion.

Language:Jupyter Notebook33 1 1

tise-toolbox

TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation (ECCV 2022)

Language:PythonApache-2.033 40

streamlit-aws-tutorial

Language:Python25 20

ReMuQ

a multimodal retrieval dataset

Language:Jupyter Notebook18 1 2

deepspeed-sagemaker-example

Language:Jupyter NotebookApache-2.017 30

aws-neuron-eks-samples

Language:PythonMIT-014 170