Jacob (jacobswan1)

jacobswan1

Geek Repo

Company: Amazon Alexa AI.

Location:San Jose

Github PK Tool:Github PK Tool

Jacob's starred repositories

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25063Issues:219Issues:448

generative-models

Generative Models by Stability AI

Language:PythonLicense:MITStargazers:22943Issues:248Issues:270

shap-e

Generate 3D objects conditioned on text or images

Language:PythonLicense:MITStargazers:11396Issues:239Issues:109
Language:PythonLicense:NOASSERTIONStargazers:7541Issues:84Issues:98

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language:PythonLicense:Apache-2.0Stargazers:7220Issues:100Issues:1424

VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Language:PythonLicense:NOASSERTIONStargazers:4214Issues:68Issues:65

Text2Video-Zero

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

Language:PythonLicense:NOASSERTIONStargazers:3868Issues:65Issues:69

sd-webui-roop

roop extension for StableDiffusion web-ui

Language:PythonLicense:AGPL-3.0Stargazers:3271Issues:25Issues:278

VQGAN-CLIP

Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

Language:PythonLicense:NOASSERTIONStargazers:2581Issues:54Issues:120

TemporalKit

An all in one solution for adding Temporal Stability to a Stable Diffusion Render via an automatic1111 extension

Language:PythonLicense:GPL-3.0Stargazers:1877Issues:38Issues:129

tapnet

Tracking Any Point (TAP)

Language:PythonLicense:Apache-2.0Stargazers:1122Issues:29Issues:91

synthetic-computer-vision

A list of synthetic dataset and tools for computer vision

Language:PythonLicense:MITStargazers:995Issues:81Issues:2
Language:PythonLicense:Apache-2.0Stargazers:954Issues:19Issues:96

mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Language:PythonLicense:MITStargazers:876Issues:9Issues:17

ControlVideo

[ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"

Language:PythonLicense:MITStargazers:728Issues:22Issues:30

fastcomposer

FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention

Language:PythonLicense:MITStargazers:613Issues:21Issues:32

InST

Official implementation of the paper “Inversion-Based Style Transfer with Diffusion Models” (CVPR 2023)

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:480Issues:8Issues:54

ProFusion

Code for Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:458Issues:16Issues:20

pytorch_ema

Tiny PyTorch library for maintaining a moving average of a collection of parameters.

Language:PythonLicense:MITStargazers:391Issues:4Issues:8

WaveDiff

Official Pytorch Implementation of the paper: Wavelet Diffusion Models are fast and scalable Image Generators (CVPR'23)

Language:PythonLicense:GPL-3.0Stargazers:339Issues:12Issues:13

clipscore

CLIPScore EMNLP code

Language:PythonLicense:MITStargazers:170Issues:2Issues:13

T5-Textual-Inversion

Textual Inversion for DeepFloyd IF

Language:Jupyter NotebookLicense:AGPL-3.0Stargazers:38Issues:3Issues:1

null-text-inversion-colab

Colab implementation of Google's null-text inversion.

Language:Jupyter NotebookStargazers:33Issues:1Issues:1

tise-toolbox

TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation (ECCV 2022)

Language:PythonLicense:Apache-2.0Stargazers:33Issues:4Issues:0

ReMuQ

a multimodal retrieval dataset

Language:Jupyter NotebookStargazers:18Issues:1Issues:2
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:17Issues:3Issues:0
Language:PythonLicense:MIT-0Stargazers:14Issues:17Issues:0