Jacob (jacobswan1)

jacobswan1

Geek Repo

Company: Amazon Alexa AI.

Location:San Jose

Github PK Tool:Github PK Tool

Jacob's starred repositories

generative-models

Generative Models by Stability AI

Language:PythonLicense:MITStargazers:22223Issues:235Issues:254

AnimateDiff

Official implementation of AnimateDiff.

Language:PythonLicense:Apache-2.0Stargazers:8738Issues:98Issues:302

CoDeF

[CVPR 2024 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Language:PythonLicense:NOASSERTIONStargazers:4754Issues:73Issues:76

Tune-A-Video

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Language:PythonLicense:Apache-2.0Stargazers:4079Issues:49Issues:90

Rerender_A_Video

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2883Issues:23Issues:97

make-a-video-pytorch

Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch

Language:PythonLicense:MITStargazers:1838Issues:72Issues:15

dreamoving-project

Official implementation of DreaMoving

tapnet

Tracking Any Point (TAP)

Language:PythonLicense:Apache-2.0Stargazers:1040Issues:28Issues:80

synthetic-computer-vision

A list of synthetic dataset and tools for computer vision

Language:PythonLicense:MITStargazers:991Issues:81Issues:2

Text-To-Video-Finetuning

Finetune ModelScope's Text To Video model using Diffusers 🧨

Language:PythonLicense:MITStargazers:607Issues:18Issues:68

SCUNet

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis (Machine Intelligence Research 2023)

Language:PythonLicense:Apache-2.0Stargazers:582Issues:17Issues:24

control-a-video

Official Implementation of "Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models"

Language:PythonLicense:GPL-3.0Stargazers:337Issues:22Issues:28

CM3Leon

An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal AI that uses just a decoder to generate both text and images

Language:PythonLicense:MITStargazers:318Issues:21Issues:14

VideoSwap

Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

Gen-L-Video

The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising".

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:259Issues:17Issues:25

LongerCrafter

[ICLR 2024] Code for FreeNoise based on VideoCrafter

AlignProp

AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion

Language:PythonLicense:MITStargazers:181Issues:7Issues:12

hyperdreambooth

Implementation of HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models

Language:PythonLicense:MITStargazers:153Issues:25Issues:1

cpl

Code for Contrastive Preference Learning (CPL)

Language:PythonLicense:MITStargazers:129Issues:3Issues:5

VideoLDM

Unofficial PyTorch implementation of the VideoLDM.

Language:PythonLicense:MITStargazers:128Issues:13Issues:7

make-a-stable-diffusion-video

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch - fork with video pseudo3d

Language:PythonLicense:Apache-2.0Stargazers:95Issues:0Issues:0

retrieval-augmented-diffusion-models

Official codebase for the Paper “Retrieval-Augmented Diffusion Models”

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:93Issues:9Issues:5

webdataset-lightning

A small demonstration of using WebDataset with ImageNet and PyTorch Lightning

Language:PythonLicense:BSD-3-ClauseStargazers:70Issues:0Issues:0

null-text-inversion-colab

Colab implementation of Google's null-text inversion.

Language:Jupyter NotebookStargazers:32Issues:1Issues:1
License:Apache-2.0Stargazers:21Issues:0Issues:0

ReMuQ

a multimodal retrieval dataset

Language:Jupyter NotebookStargazers:17Issues:1Issues:0
Language:PythonLicense:MIT-0Stargazers:13Issues:15Issues:0