WonwoongCho's starred repositories

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language: Jupyter Notebook | License: MIT | Stargazers: 47632 | Issues: 429 | Issues: 119

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language: Python | License: MIT | Stargazers: 36109 | Issues: 428 | Issues: 283

ControlNet

Let us control diffusion models!

Language: Python | License: Apache-2.0 | Stargazers: 27770 | Issues: 212 | Issues: 512

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 15976 | Issues: 152 | Issues: 1241

Grounded-Segment-Anything

Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 13416 | Issues: 112 | Issues: 355

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

Language: HTML | License: MIT | Stargazers: 9988 | Issues: 267 | Issues: 42

Language: Python | License: NOASSERTION | Stargazers: 7493 | Issues: 80 | Issues: 97

GroundingDINO

Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language: Python | License: Apache-2.0 | Stargazers: 4954 | Issues: 32 | Issues: 266

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 4238 | Issues: 34 | Issues: 187

glide-text2im

GLIDE: a diffusion-based text-conditional image synthesis model

Language: Python | License: MIT | Stargazers: 3462 | Issues: 162 | Issues: 44

pytorch-fid

Compute FID scores with PyTorch.

Language: Python | License: Apache-2.0 | Stargazers: 3051 | Issues: 14 | Issues: 82

Language: Jupyter Notebook | License: MIT | Stargazers: 2792 | Issues: 53 | Issues: 155

clip-interrogator

Image to prompt with BLIP and CLIP

Language: Python | License: MIT | Stargazers: 2471 | Issues: 30 | Issues: 91

sg2im

Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 2018

Language: Python | License: Apache-2.0 | Stargazers: 1286 | Issues: 44 | Issues: 27

Scene-Graph-Benchmark.pytorch

A new codebase for popular Scene Graph Generation methods (2020). Visualization and Scene Graph Extraction on custom images/datasets are provided. It is also a PyTorch implementation of the paper "Unbiased Scene Graph Generation from Biased Training", CVPR 2020.

Language: Jupyter Notebook | License: MIT | Stargazers: 1011 | Issues: 17 | Issues: 197

graph-rcnn.pytorch

[ECCV 2018] Official code for "Graph R-CNN for Scene Graph Generation"

long_stable_diffusion

Long-form text-to-images generation, using a pipeline of deep generative models (GPT-3 and Stable Diffusion)

fastcomposer

FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention

Language: Python | License: MIT | Stargazers: 587 | Issues: 21 | Issues: 30

Compositional-Visual-Generation-with-Composable-Diffusion-Models-PyTorch

[ECCV 2022] Compositional Generation using Diffusion Models

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 430 | Issues: 16 | Issues: 21

OpenPSG

Benchmarking Panoptic Scene Graph Generation (PSG), ECCV'22

Language: Python | License: MIT | Stargazers: 384 | Issues: 6 | Issues: 86

Mini-DALLE3

Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 262 | Issues: 6 | Issues: 13

compose-visual-relations

[NeurIPS 2021 Spotlight] Learning to Compose Visual Relations

energy-based-scene-graph

Code release for Energy-Based Learning for Scene Graph Generation

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 90 | Issues: 4 | Issues: 9

SGDiff

Official implementation for "Diffusion-Based Scene Graph to Image Generation with Masked Contrastive Pre-Training" https://arxiv.org/abs/2211.11138

PENET

[CVPR 2023] Official PyTorch code for the paper "Prototype-based Embedding Network for Scene Graph Generation"

Language: Jupyter Notebook | License: MIT | Stargazers: 39 | Issues: 2 | Issues: 3

EasyFace

Easy-to-use Face Analysis Tool

Language: Python | License: MIT | Stargazers: 32 | Issues: 3 | Issues: 0

CanonicalSg2Im

Code for "Learning Canonical Representations for Scene Graph to Image Generation", Herzig & Bar et al., ECCV 2020

Language: Python | License: MIT | Stargazers: 28 | Issues: 4 | Issues: 4

SQUAT

The official code for "Devil's on the Edges: Selective Quad Attention for Scene Graph Generation", CVPR 2023.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 14 | Issues: 0 | Issues: 0