Open-Debin

Chinese Academy of Sciences (SIAT-CAS)

Debin Meng's starred repositories

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Language: Python | License: NOASSERTION | 35604 1003 187

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language: Python | License: Apache-2.0 | 30849 315 890

LLM101n

LLM101n: Let's build a Storyteller

facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Language: Jupyter Notebook | License: Apache-2.0 | 8742 91 317

Omost

Your image is almost there!

Language: Python | License: Apache-2.0 | 6931 44 68

StoryDiffusion

Create Magic Story!

Language: Jupyter Notebook | License: Apache-2.0 | 5536 86 125

sd-scripts

Language: Python | License: Apache-2.0 | 4557 50 838

1806

18.06 course at MIT

Language: Jupyter Notebook | 2387 131 6

RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

Language: Jupyter Notebook | License: AGPL-3.0 | 1607 26 49

chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Language: Python | License: NOASSERTION | 1566 22 34

torch-fidelity

High-fidelity performance metrics for generative models in PyTorch

Language: Python | License: NOASSERTION | 934 7 35

PTI

Official Implementation for "Pivotal Tuning for Latent-based editing of Real Images" (ACM TOG 2022) https://arxiv.org/abs/2106.05744

Language: Jupyter Notebook | License: MIT | 891 23 57

Attend-and-Excite

Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)

Language: Jupyter Notebook | License: MIT | 663 15 35

stylegan3-editing

Official Implementation of "Third Time's the Charm? Image and Video Editing with StyleGAN3" (AIM ECCVW 2022) https://arxiv.org/abs/2201.13433

Language: Python | License: MIT | 645 35 51

anole

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Language: Python | 53000

PIRender

The source code of the ICCV2021 paper "PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering"

Language: Python | License: NOASSERTION | 508 20 34

SoraReview

The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".

DenseDiffusion

Official PyTorch Implementation of DenseDiffusion (ICCV 2023)

Language: Jupyter Notebook | License: Apache-2.0 | 460 11 18

platonic-rep

Language: Python | 390 12 5

Awesome-CVPR2024-ECCV2024-AIGC

A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC

TheChosenOne

Unofficial implementation of the paper "The Chosen One: Consistent Characters in Text-to-Image Diffusion Models"

Language: Python | 227 9 18

EfficientFace

[AAAI'21] Robust Lightweight Facial Expression Recognition Network with Label Distribution Training

Language: Python | License: MIT | 179 1 28

dcface

Language: Python | 121 9 28

DFER-CLIP

[BMVC'23] Prompting Visual-Language Models for Dynamic Facial Expression Recognition

Language: Python | License: MIT | 91 2 3

Linguistic-Binding-in-Diffusion-Models

Language: Jupyter Notebook | 67 2 3

coursera-mathematical-thinking

Language: TeX | 46 80

ClassDiffusion

ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"

Language: Python | 2800

MPS

Language: HTML | 14 1 1

Multi-Modal-Prompt

Language: Python | 500

collaborative-neural-painting

Language: Python | 300