wty-ustc

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.

Language:Jupyter NotebookMIT3183 39 107

prompt-to-prompt

Language:Jupyter NotebookApache-2.02981 24 75

textual_inversion

Language:Jupyter NotebookMIT2866 53 157

Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

Language:Jupyter NotebookApache-2.02730 48 87

InternLM-XComposer

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Language:PythonApache-2.02276 41 345

DECA

DECA: Detailed Expression Capture and Animation (SIGGRAPH 2021)

Language:PythonNOASSERTION2075 40 211

LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

Apache-2.01914 112 30

bolei_awesome_posters

CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!

1335 90

StyleGAN-Human

StyleGAN-Human: A Data-Centric Odyssey of Human Generation

Language:Python1126 36 49

PTI

Official Implementation for "Pivotal Tuning for Latent-based editing of Real Images" (ACM TOG 2022) https://arxiv.org/abs/2106.05744

Language:Jupyter NotebookMIT891 23 57

Text2LIVE

Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)

Language:PythonMIT876 28 21

DiffusionCLIP

[CVPR 2022] Official PyTorch Implementation for DiffusionCLIP: Text-guided Image Manipulation Using Diffusion Models

Language:PythonNOASSERTION775 8 37

blended-latent-diffusion

Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]

Language:Jupyter NotebookMIT543 49 14

MakeItTalk

Language:Jupyter NotebookNOASSERTION475 25 22

sketchedit

SketchEdit: Mask-Free Local Image Manipulation with Partial Sketches, CVPR2022

Language:PythonNOASSERTION241 11 8

OPERA

[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

Language:PythonMIT216 2 33

StyleSwap

StyleSwap: Style-Based Generator Empowers Robust Face Swapping (ECCV 2022)

Language:PythonApache-2.0198 38 12

DiffusionDisentanglement

Official implementation of the paper "Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models

Language:Jupyter NotebookNOASSERTION153 6 8

NED

PyTorch implementation for NED (CVPR 2022). It can be used to manipulate the facial emotions of actors in videos based on emotion labels or reference styles.

Language:PythonMIT152 8 8

MNeuEdit

Code for Mesh-Guided Neural Implicit Field Editing.

Apache-2.019 60