wty-ustc

wty-ustc

Geek Repo

Location:Hefei, China

Github PK Tool:Github PK Tool

wty-ustc's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:136560Issues:1052Issues:7547

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:66815Issues:555Issues:706

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:PythonLicense:MITStargazers:37752Issues:442Issues:296

generative-models

Generative Models by Stability AI

Language:PythonLicense:MITStargazers:23433Issues:252Issues:283

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonLicense:Apache-2.0Stargazers:10556Issues:123Issues:205

open_clip

An open source implementation of CLIP.

Language:PythonLicense:NOASSERTIONStargazers:9307Issues:76Issues:454

imagen-pytorch

Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

Language:PythonLicense:MITStargazers:7900Issues:113Issues:300

Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Language:Jupyter NotebookLicense:MITStargazers:7526Issues:92Issues:146

PRNet

Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network (ECCV 2018)

Language:PythonLicense:MITStargazers:4933Issues:189Issues:201
Language:PythonLicense:NOASSERTIONStargazers:3185Issues:159Issues:111

Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.

Language:Jupyter NotebookLicense:MITStargazers:3183Issues:39Issues:107
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2981Issues:24Issues:75
Language:Jupyter NotebookLicense:MITStargazers:2866Issues:53Issues:157

Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2730Issues:48Issues:87

InternLM-XComposer

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Language:PythonLicense:Apache-2.0Stargazers:2276Issues:41Issues:345

DECA

DECA: Detailed Expression Capture and Animation (SIGGRAPH 2021)

Language:PythonLicense:NOASSERTIONStargazers:2075Issues:40Issues:211

LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

bolei_awesome_posters

CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!

StyleGAN-Human

StyleGAN-Human: A Data-Centric Odyssey of Human Generation

PTI

Official Implementation for "Pivotal Tuning for Latent-based editing of Real Images" (ACM TOG 2022) https://arxiv.org/abs/2106.05744

Language:Jupyter NotebookLicense:MITStargazers:891Issues:23Issues:57

Text2LIVE

Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)

Language:PythonLicense:MITStargazers:876Issues:28Issues:21

DiffusionCLIP

[CVPR 2022] Official PyTorch Implementation for DiffusionCLIP: Text-guided Image Manipulation Using Diffusion Models

Language:PythonLicense:NOASSERTIONStargazers:775Issues:8Issues:37

blended-latent-diffusion

Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]

Language:Jupyter NotebookLicense:MITStargazers:543Issues:49Issues:14
Language:Jupyter NotebookLicense:NOASSERTIONStargazers:475Issues:25Issues:22

sketchedit

SketchEdit: Mask-Free Local Image Manipulation with Partial Sketches, CVPR2022

Language:PythonLicense:NOASSERTIONStargazers:241Issues:11Issues:8

OPERA

[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

Language:PythonLicense:MITStargazers:216Issues:2Issues:33

StyleSwap

StyleSwap: Style-Based Generator Empowers Robust Face Swapping (ECCV 2022)

Language:PythonLicense:Apache-2.0Stargazers:198Issues:38Issues:12

DiffusionDisentanglement

Official implementation of the paper "Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:153Issues:6Issues:8

NED

PyTorch implementation for NED (CVPR 2022). It can be used to manipulate the facial emotions of actors in videos based on emotion labels or reference styles.

Language:PythonLicense:MITStargazers:152Issues:8Issues:8

MNeuEdit

Code for Mesh-Guided Neural Implicit Field Editing.

License:Apache-2.0Stargazers:19Issues:6Issues:0