kai wang (wangkai930418)

Company: CVC, UAB

Location: Barcelona

Home Page: wangkai930418.github.io

kai wang's starred repositories

Language: Python | Stargazers: 8 | Issues: 0

Dimba

Transformer-Mamba Diffusion Models

Language: Python | Stargazers: 25 | Issues: 0

IDM-VTON

IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Language: Python | Stargazers: 2726 | Issues: 0

tryondiffusion

TryOnDiffusion: A Tale of Two UNets Implementation

Language: Jupyter Notebook | Stargazers: 309 | Issues: 0

PatchScaler

PatchScaler: An Efficient Patch-independent Diffusion Model for Super-Resolution

License: Apache-2.0 | Stargazers: 16 | Issues: 0

fondant-usecase-controlnet

Example Fondant pipeline preparing data to train a Controlnet model

Language: Jupyter Notebook | Stargazers: 15 | Issues: 0

fondant-clip-index

Create a CLIP index for an image dataset with Fondant

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 3 | Issues: 0

WB_sRGB

White balance camera-rendered sRGB images (CVPR 2019) [Matlab & Python]

Language: MATLAB | License: NOASSERTION | Stargazers: 316 | Issues: 0

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language: Python | License: MIT | Stargazers: 3356 | Issues: 0

DMPlug

This is the official implementation of "DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion Models".

Language: Python | Stargazers: 18 | Issues: 0

Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Language: Python | License: Apache-2.0 | Stargazers: 1836 | Issues: 0

refiners

A microframework on top of PyTorch with first-class citizen APIs for foundation model adaptation

Language: Python | License: MIT | Stargazers: 250 | Issues: 0

layerdiffuse

Implementation of layer diffuse inference using refiners

Language: Python | Stargazers: 19 | Issues: 0

AdvUnlearn

Official implementation of "Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models"

Language: Jupyter Notebook | License: CC-BY-4.0 | Stargazers: 7 | Issues: 0

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language: Python | Stargazers: 21666 | Issues: 0

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language: Python | License: NOASSERTION | Stargazers: 2087 | Issues: 0

ELLA

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment

Language: Python | License: Apache-2.0 | Stargazers: 892 | Issues: 0
Language: Python | License: MIT | Stargazers: 16 | Issues: 0

2d-gaussian-splatting

[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields

Language: Python | License: NOASSERTION | Stargazers: 1373 | Issues: 0

visual_anagrams

Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"

Language: Jupyter Notebook | License: MIT | Stargazers: 738 | Issues: 0

RDDM

CVPR 2024: Residual Denoising Diffusion Models

Language: Python | Stargazers: 215 | Issues: 0

research-GANwriting

Source code for ECCV20 "GANwriting: Content-Conditioned Generation of Styled Handwritten Word Images"

Language: Python | License: MIT | Stargazers: 65 | Issues: 0

MachineUnlearning-DocClassification

Official Implementation for ICDAR2024 paper "Machine Unlearning for Document Classification"

License: MIT | Stargazers: 3 | Issues: 0

DE-GAN

Document Image Enhancement with GANs - TPAMI journal

Language: Python | License: GPL-3.0 | Stargazers: 163 | Issues: 0

SSL-OCR

Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023

Language: Python | License: MIT | Stargazers: 22 | Issues: 0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python | License: MIT | Stargazers: 3620 | Issues: 0

CoMat

Official code for 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching

Language: Python | Stargazers: 103 | Issues: 0

tifa

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

Language: Python | License: Apache-2.0 | Stargazers: 115 | Issues: 0

T2I-CompBench

[Neurips 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation

Language: Python | License: MIT | Stargazers: 153 | Issues: 0

Diffuser-layerdiffuse

Unofficial implementation of Layer Diffuse in diffusers

Language: Python | Stargazers: 21 | Issues: 0