wangqi-xxxx's repositories
SMFANet
[ECCV 2024] SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution
ScribbleArchitect
Transform your simple scribbles into architectural designs using style transfer with Stable Diffusion, LCM, IP Adapters and ControlNet. Scribble Architect combines creativity with generative AI technology, improving the inspiration process.
MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
scribble-diffusion
Turn your rough sketch into a refined image using AI
Vitron
A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
DragNoise
[CVPR2024] Official code for Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagation
FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
VideoBooth
[CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Prompts
StoryGen
[CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models
FaceDNeRF
FaceDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models (NeurIPS 2023)
SeeSR
[CVPR2024] SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
StableVITON
[CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
DisCo
[CVPR2024] DisCo: Referring Human Dance Generation in Real World
DynamiCrafter
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
CoDeF
Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
StableVideo
[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing
robsut-wrod-reocginiton
Robsut Wrod Reocginiton via semi-Character Recurrent Neural Network