Jiwen Yu's starred repositories
Make-A-Protagonist
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Prompt-Free-Diffusion
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024
Fantasia3D
(ICCV2023) official repository for "Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation"
Waifu2x-Extension-GUI
Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.
All-In-One-Deflicker
[CVPR2023] Blind Video Deflickering by Neural Filtering with a Flawed Atlas
learning_research
本人的科研经验
Personalize-SAM
Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds
threestudio
A unified framework for 3D content generation.
stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
SD-CN-Animation
This script allows to automate video stylization task using StableDiffusion and ControlNet.
IJCAI2023-CoNR
IJCAI2023 - Collaborative Neural Rendering using Anime Character Sheets
matting_human_datasets
人像matting数据集,包含34427张图像和对应的matting结果图。
Text2Performer
Code for Text2Performer. Paper: Text2Performer: Text-Driven Human Video Generation
SadTalker-Video-Lip-Sync
本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。
stylegan-t
[ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis