Jiwen Yu's starred repositories

Voyager

An Open-Ended Embodied Agent with Large Language Models

Language:JavaScriptLicense:MITStargazers:5411Issues:0Issues:0

DCCM

Compressive Confocal Microscopy Imaging at the Single-Photon Level with Ultra-Low Sampling Ratios (Communications Engineering 2024) [PyTorch]

Language:PythonStargazers:7Issues:0Issues:0

Make-A-Protagonist

Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts

Language:PythonLicense:Apache-2.0Stargazers:316Issues:0Issues:0

Prompt-Free-Diffusion

Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024

Language:PythonLicense:MITStargazers:716Issues:0Issues:0

Fantasia3D

(ICCV2023) official repository for "Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation"

Language:PythonLicense:Apache-2.0Stargazers:718Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:1096Issues:0Issues:0

Waifu2x-Extension-GUI

Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.

Language:C++License:NOASSERTIONStargazers:12565Issues:0Issues:0

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Language:PythonLicense:NOASSERTIONStargazers:35612Issues:0Issues:0

MasaCtrl

[ICCV 2023] Consistent Image Synthesis and Editing

Language:PythonLicense:Apache-2.0Stargazers:687Issues:0Issues:0
Language:PythonLicense:MITStargazers:582Issues:0Issues:0

All-In-One-Deflicker

[CVPR2023] Blind Video Deflickering by Neural Filtering with a Flawed Atlas

Language:PythonStargazers:670Issues:0Issues:0

StableSR

[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution

Language:PythonLicense:NOASSERTIONStargazers:2018Issues:0Issues:0

ImageBind

ImageBind One Embedding Space to Bind Them All

Language:PythonLicense:NOASSERTIONStargazers:8124Issues:0Issues:0

learning_research

本人的科研经验

Stargazers:5053Issues:0Issues:0

Personalize-SAM

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

Language:PythonLicense:MITStargazers:1478Issues:0Issues:0

threestudio

A unified framework for 3D content generation.

Language:PythonLicense:Apache-2.0Stargazers:6003Issues:0Issues:0

stable-dreamfusion

Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.

Language:PythonLicense:Apache-2.0Stargazers:8065Issues:0Issues:0

SD-CN-Animation

This script allows to automate video stylization task using StableDiffusion and ControlNet.

Language:PythonLicense:MITStargazers:806Issues:0Issues:0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:9323Issues:0Issues:0

dolphin

General video interaction platform based on LLMs, including Video ChatGPT

Language:PythonLicense:MITStargazers:248Issues:0Issues:0

IJCAI2023-CoNR

IJCAI2023 - Collaborative Neural Rendering using Anime Character Sheets

Language:Jupyter NotebookLicense:MITStargazers:791Issues:0Issues:0

matting_human_datasets

人像matting数据集,包含34427张图像和对应的matting结果图。

License:NOASSERTIONStargazers:599Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:7592Issues:0Issues:0

mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6776Issues:0Issues:0

GPT4Tools

GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the user to interact with images during a conversation.

Language:PythonLicense:NOASSERTIONStargazers:746Issues:0Issues:0

Text2Performer

Code for Text2Performer. Paper: Text2Performer: Text-Driven Human Video Generation

Language:PythonLicense:NOASSERTIONStargazers:312Issues:0Issues:0

lama

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7642Issues:0Issues:0

SadTalker-Video-Lip-Sync

本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。

Language:PythonStargazers:1750Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:391Issues:0Issues:0

stylegan-t

[ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis

Language:PythonLicense:NOASSERTIONStargazers:1141Issues:0Issues:0