문이세's repositories
Marigold
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
multi-hmr
PyTorch demo code and models for Multi-HMR
Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Rope
GUI-focused roop
InstructAvatar
Official implementation of the paper 'InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation'
MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
facefusion
Next generation face swapper and enhancer
implicit-deepfake
Official repository of the paper "ImplicitDeepfake: Plausible Face-Swapping through Implicit Deepfake Generation using NeRF and Gaussian Splatting"
Make-An-Audio-2
A text-conditional diffusion probabilistic model capable of generating high-fidelity audio.
StoryDiffusion
Create Magic Story!
GFPGAN-1024
GFPGAN 1024
MagicDance
[ICML 2024] MagicPose (also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
smplx
SMPL-X
tech-interview-for-developer
👶🏻 Encyclopedia of computer science fundamentals and technical interview topics for entry-level developers 📖
netron
Visualizer for neural network, deep learning and machine learning models
ml-hugs
Official repository of HUGS: Human Gaussian Splats (CVPR 2024)
awesome-blender
🪐 A curated list of awesome Blender addons, tools, tutorials, and 3D resources for everyone.
whisper-vits-svc
Core Engine of Singing Voice Conversion & Singing Voice Clone
One-2-3-45
[NeurIPS 2023] Official code of "One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization"
HQ-Edit
HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing
Awesome-Avatars
List of recent advances in human avatars, including generation, reconstruction, and editing.
DGInStyle
DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control
DGInStyle-SegModel
Downstream semantic segmentation evaluation of DGInStyle.
point2cad
Code for "Point2CAD: Reverse Engineering CAD Models from 3D Point Clouds"
SoraReview
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".
HyperLips
Official PyTorch implementation of our paper "HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation".
TeCH
[3DV 2024] Official repo of "TeCH: Text-guided Reconstruction of Lifelike Clothed Humans"
articulated-animation
Code for the paper "Motion Representations for Articulated Animation"
talking-head-anime-4-demo
Demo Programs for the "Talking Head(?) Anime from a Single Image 4: Improved Models and Its Distillation" Project
points2poly
Reconstructing compact building models from point clouds using deep implicit fields [ISPRS 2022]