문이세's repositories
3DDFA-V3
The official implementation of 3DDFA_V3 in CVPR2024 (Highlight).
admet_ai
Training and prediction scripts for Chemprop models trained on ADMET datasets
awesome-conditional-content-generation
Update-to-data resources for conditional content generation, including human motion generation, image or video generation and editing.
awesome-cvpr-2024
🤩 An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024
awesome-body-language
This repo is used for recording and tracking some Multi-modal Body Language researchs,In this work, we present the first detailed survey on Multi-modal Body Language research. We survey the research in 2 directions: Recognition and Generation;and 4 parts: Cued Speech, Co-speech, Sign Language, Talking Head.
Awesome-Deepfake-Generation-and-Detection
A Survey on Deepfake Generation and Detection
champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
CogVideo
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
EmpathyEar
Multimodal Empathetic Chatbot
EXAONE-3.0
Official repository for EXAONE built by LG AI Research
facefusion
Next generation face swapper and enhancer
GaussianTalker
Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn and Seungryong Kim
implicit-deepfake
Official repository of paper "ImplicitDeepfake: Plausible Face-Swapping through Implicit Deepfake Generation using NeRF and Gaussian Splatting"
IMS-Toucan
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
LipSick
🤢 LipSick: Fast, High Quality, Low Resource Lipsync Tool 🤮
Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
MagicDance
[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
Make-An-Audio-2
a text-conditional diffusion probabilistic model capable of generating high fidelity audio.
Marigold
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
mindiffusion
Repository of lessons exploring image diffusion models, focused on understanding and education.
multi-hmr
Pytorch demo code and models for Multi-HMR
MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
smplx
SMPL-X
StoryDiffusion
Create Magic Story!
talking-face-arxiv-daily
🎓 Update Talking-Face Research Papers Daily, Now Integrated with LLM Analysis.
top-cvpr-2024-papers
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]
UniAnimate
Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".