municef1

문이세's repositories

3DDFA-V3

The official implementation of 3DDFA_V3 in CVPR2024 (Highlight).

Language:PythonMIT000

admet_ai

Training and prediction scripts for Chemprop models trained on ADMET datasets

Language:HTMLMIT000

awesome-conditional-content-generation

Update-to-data resources for conditional content generation, including human motion generation, image or video generation and editing.

000

awesome-cvpr-2024

🤩 An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024

Language:PythonCC0-1.0000

This repo is used for recording and tracking some Multi-modal Body Language researchs，In this work, we present the first detailed survey on Multi-modal Body Language research. We survey the research in 2 directions: Recognition and Generation；and 4 parts: Cued Speech, Co-speech, Sign Language, Talking Head.

000

Awesome-Deepfake-Generation-and-Detection

A Survey on Deepfake Generation and Detection

000

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

MIT000

CogVideo

Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Language:PythonApache-2.0000

consistency2

NOASSERTION000

EmpathyEar

Multimodal Empathetic Chatbot

000

EXAONE-3.0

Official repository for EXAONE built by LG AI Research

NOASSERTION000

facefusion

Next generation face swapper and enhancer

Language:PythonNOASSERTION000

GaussianTalker

Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn and Seungryong Kim

NOASSERTION000

implicit-deepfake

Official repository of paper "ImplicitDeepfake: Plausible Face-Swapping through Implicit Deepfake Generation using NeRF and Gaussian Splatting"

000

IMS-Toucan

Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.

Apache-2.0000

LipSick

🤢 LipSick: Fast, High Quality, Low Resource Lipsync Tool 🤮

Unlicense000

Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

MIT000

MagicDance

[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

Language:Python010

Make-An-Audio-2

a text-conditional diffusion probabilistic model capable of generating high fidelity audio.

MIT000

Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Apache-2.0000

mindiffusion

Repository of lessons exploring image diffusion models, focused on understanding and education.

Language:PythonMIT000

multi-hmr

Pytorch demo code and models for Multi-HMR

Language:PythonNOASSERTION000

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Language:PythonNOASSERTION000

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Apache-2.0000

smplx

SMPL-X

Language:PythonNOASSERTION000

StoryDiffusion

Create Magic Story!

Apache-2.0000

talking-face-arxiv-daily

🎓 Update Talking-Face Research Papers Daily, Now Integrated with LLM Analysis.

Language:PythonApache-2.0000

top-cvpr-2024-papers

This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]

Language:PythonCC0-1.0000

UniAnimate

Code for Paper "UniAnimate: Taming Unified Video Diﬀusion Models for Consistent Human Image Animation".

Language:Python000

unique3d-diffusion

Language:Python000