ketyi

István Ketykó's starred repositories

xlstm

Official repository of the xLSTM.

Language:PythonAGPL-3.075300

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.

Language:Python185000

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Language:PythonNOASSERTION158100

Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn and Seungryong Kim

Language:PythonNOASSERTION16500

talking-face-arxiv-daily

🎓 Update Talking-Face Research Papers Daily, Now Integrated with LLM Analysis.

Language:PythonApache-2.03700

MTDVocaLiST

Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024).

Language:Python1700

Cap

Open source Loom alternative. Effortless, instant screen sharing.

Language:TypeScriptAGPL-3.0327600

MoCoGAN-HD

[ICLR 2021 Spotlight] A Good Image Generator Is What You Need for High-Resolution Video Synthesis

Language:PythonNOASSERTION23800

stylegan3-editing

Official Implementation of "Third Time's the Charm? Image and Video Editing with StyleGAN3" (AIM ECCVW 2022) https://arxiv.org/abs/2201.13433

Language:PythonMIT64500

AniTalker

Apache-2.0110300

StoryDiffusion

Create Magic Story!

Language:Jupyter NotebookApache-2.0523400

AutoLink-Self-supervised-Learning-of-Human-Skeletons-and-Object-Outlines-by-Linking-Keypoints

[NeurIPS 2022] AutoLink, a simple and novel unsupervised approach to detect keypoints from single static images

Language:PythonMIT4000

understanding-mediapipe-facemesh-output

Resources for understanding the output of MediaPipe's Face Mesh.

Language:JavaScriptApache-2.01100

WeightStandardization

Standardizing weights to accelerate micro-batch training

54300

ARLDM

Official Pytorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models

Language:PythonMIT18000

SemanticGuidedHumanMatting

Robust Human Matting via Semantic Guidance, ACCV 2022.

Language:PythonMIT22000

FaceTalk

[CVPR 2024] FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models

Language:ShellNOASSERTION14300

DPHMs

[CVPR2024] DPHMs: Diffusion Parametric Head Models for Depth-based Tracking

3800

NPHM

[CVPR'23] Learning Neural Parametric Head Models

Language:PythonNOASSERTION19300

GaussianAvatars

[CVPR 2024 Highlight] The official repo for "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians"

Language:Python47800

DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Language:PythonMIT416100

GeneFacePlusPlus

GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code

Language:PythonMIT121600

ganavatar

[3DV'24] GAN-Avatar: Controllable Personalized GAN-based Human Head Avatar

Language:PythonNOASSERTION5300

CelebAMask-HQ

A large-scale face dataset for face parsing, recognition, generation and editing.

Language:Python202800

FFHQ-Aging-Dataset

FFHQ-Aging Dataset

Language:PythonNOASSERTION25500

differential-diffusion

Language:Python31700

co-tracker

CoTracker is a model for tracking any point (pixel) on a video.

Language:Jupyter NotebookNOASSERTION248500

dot

Dense Optical Tracking: Connecting the Dots

Language:PythonMIT20300