István Ketykó's starred repositories
GaussianTalker
Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn and Seungryong Kim
talking-face-arxiv-daily
🎓 Daily updates of talking-face research papers, now integrated with LLM analysis.
MTDVocaLiST
Official repository for the paper "Multimodal Transformer Distillation for Audio-Visual Synchronization" (ICASSP 2024).
MoCoGAN-HD
[ICLR 2021 Spotlight] A Good Image Generator Is What You Need for High-Resolution Video Synthesis
stylegan3-editing
Official Implementation of "Third Time's the Charm? Image and Video Editing with StyleGAN3" (AIM ECCVW 2022) https://arxiv.org/abs/2201.13433
StoryDiffusion
Create Magic Story!
AutoLink-Self-supervised-Learning-of-Human-Skeletons-and-Object-Outlines-by-Linking-Keypoints
[NeurIPS 2022] AutoLink, a simple and novel unsupervised approach to detect keypoints from single static images
understanding-mediapipe-facemesh-output
Resources for understanding the output of MediaPipe's Face Mesh.
WeightStandardization
Standardizing weights to accelerate micro-batch training
SemanticGuidedHumanMatting
Robust Human Matting via Semantic Guidance, ACCV 2022.
GaussianAvatars
[CVPR 2024 Highlight] The official repo for "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians"
DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
CelebAMask-HQ
A large-scale face dataset for face parsing, recognition, generation and editing.
FFHQ-Aging-Dataset
FFHQ-Aging Dataset
co-tracker
CoTracker is a model for tracking any point (pixel) on a video.