István Ketykó (ketyi)

ketyi

Geek Repo

Company:Colossyan

Location:Budapest

Home Page:https://orcid.org/0000-0003-4931-4580

Github PK Tool:Github PK Tool


Organizations
colossyan
deep-modeling
Representation-learning-for-PM

István Ketykó's starred repositories

xlstm

Official repository of the xLSTM.

Language:PythonLicense:AGPL-3.0Stargazers:753Issues:0Issues:0

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.

Language:PythonStargazers:1850Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:106Issues:0Issues:0

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Language:PythonLicense:NOASSERTIONStargazers:1581Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:115Issues:0Issues:0

GaussianTalker

Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn and Seungryong Kim

Language:PythonLicense:NOASSERTIONStargazers:165Issues:0Issues:0

talking-face-arxiv-daily

🎓 Update Talking-Face Research Papers Daily, Now Integrated with LLM Analysis.

Language:PythonLicense:Apache-2.0Stargazers:37Issues:0Issues:0

MTDVocaLiST

Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024).

Language:PythonStargazers:17Issues:0Issues:0

Cap

Open source Loom alternative. Effortless, instant screen sharing.

Language:TypeScriptLicense:AGPL-3.0Stargazers:3276Issues:0Issues:0

MoCoGAN-HD

[ICLR 2021 Spotlight] A Good Image Generator Is What You Need for High-Resolution Video Synthesis

Language:PythonLicense:NOASSERTIONStargazers:238Issues:0Issues:0

stylegan3-editing

Official Implementation of "Third Time's the Charm? Image and Video Editing with StyleGAN3" (AIM ECCVW 2022) https://arxiv.org/abs/2201.13433

Language:PythonLicense:MITStargazers:645Issues:0Issues:0
License:Apache-2.0Stargazers:1103Issues:0Issues:0

StoryDiffusion

Create Magic Story!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5234Issues:0Issues:0

AutoLink-Self-supervised-Learning-of-Human-Skeletons-and-Object-Outlines-by-Linking-Keypoints

[NeurIPS 2022] AutoLink, a simple and novel unsupervised approach to detect keypoints from single static images

Language:PythonLicense:MITStargazers:40Issues:0Issues:0

understanding-mediapipe-facemesh-output

Resources for understanding the output of MediaPipe's Face Mesh.

Language:JavaScriptLicense:Apache-2.0Stargazers:11Issues:0Issues:0

WeightStandardization

Standardizing weights to accelerate micro-batch training

Stargazers:543Issues:0Issues:0

ARLDM

Official Pytorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models

Language:PythonLicense:MITStargazers:180Issues:0Issues:0

SemanticGuidedHumanMatting

Robust Human Matting via Semantic Guidance, ACCV 2022.

Language:PythonLicense:MITStargazers:220Issues:0Issues:0

FaceTalk

[CVPR 2024] FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models

Language:ShellLicense:NOASSERTIONStargazers:143Issues:0Issues:0

DPHMs

[CVPR2024] DPHMs: Diffusion Parametric Head Models for Depth-based Tracking

Stargazers:38Issues:0Issues:0

NPHM

[CVPR'23] Learning Neural Parametric Head Models

Language:PythonLicense:NOASSERTIONStargazers:193Issues:0Issues:0

GaussianAvatars

[CVPR 2024 Highlight] The official repo for "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians"

Language:PythonStargazers:478Issues:0Issues:0

DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Language:PythonLicense:MITStargazers:4161Issues:0Issues:0

GeneFacePlusPlus

GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code

Language:PythonLicense:MITStargazers:1216Issues:0Issues:0

ganavatar

[3DV'24] GAN-Avatar: Controllable Personalized GAN-based Human Head Avatar

Language:PythonLicense:NOASSERTIONStargazers:53Issues:0Issues:0

CelebAMask-HQ

A large-scale face dataset for face parsing, recognition, generation and editing.

Language:PythonStargazers:2028Issues:0Issues:0

FFHQ-Aging-Dataset

FFHQ-Aging Dataset

Language:PythonLicense:NOASSERTIONStargazers:255Issues:0Issues:0
Language:PythonStargazers:317Issues:0Issues:0

co-tracker

CoTracker is a model for tracking any point (pixel) on a video.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2485Issues:0Issues:0

dot

Dense Optical Tracking: Connecting the Dots

Language:PythonLicense:MITStargazers:203Issues:0Issues:0