Dogucan Yaman (yamand16)

Location: Germany

Dogucan Yaman's starred repositories

Language: Python, License: MIT, Stargazers: 6092, Issues: 0

CREMA-D

Crowd Sourced Emotional Multimodal Actors Dataset (CREMA-D)

Language: R, License: NOASSERTION, Stargazers: 344, Issues: 0

HDTF

The dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"

Language: Python, License: GPL-3.0, Stargazers: 343, Issues: 0

yt-dlp

A feature-rich command-line audio/video downloader

Language: Python, License: Unlicense, Stargazers: 83318, Issues: 0

av_hubert

A self-supervised learning framework for audio-visual speech

Language: Python, License: NOASSERTION, Stargazers: 833, Issues: 0

MDTVSFA

[official] Unified Quality Assessment of In-the-Wild Videos with Mixed Datasets Training (IJCV 2021)

Language: Python, License: MIT, Stargazers: 82, Issues: 0

Language: Python, Stargazers: 396, Issues: 0

StyleSync

Official code of CVPR '23 paper "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"

Language: Python, Stargazers: 290, Issues: 0

Deep3DFaceRecon_pytorch

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Language: Python, License: MIT, Stargazers: 1656, Issues: 0

DINet

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

Language: Python, Stargazers: 961, Issues: 0

Palette-Image-to-Image-Diffusion-Models

Unofficial implementation of Palette: Image-to-Image Diffusion Models in PyTorch

Language: Python, License: MIT, Stargazers: 1496, Issues: 0

PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Language: Python, License: Apache-2.0, Stargazers: 6496, Issues: 0

GFPGAN

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Language: Python, License: NOASSERTION, Stargazers: 35604, Issues: 0

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language: Python, License: Apache-2.0, Stargazers: 6418, Issues: 0

DeepLip

Deep-learning-based audio-visual lip biometrics

Language: Python, Stargazers: 14, Issues: 0

Language: Python, License: NOASSERTION, Stargazers: 34517, Issues: 0

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 8860, Issues: 0

VL-T5

PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)

Language: Python, License: MIT, Stargazers: 357, Issues: 0

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 46860, Issues: 0

Awesome-Novel-Class-Discovery

A list of papers that study Novel Class Discovery

Stargazers: 426, Issues: 0

gans-in-action

Companion repository to GANs in Action: Deep learning with Generative Adversarial Networks

Language: Jupyter Notebook, Stargazers: 1008, Issues: 0

muavic

MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation

Language: Python, License: NOASSERTION, Stargazers: 353, Issues: 0

Language: Python, License: NOASSERTION, Stargazers: 6273, Issues: 0

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Language: Python, License: BSD-3-Clause, Stargazers: 27840, Issues: 0

BasicSR

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also supports StyleGAN2 and DFDNet.

Language: Python, License: Apache-2.0, Stargazers: 6696, Issues: 0

VQFR

ECCV 2022, Oral, VQFR: Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

Language: Python, License: NOASSERTION, Stargazers: 325, Issues: 0

facexlib

FaceXlib aims at providing ready-to-use face-related functions based on current SOTA open-source methods.

Language: Python, License: MIT, Stargazers: 820, Issues: 0

Language: Python, License: MIT, Stargazers: 19, Issues: 0

AAAI22-one-shot-talking-face

Code for One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning (AAAI 2022)

Language: Python, Stargazers: 352, Issues: 0