Manish Sahu's starred repositories

DeepFaceLab

DeepFaceLab is the leading software for creating deepfakes.

Language:PythonLicense:GPL-3.0Stargazers:46356Issues:1132Issues:1341

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Language:PythonLicense:NOASSERTIONStargazers:35604Issues:1002Issues:186

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language:PythonLicense:MITStargazers:11214Issues:166Issues:222

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Language:PythonLicense:Apache-2.0Stargazers:9317Issues:77Issues:109

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language:Jupyter NotebookLicense:MITStargazers:8615Issues:131Issues:436

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonLicense:Apache-2.0Stargazers:6139Issues:71Issues:230

notebooks

Notebooks using the Hugging Face libraries 🤗

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3465Issues:71Issues:159

LightGlue

LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

Language:PythonLicense:Apache-2.0Stargazers:3180Issues:50Issues:101

DIS

This is the repo for our new project Highly Accurate Dichotomous Image Segmentation

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2125Issues:92Issues:119

uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Language:PythonLicense:Apache-2.0Stargazers:977Issues:14Issues:25

Wav2Lip-GFPGAN

High quality Lip sync

DINet

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

IP_LAP

CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors

Language:PythonLicense:Apache-2.0Stargazers:637Issues:18Issues:54

StyleHEAT

[ECCV 2022] StyleHEAT: A framework for high-resolution editable talking face generation

Language:PythonLicense:MITStargazers:619Issues:37Issues:50

photos-app

➡️ Moved to https://github.com/ente-io/ente

Language:DartLicense:GPL-3.0Stargazers:534Issues:10Issues:83

onnx2tflite

Tool for onnx->keras or onnx->tflite. If tool is useful for you, please star it.

Language:PythonLicense:Apache-2.0Stargazers:489Issues:5Issues:65

FACIAL

FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.

Language:PythonLicense:AGPL-3.0Stargazers:375Issues:11Issues:90

DABA

Official implementation of "Decentralization and Acceleration Enables Large-Scale Bundle Adjustment"

Language:CudaLicense:MITStargazers:348Issues:9Issues:8

Mead

MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]

Language:PythonLicense:MITStargazers:229Issues:8Issues:34

MARLIN

[CVPR] MARLIN: Masked Autoencoder for facial video Representation LearnINg

Language:PythonLicense:NOASSERTIONStargazers:210Issues:9Issues:22

Learnbale_Bandpass_Filter

Image Demoireing with Learnable Bandpass Filters. (CVPR, 2020)(Keras+TensorFlow)

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:110Issues:11Issues:23
Language:PythonLicense:Apache-2.0Stargazers:94Issues:6Issues:12

Wav2Lip-Emotion

Wav2Lip-Emotion extends Wav2Lip to modify facial expressions of emotions via L1 reconstruction and pre-trained emotion objectives. We also propose a novel automatic evaluation for emotion modification corroborated with a user study.

TheGorgeousLogin

Login page built with @flutter 😍

Language:DartLicense:MITStargazers:17Issues:1Issues:0
Language:DartStargazers:15Issues:0Issues:0
Language:DartStargazers:11Issues:0Issues:0
Language:PythonStargazers:1Issues:0Issues:0