MengShen0709

Nanyang Technological University

Singapore

https://mengshen0709.github.io/

Shen Meng's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language: Python | AGPL-3.0 | 140610 1076 7643

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: Python | NOASSERTION | 52287 939 1080

DeepFaceLab

DeepFaceLab is the leading software for creating deepfakes.

Language: Python | GPL-3.0 | 46647 1134 1340

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | MPL-2.0 | 34262 287 1099

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python | MIT | 30236 428 4186

roop

one-click face swap

Language: Python | GPL-3.0 | 28181 2530

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language: Python | MIT | 22991 511 2474

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language: Jupyter Notebook | MIT | 11585 97 342

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Language: Python | 10395 167 657

face-alignment

:fire: 2D and 3D Face alignment library built using PyTorch

Language: Python | BSD-3-Clause | 7032 172 311

bark-with-voice-clone

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Language: Jupyter Notebook | NOASSERTION | 3089 49 80

vall-e

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Language: Python | Apache-2.0 | 2010 49 126

noisereduce

Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)

Language: Jupyter Notebook | MIT | 1428 23 75

awesome-talking-head-generation

DiCE

Generate Diverse Counterfactual Explanations for any machine learning model.

Language: Python | MIT | 1344 19 169

Awesome-Deepfakes-Detection

A list of tools, papers and code related to Deepfake Detection.

av_hubert

A self-supervised learning framework for audio-visual speech

Language: Python | NOASSERTION | 835 15 111

fsgan

FSGAN - Official PyTorch Implementation

Language: Jupyter Notebook | CC0-1.0 | 750 29 172

Audio-driven-TalkingFace-HeadPose

Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (Arxiv 2020) and "Predicting Personalized Head Movement From Short Video and Speech Signal" (TMM 2022)

Language: Python | 720 25 70

visqol

Perceptual Quality Estimator for speech and audio

Language: C++ | Apache-2.0 | 682 28 71

StyleHEAT

[ECCV 2022] StyleHEAT: A framework for high-resolution editable talking face generation

Language: Python | MIT | 636 37 50

DiffTalk

[CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"

Language: Python | 440 45 39

EVP

Code for paper 'Audio-Driven Emotional Video Portraits'.

Language: Jupyter Notebook | 297 18 31

Speech-Editing-Toolkit

It's a repository for implementations of neural speech editing algorithms.

Language: Python | 187 9 24

ICT_DeepFake

Language: Python | 95 1 16

Multimodal-action-recognition

Code for selecting an action based on multimodal inputs. In this case, the inputs are voice and text.

Language: Python | 70 1 6

audio-visual-forensics

Language: Python | MIT | 68 6 11

unicats

DeepFake-Adapter

Code for DeepFake-Adapter: Dual-Level Adapter for DeepFake Detection

GILA

Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"

Language: Python | NOASSERTION | 17 1 4