Shen Meng (MengShen0709)

Company: Nanyang Technological University

Location: Singapore

Home Page: https://mengshen0709.github.io/

Shen Meng's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language: Python, License: AGPL-3.0, Stargazers: 140610, Issues: 1076, Issues: 7643

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: Python, License: NOASSERTION, Stargazers: 52287, Issues: 939, Issues: 1080

DeepFaceLab

DeepFaceLab is the leading software for creating deepfakes.

Language: Python, License: GPL-3.0, Stargazers: 46647, Issues: 1134, Issues: 1340

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python, License: MPL-2.0, Stargazers: 34262, Issues: 287, Issues: 1099

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python, License: MIT, Stargazers: 30236, Issues: 428, Issues: 4186

roop

One-click face swap

Language: Python, License: GPL-3.0, Stargazers: 28181, Issues: 253, Issues: 0

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language: Python, License: MIT, Stargazers: 22991, Issues: 511, Issues: 2474

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language: Jupyter Notebook, License: MIT, Stargazers: 11585, Issues: 97, Issues: 342

Wav2Lip

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, try Sync Labs.

face-alignment

🔥 2D and 3D face alignment library built using PyTorch

Language: Python, License: BSD-3-Clause, Stargazers: 7032, Issues: 172, Issues: 311

bark-with-voice-clone

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Language: Jupyter Notebook, License: NOASSERTION, Stargazers: 3089, Issues: 49, Issues: 80

vall-e

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Language: Python, License: Apache-2.0, Stargazers: 2010, Issues: 49, Issues: 126

noisereduce

Noise reduction in Python using spectral gating (speech, bioacoustics, audio, time-domain signals)

Language: Jupyter Notebook, License: MIT, Stargazers: 1428, Issues: 23, Issues: 75

DiCE

Generate Diverse Counterfactual Explanations for any machine learning model.

Language: Python, License: MIT, Stargazers: 1344, Issues: 19, Issues: 169

Awesome-Deepfakes-Detection

A list of tools, papers and code related to Deepfake Detection.

av_hubert

A self-supervised learning framework for audio-visual speech

Language: Python, License: NOASSERTION, Stargazers: 835, Issues: 15, Issues: 111

fsgan

FSGAN - Official PyTorch Implementation

Language: Jupyter Notebook, License: CC0-1.0, Stargazers: 750, Issues: 29, Issues: 172

Audio-driven-TalkingFace-HeadPose

Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (arXiv 2020) and "Predicting Personalized Head Movement From Short Video and Speech Signal" (TMM 2022)

visqol

Perceptual Quality Estimator for speech and audio

Language: C++, License: Apache-2.0, Stargazers: 682, Issues: 28, Issues: 71

StyleHEAT

[ECCV 2022] StyleHEAT: A framework for high-resolution editable talking face generation

Language: Python, License: MIT, Stargazers: 636, Issues: 37, Issues: 50

DiffTalk

[CVPR 2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"

EVP

Code for paper 'Audio-Driven Emotional Video Portraits'.

Language: Jupyter Notebook, Stargazers: 297, Issues: 18, Issues: 31

Speech-Editing-Toolkit

A repository of implementations of neural speech editing algorithms.

Multimodal-action-recognition

Code for selecting an action based on multimodal inputs. In this case, the inputs are voice and text.

DeepFake-Adapter

Code for DeepFake-Adapter: Dual-Level Adapter for DeepFake Detection

GILA

Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"

Language: Python, License: NOASSERTION, Stargazers: 17, Issues: 1, Issues: 4