Beast code in Giters

Yunlin Chen's starred repositories

Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion

A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting works (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR...etc).

38600

fast-vid2vid

The code for ECCV22 paper "Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis"

Language:Python15600

Mead

MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]

Language:PythonMIT22700

Tune-A-Video

Unofficial implementation of Tune-A-Video

Language:Python18800

python-ffmpeg-video-streaming

📼 Package media content for online streaming(DASH and HLS) using FFmpeg

Language:PythonMIT82700

ffmpeg-python

Python bindings for FFmpeg - with complex filtering support

Language:PythonApache-2.0966200

ffmpeg-rtmp

Ffmpeg RTMP example

Language:C1200

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Language:PythonMIT291200

diffused-heads

Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation

Language:PythonNOASSERTION44700

Deep3DFaceRecon_pytorch

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Language:PythonMIT159200

frame-interpolation

FILM: Frame Interpolation for Large Motion, In ECCV 2022.

Language:PythonApache-2.0274000

RAD-NeRF

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

Language:PythonMIT85800

havenask

Language:C++Apache-2.0151900

css10

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

Language:HTMLApache-2.045600

Awesome-Image-Harmonization

A curated list of papers, code and resources pertaining to image harmonization.

40300

Face2FaceRHO

The Official PyTorch Implementation for Face2Face^ρ (ECCV2022)

Language:PythonBSD-3-Clause21200

chrome-music-lab

A collection of experiments for exploring how music works, all built with the Web Audio API.

Language:JavaScriptApache-2.0211500

botsim

BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots

Language:Jupyter NotebookBSD-3-Clause11300

Deep3DFaceReconstruction

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)

Language:PythonMIT214200

TTS-Portuguese-Corpus

Open Source Text-To-Speech Portuguese Dataset

CC-BY-4.014700

FastDeploy

⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.

Language:C++Apache-2.0281700

linzai1992