Liujingxiu23's repositories

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Language: Python | License: MIT | Stargazers: 2 | Issues: 0

BEGANSing

Korean Singing Voice Synthesis based on Auto-regressive Boundary Equilibrium GAN

Language: Python | Stargazers: 1 | Issues: 0

CVPR2021-Papers-with-Code

A collection of CVPR 2021 papers and open-source projects

Stargazers: 1 | Issues: 0
Language: Python | License: MIT | Stargazers: 1 | Issues: 0
Language: Python | Stargazers: 1 | Issues: 0

NU-Wave-pytorch

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling

Language: Python | License: MIT | Stargazers: 1 | Issues: 0

AD-NeRF

This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

Language: Python | Stargazers: 0 | Issues: 0

awesome-audio-visual

A curated list of different papers and datasets in various areas of audio-visual processing

Stargazers: 0 | Issues: 0
License: NOASSERTION | Stargazers: 0 | Issues: 0

BlendShapeMaker

BlendShapeMaker python3.6

Stargazers: 0 | Issues: 0

BunchedLPCnet

This repository provides UNOFFICIAL Bunched LPCNet implementations with PyTorch.

License: BSD-3-Clause | Stargazers: 0 | Issues: 0

EVP

Code for paper 'Audio-Driven Emotional Video Portraits'.

Stargazers: 0 | Issues: 0

few-shot-vid2vid

PyTorch implementation for few-shot photorealistic video-to-video translation.

License: NOASSERTION | Stargazers: 0 | Issues: 0

first-order-model

This repository contains the source code for the paper "First Order Motion Model for Image Animation".

License: NOASSERTION | Stargazers: 0 | Issues: 0

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

License: MIT | Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0

Mead

MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]

License: MIT | Stargazers: 0 | Issues: 0

mlp-singer

Official implementation of MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis

License: MIT | Stargazers: 0 | Issues: 0

Muskits

An open-source music processing toolkit

License: Apache-2.0 | Stargazers: 0 | Issues: 0

neural-waveshaping-synthesis

Efficient neural audio synthesis in the waveform domain

License: MPL-2.0 | Stargazers: 0 | Issues: 0

Robust_Fine_Grained_Prosody_Control

PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis

License: BSD-3-Clause | Stargazers: 0 | Issues: 0

score_sde

Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Stargazers: 0 | Issues: 0

ssnt-tts

An implementation of SSNT-TTS.

License: BSD-3-Clause | Stargazers: 0 | Issues: 0

STYLER

STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, Interspeech 2021

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Stargazers: 0 | Issues: 0

Talking-Face-Generation-DAVS

Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)

License: MIT | Stargazers: 0 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

License: MPL-2.0 | Stargazers: 0 | Issues: 0

WaveFlow

WaveFlow: A Compact Flow-based Model for Raw Audio

Stargazers: 0 | Issues: 0

WaveGrad2

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

License: MIT | Stargazers: 0 | Issues: 0