Liujingxiu23

followers

following

stars

Liujingxiu23's repositories

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Language:PythonMIT200

BEGANSing

Korean Singing Voice Synthesis based on Auto-regressive Boundary Equilibrium GAN

Language:Python100

CVPR2021-Papers-with-Code

CVPR 2021 论文和开源项目合集

100

guided-diffusion

Language:PythonMIT100

musicXML_parser

Language:Python100

NU-Wave-pytorch

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling

Language:PythonMIT100

AD-NeRF

This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

Language:Python000

awesome-audio-visual

A curated list of different papers and datasets in various areas of audio-visual processing

000

BinauralSpeechSynthesis

N/A

NOASSERTION000

BlendShapeMaker

BlendShapeMaker python3.6

000

BunchedLPCnet

This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.

BSD-3-Clause000

EVP

Code for paper 'Audio-Driven Emotional Video Portraits'.

000

few-shot-vid2vid

Pytorch implementation for few-shot photorealistic video-to-video translation.

NOASSERTION000

first-order-model

This repository contains the source code for the paper First Order Motion Model for Image Animation

NOASSERTION000

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

MIT000

Hierarchica_remake

000

HiFiSinger

Language:PythonMIT000

Mead

MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]

MIT000

mlp-singer

Official implementation of MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis

MIT000

Muskits

An opensource music processing toolkit

Apache-2.0000

neural-waveshaping-synthesis

efficient neural audio synthesis in the waveform domain

MPL-2.0000

Robust_Fine_Grained_Prosody_Control

PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis

BSD-3-Clause000

score_sde

Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

000

ssnt-tts

An implementation of SSNT-TTS.

BSD-3-Clause000

STYLER

STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, Interspeech 2021

Language:PythonMIT000

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

000

Talking-Face-Generation-DAVS

Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)

MIT000

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

MPL-2.0000

WaveFlow

WaveFlow : A Compact Flow-based Model for Raw Audio

000

WaveGrad2

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

MIT000