Beast code in Giters

Yunlin Chen's repositories

AnimateAnyone-unofficial

Unofficial Implementation of Animate Anyone

VPEval

VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation

MIT

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Apache-2.0

audiowmark

Audio Watermarking

GPL-3.0

CS-Books

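🔥🔥 Over 1,000 classic computer science books, personal notes, and resources referenced in the author's articles published on various platforms. The book resources cover C/C++, Java, Python, Go, data structures and algorithms, operating systems, backend architecture, computer systems, databases, computer networking, design patterns, frontend, assembly, and a range of interview write-ups for campus and experienced-hire recruiting.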

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

NOASSERTION

NeuralVoicePuppetryMMD

This GitHub repository contains the network architectures of NeuralVoicePuppetry.

NOASSERTION

WaveGrad

Implementation of Google Brain's WaveGrad vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

BSD-3-Clause

PPSpeech

PPSpeech: Phrase based Parallel End-to-End TTS System

Wav2Lip

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.

Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

MIT

auorange

Audio LPC (linear prediction coding) using mel spectrograms, compatible with LPCNet

MIT

chrome-music-lab

A collection of experiments for exploring how music works, all built with the Web Audio API.

Apache-2.0

jukebox

Code for the paper "Jukebox: A Generative Model for Music"

NOASSERTION

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with PyTorch

MIT

multiband_melgan

An unofficial implementation of https://arxiv.org/abs/2005.05106

MIT

DurIAN

Implementation of the paper "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf).

BSD-3-Clause

crank

Non-parallel Voice Conversion

MIT

google-research

Google Research

Apache-2.0

pitch-net

Audio samples of our paper "PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network" (accepted by ICASSP2020).

spleeter

Deezer source separation library including pretrained models.

MIT

WGANSing

Multi-voice singing voice synthesis

autotuner

speech-driven-animation

face-nn

Game character face customization: generates realistic faces using a neural style transfer framework

MIT

melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

BSD-3-Clause

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

NOASSERTION

torch_npss

PyTorch implementation of the Neural Parametric Singing Synthesizer (singing voice synthesis)

MIT

pytorch-CycleGAN-and-pix2pix

Image-to-Image Translation in PyTorch

NOASSERTION

vae_tacotron2

VAE Tacotron 2, an alternative to GST Tacotron

MIT