Beast code in Giters

Yunlin Chen's starred repositories

google-research

Google Research

Language: Jupyter Notebook | License: Apache-2.0 | 33541 747 1220

avatarify-python

Avatars for Zoom, Skype and other video-conferencing apps.

Language: Python | License: NOASSERTION | 16192 321 611

first-order-model

This repository contains the source code for the paper First Order Motion Model for Image Animation

Language: Jupyter Notebook | License: MIT | 14361 351 530

stylegan

StyleGAN - Official TensorFlow Implementation

Language: Python | License: NOASSERTION | 14044 4500

pifuhd

High-Resolution 3D Human Digitization from A Single Image.

Language: Python | License: NOASSERTION | 9467 271 182

vid2vid

Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.

Language: Python | License: NOASSERTION | 8546 247 167

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | 8176 179 2335

deep-voice-conversion

Deep neural networks for voice conversion (voice style transfer) in Tensorflow

Language: Python | License: MIT | 3909 161 128

essentia

C++ library for audio and music analysis, description and synthesis, including Python bindings

Language: C++ | License: AGPL-3.0 | 2776 109 1045

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Language: Jupyter Notebook | License: MIT | 1530 45 253

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Language: Python | License: MIT | 1267 36 704

TransformerTTS

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

Language: Python | License: NOASSERTION | 1116 33 95

athena

An open-source implementation of a sequence-to-sequence based speech processing engine

Language: C++ | License: Apache-2.0 | 949 37 137

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Language: Python | License: MIT | 876 23 32

chinese_text_normalization

Chinese text normalization for speech processing

Language: Python | License: MIT | 609 15 12

CaricatureFace

The source code for paper "Landmark Detection and 3D Face Reconstruction for Caricature using a Nonlinear Parametric Model".

Language: Python | 573 38 39

Voice-Converter-CycleGAN

Voice Converter Using CycleGAN and Non-Parallel Data

Language: Python | License: MIT | 524 12 36

nara_wpe

Different implementations of "Weighted Prediction Error" for speech dereverberation

Language: Python | License: MIT | 470 18 37

LipSync

LipSync for Unity3D: generates mouth-shape (lip-sync) animation from speech, with FMOD support.

Language: CMake | License: MIT | 399 12 7

g2pm

A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset

Language: Python | License: Apache-2.0 | 333 15 18

durian-pytorch

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.

Language: Python | License: BSD-3-Clause | 181 8 10

crank

A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder

Language: Python | License: MIT | 167 9 28

libri_css

Libri-CSS: dataset and evaluation pipeline

Language: Python | License: NOASSERTION | 129 9 7

athena

An automation platform with a plugin architecture that allows you to easily create and share services.

Language: Shell | License: Apache-2.0 | 91 16 21

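seq2annotation

A general-purpose sequence labeling library based on TensorFlow and PaddlePaddle (currently includes BiLSTM+CRF, Stacked-BiLSTM+CRF, and IDCNN+CRF, with more algorithms being added) for sequence labeling tasks such as Chinese word segmentation (tokenization), part-of-speech (POS) tagging, and named entity recognition (NER).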

Language: Python | License: Apache-2.0 | 85 9 10

GST_Tacotron

Implementation of Global Style Token Tacotron in TensorFlow2

Language: Python | License: MIT | 25 5 6

WaveGlow

A tensorflow implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

Language: Python | 20 3 1

spleeter-as-a-service

API implementation of Song Source spleeting from Spleeter by Deezer

Language: Python | License: MIT | 12 20

Tacotron-2

Tensorflow implementation of DeepMind's Tacotron-2 (without wavenet)

Language: Python | License: MIT | 1100

Animated_Avatar_LipSync

Language: Python | 3 10

linzai1992
