Yunlin Chen (linzai1992)

Company: @Microsoft

Location: Suzhou

Yunlin Chen's starred repositories

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Language: Python | License: MIT | Stargazers: 853 | Issues: 0

deep-voice-conversion

Deep neural networks for voice conversion (voice style transfer) in Tensorflow

Language: Python | License: MIT | Stargazers: 3903 | Issues: 0

chinese_text_normalization

Chinese text normalization for speech processing

Language: Python | License: MIT | Stargazers: 586 | Issues: 0

nara_wpe

Different implementations of "Weighted Prediction Error" for speech dereverberation

Language: Python | License: MIT | Stargazers: 460 | Issues: 0

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Language: Jupyter Notebook | License: MIT | Stargazers: 1499 | Issues: 0

Voice-Converter-CycleGAN

Voice Converter Using CycleGAN and Non-Parallel Data

Language: Python | License: MIT | Stargazers: 525 | Issues: 0

crank

A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder

Language: Python | License: MIT | Stargazers: 167 | Issues: 0

pifuhd

High-Resolution 3D Human Digitization from A Single Image.

Language: Python | License: NOASSERTION | Stargazers: 9445 | Issues: 0

libri_css

Libri-CSS: dataset and evaluation pipeline

Language: Python | License: NOASSERTION | Stargazers: 129 | Issues: 0

seq2annotation

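A general-purpose sequence labeling library based on TensorFlow & PaddlePaddle (currently includes BiLSTM+CRF, Stacked-BiLSTM+CRF, and IDCNN+CRF, with more algorithms being added) for sequence labeling tasks such as Chinese word segmentation (tokenization), part-of-speech (POS) tagging, and named entity recognition (NER).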

Language: Python | License: Apache-2.0 | Stargazers: 86 | Issues: 0

CaricatureFace

The source code for paper "Landmark Detection and 3D Face Reconstruction for Caricature using a Nonlinear Parametric Model".

Language: Python | Stargazers: 572 | Issues: 0

durian-pytorch

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.

Language: Python | License: BSD-3-Clause | Stargazers: 181 | Issues: 0

stylegan

StyleGAN - Official TensorFlow Implementation

Language: Python | License: NOASSERTION | Stargazers: 13983 | Issues: 0

g2pm

A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset

Language: Python | License: Apache-2.0 | Stargazers: 331 | Issues: 0

TransformerTTS

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

Language: Python | License: NOASSERTION | Stargazers: 1107 | Issues: 0

vid2vid

Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.

Language: Python | License: NOASSERTION | Stargazers: 8511 | Issues: 0

essentia

C++ library for audio and music analysis, description and synthesis, including Python bindings

Language: C++ | License: AGPL-3.0 | Stargazers: 2729 | Issues: 0

LipSync

LipSync for Unity3D: generates mouth-shape animation from speech; supports FMOD

Language: CMake | License: MIT | Stargazers: 384 | Issues: 0

Language: Python | Stargazers: 3 | Issues: 0

WaveGlow

A tensorflow implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

Language: Python | Stargazers: 19 | Issues: 0

first-order-model

This repository contains the source code for the paper First Order Motion Model for Image Animation

Language: Jupyter Notebook | License: MIT | Stargazers: 14262 | Issues: 0

google-research

Google Research

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 33162 | Issues: 0

Tacotron-2

Tensorflow implementation of DeepMind's Tacotron-2 (without wavenet)

Language: Python | License: MIT | Stargazers: 11 | Issues: 0

athena

an open-source implementation of sequence-to-sequence based speech processing engine

Language: C++ | License: Apache-2.0 | Stargazers: 945 | Issues: 0

athena

An automation platform with a plugin architecture that allows you to easily create and share services.

Language: Shell | License: Apache-2.0 | Stargazers: 91 | Issues: 0

avatarify-python

Avatars for Zoom, Skype and other video-conferencing apps.

Language: Python | License: NOASSERTION | Stargazers: 16133 | Issues: 0

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Language: Python | License: MIT | Stargazers: 1230 | Issues: 0

spleeter-as-a-service

API implementation of Song Source spleeting from Spleeter by Deezer

Language: Python | License: MIT | Stargazers: 12 | Issues: 0

GST_Tacotron

Implementation of Global Style Token Tacotron in TensorFlow2

Language: Python | License: MIT | Stargazers: 25 | Issues: 0

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 7992 | Issues: 0