Fu Guanyu (EricFuma)

Company: Alipay

Location: Hangzhou, China

Fu Guanyu's starred repositories

espnet

End-to-End Speech Processing Toolkit

Language: Python · License: Apache-2.0 · Stargazers: 8367 · Issues: 0
License: GPL-3.0 · Stargazers: 3 · Issues: 0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language: Python · License: MIT · Stargazers: 1922 · Issues: 0

mfa-models

Collection of pretrained models for the Montreal Forced Aligner

Language: Python · License: CC-BY-4.0 · Stargazers: 111 · Issues: 0

StructuredLM_RTDT

A library for building hierarchical text representation and corresponding downstream applications.

Language: Python · License: Apache-2.0 · Stargazers: 74 · Issues: 0

AEC-Challenge

AEC Challenge

License: MIT · Stargazers: 374 · Issues: 0

tensorflow

An Open Source Machine Learning Framework for Everyone

Language: C++ · License: Apache-2.0 · Stargazers: 185882 · Issues: 0

book-text-to-speech

A book about Text-to-Speech (TTS) in Chinese.

Language: TeX · License: Apache-2.0 · Stargazers: 581 · Issues: 0

zhvoice

Chinese voice corpus with clearer, more natural speech, comprising 8 open-source datasets, 3,200 speakers, 900 hours of speech, and 13 million characters.

Stargazers: 581 · Issues: 0

multimodal-speech-emotion

TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18

Language: Jupyter Notebook · License: MIT · Stargazers: 257 · Issues: 0

Expressive-FastSpeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Language: Python · License: NOASSERTION · Stargazers: 277 · Issues: 0

Comprehensive-Transformer-TTS

A non-autoregressive Transformer-based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modeling. This project grows with the research community, aiming to achieve the ultimate TTS.

Language: Python · License: MIT · Stargazers: 318 · Issues: 0

TTS-papers

🐸 collection of TTS papers

License: MPL-2.0 · Stargazers: 621 · Issues: 0

awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

License: MIT · Stargazers: 2960 · Issues: 0

hangzhou-house-guide

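Hangzhou home-buying guide: a walkthrough compiled from personal home-buying experience, covering the new-home purchase lottery and second-hand home selection, with extensive Hangzhou urban planning material.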

Language: JavaScript · Stargazers: 960 · Issues: 0

mandarin-tts

Chinese Mandarin text-to-speech (TTS) based on FastSpeech 2, implemented in PyTorch, using WaveGlow as the vocoder, with the Biaobei and AISHELL-3 datasets.

Language: Python · Stargazers: 462 · Issues: 0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python · License: MIT · Stargazers: 30286 · Issues: 0

CI-AVSR

Code repository for the Cantonese In-car Audio-Visual Speech Recognition (CI-AVSR) dataset.

Language: Python · License: CC0-1.0 · Stargazers: 37 · Issues: 0
Language: Python · License: BSD-3-Clause · Stargazers: 79 · Issues: 0

muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Language: Python · License: MIT · Stargazers: 4492 · Issues: 0

End-to-End-Speech-Recognition-Learning

ASR, End-to-End, end2end, Speech Recognition, end-to-end speech recognition

Stargazers: 12 · Issues: 0

soxan

Wav2Vec for speech recognition, classification, and audio classification

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 249 · Issues: 0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language: Python · License: Apache-2.0 · Stargazers: 2229 · Issues: 0

kaldi-dnn-ali-gop

Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Based on Kaldi.

Language: C++ · License: NOASSERTION · Stargazers: 220 · Issues: 0

pinyin-tapt-wav2vec2

(Re)-Pre-training Wav2Vec2 on Converting Pinyin to Chinese Characters

Language: Python · Stargazers: 3 · Issues: 0

wavenet_SR

WaveNet Speech Recognition to ARPA phonemes

Language: Python · Stargazers: 6 · Issues: 0

opencpop

Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis

Stargazers: 210 · Issues: 0

MockingBird

🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: Python · License: NOASSERTION · Stargazers: 35054 · Issues: 0

efficient_tts

PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture"

Language: Python · License: MIT · Stargazers: 115 · Issues: 0

tacotronv2_wavernn_chinese

Chinese speech synthesis with Tacotron 2 + WaveRNN (TensorFlow + PyTorch)

Language: Python · Stargazers: 519 · Issues: 0