EricFuma

followers

following

stars

AliPay

HangZhou, China

Fu Guanyu's starred repositories

espnet

End-to-End Speech Processing Toolkit

Language:PythonApache-2.0836700

Demo-of-Text-to-Speech-based-on-Deep-Learning

text to speech for mandarin,

GPL-3.0300

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonMIT192200

mfa-models

Collection of pretrained models for the Montreal Forced Aligner

Language:PythonCC-BY-4.011100

StructuredLM_RTDT

A library for building hierarchical text representation and corresponding downstream applications.

Language:PythonApache-2.07400

AEC-Challenge

AEC Challenge

MIT37400

tensorflow

An Open Source Machine Learning Framework for Everyone

Language:C++Apache-2.018588200

book-text-to-speech

A book about Text-to-Speech (TTS) in Chinese.

Language:TeXApache-2.058100

zhvoice

Chinese voice corpus. 中文语音语料，语音更加清晰自然，包含8个开源数据集，3200个说话人，900小时语音，1300万字。

multimodal-speech-emotion

TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18

Language:Jupyter NotebookMIT25700

Expressive-FastSpeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Language:PythonNOASSERTION27700

Comprehensive-Transformer-TTS

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS

Language:PythonMIT31800

TTS-papers

🐸 collection of TTS papers

MPL-2.062100

awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

MIT296000

hangzhou-house-guide

杭州购房指南，根据个人购房经历，总结而成的一篇买房攻略，涉及新房摇号和二手房选购，包含大量杭州城市规划资料。

Language:JavaScript96000

mandarin-tts

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets

Language:Python46200

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT3028600

CI-AVSR

Code repository for the Cantonese In-car Audio-Visual Speech Recognition (CI-AVSR) dataset.

Language:PythonCC0-1.03700

mos-finetune-ssl

Language:PythonBSD-3-Clause7900

muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Language:PythonMIT449200

End-to-End-Speech-Recognition-Learning

ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别

1200

soxan

Wav2Vec for speech recognition, classification, and audio classification

Language:Jupyter NotebookApache-2.024900

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language:PythonApache-2.0222900

kaldi-dnn-ali-gop

Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Bases on Kaldi.

Language:C++NOASSERTION22000

pinyin-tapt-wav2vec2

(Re)-Pre-training Wav2Vec2 on Converting Pinyin to Chinese Characters

Language:Python300

wavenet_SR

WaveNet Speech Recognition to ARRPA phonemes

Language:Python600

opencpop

Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonNOASSERTION3505400

efficient_tts

Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"

Language:PythonMIT11500

tacotronv2_wavernn_chinese

tacotronV2 + wavernn 实现中文语音合成(Tensorflow + pytorch)

Language:Python51900