zhaoyun630

Yun Zhao's repositories

chinese_text_normalization

Chinese text normalization for speech processing

Language:PythonMIT1 10

hybrid-multi-spk-vc

a hybrid multi-speaker voice conversion system

Apache-2.01 30

asr_dataset

The dataset of Speech Recognition

Apache-2.0000

auraloss

Collection of audio-focused loss functions in PyTorch

Language:PythonApache-2.0010

autovc

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Language:PythonMIT010

awesome-keyword-spotting

This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).

MIT000

BaiduSpider

BaiduSpider，一个爬取百度搜索结果的爬虫，目前支持百度网页搜索，百度图片搜索，百度知道搜索，百度视频搜索，百度资讯搜索，百度文库搜索，百度经验搜索和百度百科搜索。

MIT000

CDial-GPT

A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models

Language:PythonMIT010

COVID-Dialogue

Language:Python010

DeepLearning-500-questions

GPL-3.0000

Emotional-Speech-Data

This is the GitHub page for publicly available emotional speech data.

MIT010

espnet

End-to-End Speech Processing Toolkit

Apache-2.0000

FastVocoder

Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.

MIT000

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonMIT010

ICASSP2021_paper_list-VC

ICASSP 2021 accepted papers in term of voice conversion (VC)

010

kaldi-cmake

create CMakeLists.txt for kaldi

000

kaldi-onnx

Kaldi model converter to ONNX

Language:PythonApache-2.0010

Mengzi

Mengzi Pretrained Models

Apache-2.0010

MetaDialog

Platform for few-shot natural language processing: Text Classification, Sequene Labeling.

Language:Python010

models

Models and examples built with TensorFlow

Language:PythonApache-2.0010

multi-speaker-tacotron

VCTK multi-speaker tacotron for ICASSP 2020

Language:PythonBSD-3-Clause010

nnet_pytorch

Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.

000

pointer_summarizer

pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

Language:PythonApache-2.0010

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Apache-2.0000

StarGANv2-VC

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

Language:PythonMIT010

TTS

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Language:Jupyter NotebookMPL-2.0010

WaveRNN

WaveRNN Vocoder + TTS

Language:PythonMIT010

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:PythonApache-2.0010

x-vector-pytorch

Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch

Language:Python010

zhaoyun630.github.io

Language:HTML020