Yun Zhao (zhaoyun630)

Company: CloudWalk

Location: Shanghai, China

Home Page: https://zhaoyun630.github.io/

Yun Zhao's repositories

chinese_text_normalization

Chinese text normalization for speech processing

Language: Python, License: MIT, Stargazers: 1, Issues: 1, Issues: 0

hybrid-multi-spk-vc

A hybrid multi-speaker voice conversion system

License: Apache-2.0, Stargazers: 1, Issues: 3, Issues: 0

asr_dataset

Dataset for speech recognition

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

auraloss

Collection of audio-focused loss functions in PyTorch

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

autovc

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

awesome-keyword-spotting

This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

BaiduSpider

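BaiduSpider, a crawler that scrapes Baidu search results. It currently supports Baidu web search, Baidu image search, Baidu Zhidao (Q&A) search, Baidu video search, Baidu news search, Baidu Wenku (document library) search, Baidu Jingyan (how-to) search, and Baidu Baike (encyclopedia) search.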

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

CDial-GPT

A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models

License: MIT, Stargazers: 0, Issues: 0, Issues: 0
Language: Python, Stargazers: 0, Issues: 1, Issues: 0

DeepLearning-500-questions

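Deep Learning 500 Questions: explains frequently asked questions on probability, linear algebra, machine learning, deep learning, computer vision, and other hot topics in question-and-answer form, to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly asked to point out any errors. To be continued... For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be pursued. Tan 2018.06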

License: GPL-3.0, Stargazers: 0, Issues: 0, Issues: 0

Emotional-Speech-Data

This is the GitHub page for publicly available emotional speech data.

License: MIT, Stargazers: 0, Issues: 1, Issues: 0

espnet

End-to-End Speech Processing Toolkit

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

FastVocoder

Includes Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN; NHV may be added in the future.

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

ICASSP2021_paper_list-VC

ICASSP 2021 accepted papers on voice conversion (VC)

Stargazers: 0, Issues: 0, Issues: 0

kaldi-cmake

Create CMakeLists.txt for Kaldi

Stargazers: 0, Issues: 0, Issues: 0

kaldi-onnx

Kaldi model converter to ONNX

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

Mengzi

Mengzi Pretrained Models

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

MetaDialog

Platform for few-shot natural language processing: Text Classification, Sequence Labeling.

Language: Python, Stargazers: 0, Issues: 1, Issues: 0

models

Models and examples built with TensorFlow

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

multi-speaker-tacotron

VCTK multi-speaker tacotron for ICASSP 2020

License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0

nnet_pytorch

Kaldi-style neural network training in PyTorch, for use in place of nnet3 in Kaldi.

Stargazers: 0, Issues: 0, Issues: 0

pointer_summarizer

PyTorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

StarGANv2-VC

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

TTS

Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Language: Jupyter Notebook, License: MPL-2.0, Stargazers: 0, Issues: 1, Issues: 0

WaveRNN

WaveRNN Vocoder + TTS

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

x-vector-pytorch

Implementation of the paper "Spoken Language Recognition using X-vectors" in PyTorch

Stargazers: 0, Issues: 0, Issues: 0
Language: HTML, Stargazers: 0, Issues: 2, Issues: 0