Georgehappy1's repositories

Emotional-Speech-Data

This is the GitHub page for publicly available emotional speech data.

License:MITStargazers:1Issues:1Issues:0

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models and Score-based Models, a darkhorse in the field of Generative Models

License:MITStargazers:0Issues:1Issues:0

awesome-normalizing-flows

Awesome resources on normalizing flows.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

awesome-speech-recognition-speech-synthesis-papers

Speech synthesis, voice conversion, self-supervised learning, music generation,Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling

License:MITStargazers:0Issues:1Issues:0

book-text-to-speech

A book about Text-to-Speech (TTS) in Chinese.

Language:TeXLicense:Apache-2.0Stargazers:0Issues:1Issues:0

CharsiuG2P

Multilingual G2P in over 100 languages

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

Comprehensive-Transformer-TTS

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

CPlusPlusThings

C++那些事

Language:C++Stargazers:0Issues:1Issues:0

Crystal

Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.

License:Apache-2.0Stargazers:0Issues:0Issues:0

EA-SVC

An implement of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"

License:MITStargazers:0Issues:0Issues:0

espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

Language:CLicense:GPL-3.0Stargazers:0Issues:1Issues:0
Language:HTMLStargazers:0Issues:2Issues:0

gruut

A tokenizer, text cleaner, and phonemizer for many human languages.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

hqa

Code to accompany the paper "Hierarchical Quantized Autoencoders"

License:MITStargazers:0Issues:0Issues:0

InstanceLoc

[CVPR 2021] Instance Localization for Self-supervised Detection Pretraining

License:Apache-2.0Stargazers:0Issues:0Issues:0

leetcode-master

LeetCode 刷题攻略:配思维导图,将近200道经典算法题目刷题顺序、经典算法模板、共60w字的详细图解,以及难点视频题解。按照刷题攻略上的顺序来刷题,让你在算法学习上不再迷茫!🔥🔥给个star支持一下吧!🚀

Stargazers:0Issues:1Issues:0

LeetcodeTop

汇总各大互联网公司容易考察的高频leetcode题🔥

Stargazers:0Issues:1Issues:0

leveldb

LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.

Language:C++License:BSD-3-ClauseStargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0

NeMo

NeMo: a toolkit for conversational AI

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:1Issues:0

PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

License:MITStargazers:0Issues:0Issues:0

speech-synthesis-paper

List of speech synthesis papers.

Stargazers:0Issues:1Issues:0

SpeechSubjectiveTest

Speech (audio) subjective evaluation system

Language:PythonStargazers:0Issues:1Issues:0

TeachYourselfCS-CN

TeachYourselfCS 的中文翻译 | A Chinese translation of TeachYourselfCS

License:CC-BY-SA-4.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

ZeroSpeech

VQ-VAE for Acoustic Unit Discovery and Voice Conversion

Language:PythonStargazers:0Issues:1Issues:0

zhrtvc

Chinese real time voice cloning (VC) and Chinese text to speech (TTS). 好用的中文语音克隆兼中文语音合成系统,包含语音编码器、语音合成器、声码器和可视化模块。

Stargazers:0Issues:0Issues:0