Georgehappy1


Georgehappy1's repositories

Emotional-Speech-Data

This is the GitHub page for publicly available emotional speech data.

License: MIT

LLTs_VC


Awesome-Diffusion-Models

A collection of resources and papers on diffusion models and score-based models, a dark horse in the field of generative models.

License: MIT

awesome-normalizing-flows

Awesome resources on normalizing flows.

Language: Python | License: MIT

awesome-speech-recognition-speech-synthesis-papers

Speech synthesis, voice conversion, self-supervised learning, music generation, automatic speech recognition, speaker verification, and language modeling.

License: MIT

book-text-to-speech

A book about Text-to-Speech (TTS) in Chinese.

Language: TeX | License: Apache-2.0

CharsiuG2P

Multilingual G2P in over 100 languages

Language: Jupyter Notebook | License: MIT

svcdemo


Comprehensive-Transformer-TTS

A non-autoregressive Transformer-based text-to-speech system, supporting a family of SOTA transformers with supervised and unsupervised duration modeling. This project grows with the research community, aiming to achieve the ultimate TTS.

Language: Python | License: MIT

CPlusPlusThings

All about C++.

Language: C++

Crystal

Crystal: a C++ implementation of a unified framework for a multilingual TTS synthesis engine, with the SSML specification as its interface.

Language: C++ | License: Apache-2.0

EA-SVC

An implementation of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training".

Language: Python | License: MIT

espeak-ng

eSpeak NG is an open-source speech synthesizer that supports more than a hundred languages and accents.

Language: C | License: GPL-3.0

Georgehappy1.github.io

Language: HTML

gruut

A tokenizer, text cleaner, and phonemizer for many human languages.

Language: Python | License: MIT

hqa

Code to accompany the paper "Hierarchical Quantized Autoencoders"

License: MIT

InstanceLoc

[CVPR 2021] Instance Localization for Self-supervised Detection Pretraining

License: Apache-2.0

leetcode-master

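A LeetCode problem-solving guide: mind maps, a recommended order for working through nearly 200 classic algorithm problems, classic algorithm templates, about 600,000 characters of detailed illustrated explanations, and video walkthroughs of the difficult points. Follow the order in the guide so that you no longer feel lost when learning algorithms! 🔥🔥 Give it a star to show your support! 🚀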


LeetcodeTop

A collection of the high-frequency LeetCode problems most often asked by major internet companies 🔥


leveldb

LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.

Language: C++ | License: BSD-3-Clause

lzh1.github.io

pages

Language: HTML

NeMo

NeMo: a toolkit for conversational AI

Language: Jupyter Notebook | License: Apache-2.0

PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Language: Python | License: Apache-2.0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

License: MIT

speech-synthesis-paper

List of speech synthesis papers.


SpeechSubjectiveTest

Speech (audio) subjective evaluation system

Language: Python

TeachYourselfCS-CN

A Chinese translation of TeachYourselfCS

License: CC-BY-SA-4.0

wespeaker

Language: Python

ZeroSpeech

VQ-VAE for Acoustic Unit Discovery and Voice Conversion

Language: Python

zhrtvc

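Chinese real-time voice cloning (VC) and Chinese text-to-speech (TTS). An easy-to-use Chinese voice cloning and speech synthesis system that includes a speech encoder, a speech synthesizer, a vocoder, and a visualization module.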
