haoxiaoyang444

haoxiaoyang444

Geek Repo

Company:Alibaba

Location:Beijing,China

Github PK Tool:Github PK Tool

haoxiaoyang444's repositories

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

License:Apache-2.0Stargazers:0Issues:0Issues:0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

License:Apache-2.0Stargazers:0Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

License:MPL-2.0Stargazers:0Issues:0Issues:0

UniAudio

The Open Source Code of UniAudio

Stargazers:0Issues:0Issues:0

so-vits-svc-5.0

Core Engine of Singing Voice Conversion & Singing Voice Clone

License:MITStargazers:0Issues:0Issues:0

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

License:NOASSERTIONStargazers:0Issues:0Issues:0

NATSpeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

License:MITStargazers:0Issues:0Issues:0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

License:NOASSERTIONStargazers:0Issues:0Issues:0

repgan

RepVgg + HiFiGAN

Stargazers:0Issues:0Issues:0

SpecVQGAN

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

License:MITStargazers:0Issues:0Issues:0

pytorch_wavelets

Pytorch implementation of 2D Discrete Wavelet (DWT) and Dual Tree Complex Wavelet Transforms (DTCWT) and a DTCWT based ScatterNet

License:NOASSERTIONStargazers:0Issues:0Issues:0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

License:MITStargazers:0Issues:0Issues:0

multiband_melgan

An unofficial implementation of https://arxiv.org/abs/2005.05106

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0