ILJI CHOI's repositories
pytorch_sound
Sound Related Deep Learning Tasks boosting repository with pytorch
multiband_melgan
An unofficial implementation of https://arxiv.org/abs/2005.05106
audioset_augmentor
Sound augmentation using Large-scale audio dataset (Audioset)
FastSpeech2
Refactored version of https://github.com/ming024/FastSpeech2
chatgpt-streamlit
Simple demo project with OpenAI's API and TTS
SpeechInterface
A Speech Interface Toolkit for Neural Speech Synthesis
recording_studio_web
Sound Recording Studio Web Front Page
voicefixer_main
General Speech Restoration
cert-manager
Automatically provision and manage TLS certificates in Kubernetes
fastapi-azure-auth
Easy and secure implementation of Azure AD for your FastAPI APIs đź”’ Single- and multi-tenant support.
ksponspeech
Pre-processing KsponSpeech corpus (Korean Speech dataset) provided by AI Hub.
melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
metavoice-src
Foundational model for human-like, expressive TTS
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
training-operator
Training operators on Kubernetes.
voicefixer
General Speech Restoration
WavEncoderCodes
Simple repository for handling wav format file on raw (short) data in Javascript, Kotlin (will be added?)
wavenet_vocoder
WaveNet vocoder