AppleHolic

followers

following

stars

Supertone Inc.

South Korea

https://www.linkedin.com/in/choiilji

Organizations

supertone-inc

ILJI CHOI's repositories

pytorch_sound

Sound Related Deep Learning Tasks boosting repository with pytorch

Language:PythonBSD-2-Clause84 2 16

multiband_melgan

An unofficial implementation of https://arxiv.org/abs/2005.05106

Language:PythonMIT45 4 5

audioset_augmentor

Sound augmentation using Large-scale audio dataset (Audioset)

Language:Python44 1 3

FastSpeech2

Refactored version of https://github.com/ming024/FastSpeech2

Language:PythonMIT13 2 2

chatgpt-streamlit

Simple demo project with OpenAI's API and TTS

Language:Python12 1 1

SpeechInterface

A Speech Interface Toolkit for Neural Speech Synthesis

Language:PythonMIT2 3 4

music_source_separation

Language:PythonNOASSERTION1 10

recording_studio_web

Sound Recording Studio Web Front Page

Language:VueBSD-2-Clause1 2 4

voicefixer_main

General Speech Restoration

Language:PythonAGPL-3.01 10

Appleholic

010

audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Language:PythonBSD-2-Clause010

AudioCLIP

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

Language:Python010

cert-manager

Automatically provision and manage TLS certificates in Kubernetes

Language:GoApache-2.0000

dvector

Speaker embedding (d-vector) trained with GE2E loss

Language:Python010

fastapi-azure-auth

Easy and secure implementation of Azure AD for your FastAPI APIs 🔒 Single- and multi-tenant support.

Language:PythonMIT010

grpc-vpn

:mushroom: VPN supporting authentication such as Google OpenID Connect or AWS IAM ..., over GRPC. :shipit:

Language:GoMIT010

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonMIT010

jsii

jsii allows code in any language to naturally interact with JavaScript classes. It is the technology that enables the AWS Cloud Development Kit to deliver polyglot libraries from a single codebase!

Language:TypeScriptApache-2.0010

ksponspeech

Pre-processing KsponSpeech corpus (Korean Speech dataset) provided by AI Hub.

Language:PythonMIT010

kubeflow

Machine Learning Toolkit for Kubernetes

Language:JsonnetApache-2.0010

melgan

MelGAN implementation with Multi-Band and Full Band supports...

Language:Jupyter NotebookBSD-3-Clause010

melgan-neurips

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

Language:PythonMIT010

metavoice-src

Foundational model for human-like, expressive TTS

Apache-2.0000

norbert

Painless Wiener filters for audio separation

Language:PythonMIT010

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Language:Jupyter NotebookMIT010

seewav

Audio waveform visualisation, converts any audio to a nice video

Language:PythonUnlicense010

training-operator

Training operators on Kubernetes.

Language:PythonApache-2.0010

voicefixer

General Speech Restoration

Language:PythonMIT010

WavEncoderCodes

Simple repository for handling wav format file on raw (short) data in Javascript, Kotlin (will be added?)

Language:Kotlin020

wavenet_vocoder

WaveNet vocoder

Language:PythonNOASSERTION010