Cheul Young Park's repositories
K-EmoCon_SupplementaryCodes
Supplementary codes for the K-EmoCon dataset
deepo-code-server
Code-server on top of Deepo environment for quick and easy deep learning research.
AdaptiveESM
A proof-of-concept for an adaptive sampling method to collect emotions in the wild.
AutoPST
Global Rhythm Style Transfer Without Text Transcriptions
autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
companion-app
AI companions with memory: a lightweight stack to create and host your own AI companions
datasets
🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Deep-U-Net-Pytorch
Implementation of Deep U-Net in PyTorch
ssh-cuda
SSH-able CUDA server based on NVIDIA Docker image.
deepo
Deepo environment images
ffmpeg-normalize
Audio Normalization for Python/ffmpeg
KoBERT
Korean BERT pre-trained cased (KoBERT)
KoSentenceBERT-SKT
🌼 Korean SentenceBERT : Sentence Embeddings using Siamese BERT-Networks using SKT KoBERT and kakaobrain KorNLU dataset
musicinformationretrieval.com
Instructional notebooks on music information retrieval.
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
pdt-progressbot
Slack Bot for checking the number of patients registered for research.
py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
pytorch-facial_landmark_detection
PyTorch implementation of facial landmark detection for mobile devices
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity and Number Detector
SpeechSplit
Unsupervised Speech Decomposition Via Triple Information Bottleneck
TEAP
Toolbox for Emotion Analysis using Physiological signals
You-Only-Speak-Once
Deep Learning - one shot learning for speaker recognition using Filter Banks