Sangeon Yong's repositories
phoneme-informed-note-level-singing-transcription
A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023
CRNN-Phoneme-Recognizer
CRNN Phoneme Recognizer trained with TIMIT
CSD_reannotation
Re-annotation for CSD dataset for singing transcription
phoneme-informed-transcription-blog
GitHub page for A PHONEME-INFORMED NEURAL NETWORK MODEL FOR NOTE-LEVEL SINGING TRANSCRIPTION
emotiontts_open_db
로봇의 감정 및 개성을 표현할 수 있는 대화형 음성합성 오픈소스 플랫폼
golang-echo-realworld-example-app
Exemplary real world backend API built with Golang + Echo
jdcnet-pytorch
pytorch implementation of JDCNet, singing voice detection and classification network
librosa
Python library for audio and music analysis
MusicYOLO
MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.
onsets-and-frames
A Pytorch implementation of Onsets and Frames (Hawthorne 2018)
pytorch_memory_leak_test
memory leak test for pytorch
seyong92.github.io
A homepage for Sangeon Yong.
Skipping-The-Frame-Level
A simple yet effective Audio-to-Midi Automatic Piano Transcription system
waveform-playlist
Multitrack Web Audio editor and player with canvas waveform preview. Set cues, fades and shift multiple tracks in time. Record audio tracks or provide audio annotations. Export your mix to AudioBuffer or WAV! Project inspired by Audacity.
whisper
Robust Speech Recognition via Large-Scale Weak Supervision