Eom SooHwan's repositories
beer
Bayesian spEEch Recognizer
CCFDM1
CCFDM reinforcement learning
CPC_audio
An implementation of the Contrastive Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
DSDA
Dual-scale Doppler Attention for Human Identification
HEAR
HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue, EMNLP 2023 (long, findings). [STARLAB] Audio enhancement for a video dialogue system
lad
Detects BPPV disorders characterized by beating and torsional movements of the eyes
SoftGroup
[CVPR 2022 Oral] SoftGroup for Instance Segmentation on 3D Point Clouds
speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.
StarLab-Dialogue-System
Video-based AI dialogue system
Video-Scene-Complexity-Estimation
[STARLAB] A system for estimating scene complexity in video
VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
WMRN
Weakly-Supervised Moment Retrieval Network for Video Corpus Moment Retrieval