JaesungHuh

The Easy Communications (EasyCom) dataset is a world-first dataset designed to help mitigate the *cocktail party effect* from an augmented-reality (AR) -motivated multi-sensor egocentric world view.

NOASSERTION000

ego_actrecog_analysis

Language:PythonNOASSERTION000

jaesunghuh.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptMIT000

laughter-detection

Language:PythonMIT000

voxceleb_trainer

In defence of metric learning for speaker recognition

Language:PythonMIT000

ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

MIT000

research_projectpage

000

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Apache-2.0000

TalkNet-ASD

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'

Language:PythonMIT000