SangwonSUH (Simon)'s repositories
realtime_YAMNET
Simple real-time Sound Event Detector based on YAMNet and pyaudio.
audioset-downloader
Python downloader for Google AudioSet with `youtube_dl`
listenable_heatmap
Listenable explanation of heatmap in ASC task
Language:HTML000
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:PythonApache-2.0000
pt
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:PythonNOASSERTION000
VoxTube
The VoxTube dataset official repository
NOASSERTION000