户建坤's starred repositories
tensorflow
An Open Source Machine Learning Framework for Everyone
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
Focal-Loss-Pytorch
全中文注释.(The loss function of retinanet based on pytorch).(You can use it on one-stage detection task or classifical task, to solve data imbalance influence).用于one-stage目标检测算法,提升检测效果.你也可以在分类任务中使用该损失函数,解决数据不平衡问题.
streamlit-audio-recorder
Record Audio from the User's Microphone in Apps that are Deployed to the Web. (via Browser Media-API, REACT-based, Streamlit Custom Component)
Speech-enhancement
Deep neural network based speech enhancement toolkit
Datadriven-GPVAD
The codebase for Data-driven general-purpose voice activity detection.
AI_beatmap_generator
尝试使用神经网络生成音乐游戏Malody的谱面。
ram_modified
"Recurrent Models of Visual Attention" in TensorFlow
sound_event_detection
🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
mica-speech-activity-detection
Robust Speech Activity Detection (SAD) in movie audio
DomainAdversarialVoiceActivityDetection
Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"
audio_augment
A tool/script for batch speech data enhancement with speed/volume/RIRS/MUSAN
musan_investigation_cnn_rnn
Evaluation of the classification performance (Speech, Music, and Noise) of 1D (WaveNet) and 2D (MobileNet) CNN and RNN (GRU) on the MUSAN corpus.
MultiTarget_VAD
Representation of Paper: On training targets for noise-robust voice activity detection.