zhangwq740's repositories
crnn-audio-classification
UrbanSound classification using Convolutional Recurrent Networks in PyTorch
environmental-sound-classification
Environmental sound classification with Convolutional neural networks and the UrbanSound8K dataset.
pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
SER-ESC-50
pytorch - DSP - audio_classification
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector
argus-freesound
Kaggle | 1st place solution for Freesound Audio Tagging 2019
ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Audio-Analysis-VAD
Different VAD algorithms using Speech features
catchat
A chatroom built with Flask, featured with Markdown support and code syntax highlight.
Classification-of-Endangered-Species-using-Sound-Recognition
The main goal of this project was to build an Artificial Neural Network model with limited amount of sound data of various endangered animal species. The model can be further improved and can be used to located certain animal species in the wild.
DESED
Repo associated to the DESED dataset, download and creation of data
DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
mica-speech-activity-detection
Robust Speech Activity Detection (SAD) in movie audio
models
Models and examples built with TensorFlow
pb_sed
Paderborn Sound Event Detection
Praat_Scripts
Some basic praat scripts.
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
python_sound_open
语音信号处理试验教程,Python代码
PythonDataScienceHandbook
Python Data Science Handbook: full text in Jupyter Notebooks
Recorder
html5 js 录音 mp3 wav ogg webm amr 格式,支持pc和Android、ios部分浏览器、和Hybrid App(提供Android IOS App源码),微信也是支持的,提供H5版语音通话聊天示例 和DTMF编解码
renren-fast-vue
renren-fast-vue基于vue、element-ui构建开发,实现renren-fast后台管理前端功能,提供一套更优的前端解决方案。
sed-crnn
Single and multichannel sound event detection using convolutional recurrent neural networks. DCASE 2017 real-life sound event detection winning method.
sed_eval
Evaluation toolbox for Sound Event Detection
SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
VAD-1
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
youtube-8m
Starter code for working with the YouTube-8M dataset.