zhangwq740

zhangwq740

Geek Repo

Github PK Tool:Github PK Tool

zhangwq740's repositories

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

crnn-audio-classification

UrbanSound classification using Convolutional Recurrent Networks in PyTorch

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

environmental-sound-classification

Environmental sound classification with Convolutional neural networks and the UrbanSound8K dataset.

Language:Jupyter NotebookLicense:MITStargazers:1Issues:0Issues:0

pyAudioAnalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

SER-ESC-50

pytorch - DSP - audio_classification

Language:PythonStargazers:1Issues:0Issues:0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

argus-freesound

Kaggle | 1st place solution for Freesound Audio Tagging 2019

License:MITStargazers:0Issues:0Issues:0

ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

Audio-Analysis-VAD

Different VAD algorithms using Speech features

Stargazers:0Issues:0Issues:0

catchat

A chatroom built with Flask, featured with Markdown support and code syntax highlight.

License:MITStargazers:0Issues:0Issues:0

Classification-of-Endangered-Species-using-Sound-Recognition

The main goal of this project was to build an Artificial Neural Network model with limited amount of sound data of various endangered animal species. The model can be further improved and can be used to located certain animal species in the wild.

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

DESED

Repo associated to the DESED dataset, download and creation of data

License:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

License:CC-BY-4.0Stargazers:0Issues:0Issues:0

mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

License:Apache-2.0Stargazers:0Issues:0Issues:0

mica-speech-activity-detection

Robust Speech Activity Detection (SAD) in movie audio

Stargazers:0Issues:0Issues:0

models

Models and examples built with TensorFlow

License:Apache-2.0Stargazers:0Issues:0Issues:0

pb_sed

Paderborn Sound Event Detection

License:MITStargazers:0Issues:0Issues:0

Praat_Scripts

Some basic praat scripts.

Stargazers:0Issues:0Issues:0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

License:MITStargazers:0Issues:0Issues:0

python_sound_open

语音信号处理试验教程,Python代码

License:Apache-2.0Stargazers:0Issues:0Issues:0

PythonDataScienceHandbook

Python Data Science Handbook: full text in Jupyter Notebooks

License:MITStargazers:0Issues:0Issues:0

Recorder

html5 js 录音 mp3 wav ogg webm amr 格式,支持pc和Android、ios部分浏览器、和Hybrid App(提供Android IOS App源码),微信也是支持的,提供H5版语音通话聊天示例 和DTMF编解码

License:MITStargazers:0Issues:0Issues:0

renren-fast-vue

renren-fast-vue基于vue、element-ui构建开发,实现renren-fast后台管理前端功能,提供一套更优的前端解决方案。

License:MITStargazers:0Issues:0Issues:0

sed-crnn

Single and multichannel sound event detection using convolutional recurrent neural networks. DCASE 2017 real-life sound event detection winning method.

License:NOASSERTIONStargazers:0Issues:0Issues:0

sed_eval

Evaluation toolbox for Sound Event Detection

License:MITStargazers:0Issues:0Issues:0

SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

License:MITStargazers:0Issues:0Issues:0

VAD-1

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Stargazers:0Issues:0Issues:0

youtube-8m

Starter code for working with the YouTube-8M dataset.

License:Apache-2.0Stargazers:0Issues:0Issues:0