shen912zzz

shen912zzz's starred repositories

google-research

Google Research

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 33449 | Issues: 749 | Issues: 1216

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 8128 | Issues: 177 | Issues: 2328

ESC-50

ESC-50: Dataset for Environmental Sound Classification

Language: Python | License: NOASSERTION | Stargazers: 1305 | Issues: 31 | Issues: 11

VoiceprintRecognition-Pytorch

This project uses a variety of advanced voiceprint recognition models, including EcapaTdnn, ResNetSE, ERes2Net, and CAM++; more models may be supported in the future. It also supports the MelSpectrogram and Spectrogram data preprocessing methods.

Language: Python | License: Apache-2.0 | Stargazers: 698 | Issues: 8 | Issues: 65

pyloudnorm

Flexible audio loudness meter in Python with an implementation of the ITU-R BS.1770-4 loudness algorithm

Language: Python | License: MIT | Stargazers: 606 | Issues: 14 | Issues: 36

SAB-cnn-audio-denoiser

TensorFlow 2.0 implementation of the paper "A Fully Convolutional Neural Network for Speech Enhancement"

Language: Jupyter Notebook | Stargazers: 251 | Issues: 10 | Issues: 15

MoSQITo

MoSQITo is a unified and modular development framework of key sound quality metrics. It favors reproducible science and efficient shared scripting among the community of engineers, teachers, and researchers.

Language: Python | License: Apache-2.0 | Stargazers: 125 | Issues: 13 | Issues: 46

audio_dataset_screener

An auxiliary tool for manual screening of audio datasets.

Language: C# | License: GPL-3.0 | Stargazers: 106 | Issues: 2 | Issues: 3

audio_dataset_vpr

A voiceprint recognition classifier for audio datasets

Language: Python | License: GPL-3.0 | Stargazers: 82 | Issues: 2 | Issues: 2

Urbansound8k

Sound classification using Librosa, ffmpeg, CNN, Keras, XGBoost, and Random Forest.

Language: Jupyter Notebook | Stargazers: 65 | Issues: 2 | Issues: 0

Audio-Classification-using-CNN-MLP

Multi-class audio classification using deep learning (MLP, CNN). The objective of this project is to build a multi-class classifier that identifies the sound of a bee, a cricket, or noise.

STgram-MFN

A spectro-temporal fusion feature, STgram, with MobileFaceNet for more stable anomalous sound detection

Language: Python | Stargazers: 58 | Issues: 0 | Issues: 0

environmental-sound-classification

Environmental sound classification with convolutional neural networks and the UrbanSound8K dataset.

Language: Jupyter Notebook | License: MIT | Stargazers: 53 | Issues: 1 | Issues: 2

SQAT

SQAT is an open-source repository of MATLAB code that implements key metrics for quantitative sound quality analysis.

Language: MATLAB | License: NOASSERTION | Stargazers: 35 | Issues: 5 | Issues: 3

Soundscapy

A Python library for soundscape assessments

Language: Python | License: BSD-3-Clause | Stargazers: 34 | Issues: 4 | Issues: 21

SciDataTool

SciDataTool is an open-source Python package for scientific data handling. Its objective is to provide a user-friendly, unified, and flexible module for postprocessing any kind of signal. It is meant to be used by researchers, R&D engineers, and teachers in any scientific area. The package makes it possible to efficiently store data fields in the time/space domain or in the frequency domain, to easily perform Fourier transforms, to extract slices, to convert units, to compare several fields, and so on. As a result, plot commands become simpler.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 26 | Issues: 8 | Issues: 40

human-voice-detection

Binary classification problem that aims to classify human voices from audio recordings. Implemented using PyTorch and Librosa.

Language: Python | Stargazers: 26 | Issues: 0 | Issues: 0

pyAudioKits

Powerful Python audio workflow support based on librosa and other libraries

Language: Jupyter Notebook | License: MIT | Stargazers: 24 | Issues: 0 | Issues: 0

Add_noise_and_rir_to_speech

The purpose of this code base is to add noise from the MUSAN dataset to a clean speech signal at a specified signal-to-noise ratio, and to generate far-field speech data using room impulse responses from the BUT Speech@FIT Reverb Database.

Language: Python | License: MIT | Stargazers: 22 | Issues: 1 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 22 | Issues: 3 | Issues: 2

TWFR-GMM

Time-weighted Frequency Domain Audio Representation (TWFR) with GMM Estimator for Anomalous Sound Detection

Language: Python | License: MIT | Stargazers: 18 | Issues: 0 | Issues: 0

bird_audio_detection_challenge

DenseNets for the detection of singing birds in audio files

Language: Python | License: NOASSERTION | Stargazers: 17 | Issues: 5 | Issues: 2

dcase2022

Submission for task 2 "Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques" of the DCASE challenge 2022 (https://dcase.community/challenge2022/task-unsupervised-anomalous-sound-detection-for-machine-condition-monitoring).

Language: Python | License: GPL-3.0 | Stargazers: 12 | Issues: 1 | Issues: 3

sub-cluster-AdaCos

Accompanying code for the paper Sub-Cluster AdaCos: Learning Representations for Anomalous Sound Detection.

Language: Python | License: GPL-3.0 | Stargazers: 10 | Issues: 0 | Issues: 0

multimodal-dl-framework

An extensible PyTorch framework for experimenting with neural-network-based deep learning algorithms on multiple data modalities for binary classification.

Language: Python | License: MIT | Stargazers: 8 | Issues: 4 | Issues: 5

Human-Pose-Estimation---Motion-Capture-Device

Inertial Human Motion Capture Device - Submodule with GY-87 for Pose Data Acquisition, ESP32 for Pose Estimation, and UDP Connection to PC for Pose Reconstruction

Language: C# | License: GPL-3.0 | Stargazers: 5 | Issues: 0 | Issues: 0

SSDPT

Code for SSDPT: Self-Supervised Dual-Path Transformer for Anomalous Sound Detection

Language: Python | Stargazers: 5 | Issues: 0 | Issues: 0

Animal-Sound-Classifier-using-Watson-Studio

Build classification models using IBM Watson Studio to predict (identify) animal sounds. Learn how to best gather and prepare data, create and deploy models, deploy and test a signal processing application, create models with binary classifications, and display the predictions on a web page created using Node-RED.

Language: Python | License: Apache-2.0 | Stargazers: 3 | Issues: 0 | Issues: 0

AudioEventLabeller

EchoMarks: Dataset Annotation for Audio Event Detection

Language: Python | Stargazers: 2 | Issues: 0 | Issues: 0