makabakas's repositories
Binaural-Source-Localization-CNN
A Deep Convolutional Neural Network (DCNN) designed for the task of localizing human speech to 168 location classes using binaural microphone inputs.
DaNet-Tensorflow
Tensorflow implementation of "Speaker-independent Speech Separation with Deep Attractor Network"
deep-clustering
A tensorflow implementation for Deep clustering: Discriminative embeddings for segmentation and separation
deep_complex_networks
Implementation related to the Deep Complex Networks
DPMM-Clustering
Java implementation of Dirichlet Process Mixture Model.
HumHum
a project of digital video and audio processing
messl
Model-based EM Source Separation and Localization
Multi-channel-speech-extraction-using-DNN
A tensorflow implementation of my paper Combining beamforming and deep neural networks for multi-channel speech extraction
Multi-User-Transmit-Beamforming-Linear-Regression-Convex-Optimization-Tutorial
In this work, we use convex optimization package in MATLAB to implement multi-user transmit beamforming problem and linear regression. This is the homework 2 of ELEC 5470 Convex Optimization, HKUST.
News_Spark
基于Spark2.x新闻网大数据实时分析可视化系统项目
nn-gev
Neural network supported GEV beamformer
phd-thesis
Hagen Wierstorf - Perceptual Assessment of Sound Field Synthesis, PhD thesis, TU Berlin
php-sdhumming
PHP binding for SDHumming
sherpa-onnx
Real-time speech recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin
speech_feature_extractor
Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplitude Modulation Spectrum(AMS) and so on.
Spherical-Array-Processing
A collection of MATLAB routines for acoustical array processing on spherical harmonic signals, commonly captured with a spherical microphone array.
Voice-Identification
Project to explore Speaker and Voice Identification. To follow will be further Speech Recognition tasks.
WavLoc
End-to-End binaural sound localization