maoxin7676's repositories
Speech-Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
AEC_DeepModel
基于深度学习的声学回声消除基线代码
algo
数据结构和算法必知必会的50个代码实现
asteroid
The PyTorch-based audio source separation toolkit for researchers || Current highlight : we got our WHAMR results check it out here !
Audio-Classification
Code for YouTube series: Deep Learning for Audio Classification
Comparison-of-Blind-Source-Separation-techniques
Compare AIRES BSS with ILRMA and AuxIVA
Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
deeplearning_ai_books
deeplearning.ai(吴恩达老师的深度学习课程笔记及资源)
DeepXi
Deep Xi: A Deep Learning Approach to A Priori SNR Estimation. Used for Speech Enhancement and robust ASR.
DLDL-v2-PyTorch
implementation of DLDL-v2
DTLN-aec
This Repostory contains the pretrained DTLN-aec model for real-time acoustic echo cancellation.
figaro
Real-time voice-changer for voice-chat, etc. Will support many different voice-filters and features in the future. 🎵
hangzhou_house_knowledge
2017年买房经历总结出来的买房购房知识分享给大家,希望对大家有所帮助。买房不易,且买且珍惜。Sharing the knowledge of buy an own house that according to the experience at hangzhou in 2017 to all the people. It's not easy to buy a own house, so I hope that it would be useful to everyone.
HRNet-Object-Detection
Object detection with multi-level representations generated from deep high-resolution representation learning (HRNetV2h).
LPCNet
Efficient neural speech synthesis
MASP
Microphone Array Speech Processing
netron
Visualizer for deep learning and machine learning models
Nonlinear-System-Identification-with-Wavelet-Discrete-Transform
Nonlinear System Identification with Wavelet Discrete Transform
PercepNet
(Work In Progress) Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech
PV_Diesel_Tool_Python
Masterprojekt KI_Betriebsstrategien PV-Diesel Generator
Sound_Localization_Algorithms
Classical algorithms of sound source localization with beamforming, TDOA and high-resolution spectral estimation.
Speech-measure-SDR-SAR-STOI-PESQ
Speech quality measure of SDR、SAR、STOI、ESTOI、PESQ via MATLAB
Speech-Separation-Paper-Tutorial
A must-read paper for speech separation based on neural networks
speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
TAC
transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.