Peng Zhang's repositories
AEC-Challenge
AEC Challenge
Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
DeepXi
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
dlib
A toolkit for making real world machine learning and data analysis applications in C++
EMGFilters
Filter functions for processing EMG signals.
fast_bss_eval
A fast implementation of bss_eval metrics for blind source separation
FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
gpuRIR
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
libfacedetection
An open source library for face detection in images. The face detection speed can reach 1000FPS.
LipNet-PyTorch
The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxiv.org/abs/1611.01599)
ML-NLP
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。
MTAdam
MTAdam: Automatic Balancing of Multiple Training Loss Terms
MTFAA-Net
Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement
Neural-Speech-Dereverberation
Machine and Deep Learning models for speech dereverberation
pedalboard
🎛 🔊 A Python library for adding effects to audio.
PseudoBinaural_CVPR2021
Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)
pyaec
simple and efficient python implemention of a series of adaptive filters. including time domain adaptive filters(lms、nlms、rls、ap、kalman)、nonlinear adaptive filters(volterra filter、functional link adaptive filters)、frequency domain adaptive filters(frequency domain adaptive filter、frequency domain kalman filter) for acoustic echo cancellation.
pysepm
Python implementation of performance metrics in Loizou's Speech Enhancement book
pytorch-revgrad
A minimal pytorch package implementing a gradient reversal layer.
s3prl
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
sEMG_DeepLearning
sEMG-based gesture recognition using deep learnig
solo-learn
solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning
SoundSourceSeparation
The code for multi-channel source separation and dereverberation such as FastMNMF1, FastMNMF2, and AR-FastMNMF2.
speechbrain
A PyTorch-based Speech Toolkit
speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
VisualVoice
Audio-Visual Speech Separation with Cross-Modal Consistency
voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system
ZQCNN
一款比mini-caffe更快的Forward库,觉得好用请点星啊,400星公布快速人脸检测模型,500星公布106点landmark,600星公布人头检测模型,700星公布人脸检测套餐(六种pnet,两种rnet随意混合使用满足各种速度/精度要求),800星公布更准的106点模型