Peng Zhang (Pzhang266)

Pzhang266

Geek Repo

Company:Institute of Automation Chinese Academy of Sciences (CASIA)

Location:China Beijing

Github PK Tool:Github PK Tool

Peng Zhang's repositories

AEC-Challenge

AEC Challenge

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Awesome-Speech-Enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

Language:MATLABLicense:MITStargazers:0Issues:0Issues:0
Language:HTMLLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DeepXi

Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.

Language:MATLABLicense:MPL-2.0Stargazers:0Issues:0Issues:0

dlib

A toolkit for making real world machine learning and data analysis applications in C++

Language:C++License:BSL-1.0Stargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

EMGFilters

Filter functions for processing EMG signals.

License:NOASSERTIONStargazers:0Issues:0Issues:0

fast_bss_eval

A fast implementation of bss_eval metrics for blind source separation

License:MITStargazers:0Issues:0Issues:0

FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

License:MITStargazers:0Issues:0Issues:0

gpuRIR

Python library for Room Impulse Response (RIR) simulation with GPU acceleration

Language:CudaLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

libfacedetection

An open source library for face detection in images. The face detection speed can reach 1000FPS.

License:NOASSERTIONStargazers:0Issues:0Issues:0

LipNet-PyTorch

The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxiv.org/abs/1611.01599)

Stargazers:0Issues:0Issues:0

ML-NLP

此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。

Stargazers:0Issues:0Issues:0

MTAdam

MTAdam: Automatic Balancing of Multiple Training Loss Terms

Stargazers:0Issues:0Issues:0

MTFAA-Net

Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement

Stargazers:0Issues:0Issues:0

Neural-Speech-Dereverberation

Machine and Deep Learning models for speech dereverberation

License:GPL-3.0Stargazers:0Issues:0Issues:0

pedalboard

🎛 🔊 A Python library for adding effects to audio.

License:GPL-3.0Stargazers:0Issues:0Issues:0

PseudoBinaural_CVPR2021

Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)

License:CC-BY-4.0Stargazers:0Issues:0Issues:0

pyaec

simple and efficient python implemention of a series of adaptive filters. including time domain adaptive filters(lms、nlms、rls、ap、kalman)、nonlinear adaptive filters(volterra filter、functional link adaptive filters)、frequency domain adaptive filters(frequency domain adaptive filter、frequency domain kalman filter) for acoustic echo cancellation.

License:Apache-2.0Stargazers:0Issues:0Issues:0

pysepm

Python implementation of performance metrics in Loizou's Speech Enhancement book

License:GPL-3.0Stargazers:0Issues:0Issues:0

pytorch-revgrad

A minimal pytorch package implementing a gradient reversal layer.

License:MITStargazers:0Issues:0Issues:0

s3prl

Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

sEMG_DeepLearning

sEMG-based gesture recognition using deep learnig

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

solo-learn

solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning

License:MITStargazers:0Issues:0Issues:0

SoundSourceSeparation

The code for multi-channel source separation and dereverberation such as FastMNMF1, FastMNMF2, and AR-FastMNMF2.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

speechbrain

A PyTorch-based Speech Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

License:MITStargazers:0Issues:0Issues:0

VisualVoice

Audio-Visual Speech Separation with Cross-Modal Consistency

License:NOASSERTIONStargazers:0Issues:0Issues:0

voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Stargazers:0Issues:0Issues:0

ZQCNN

一款比mini-caffe更快的Forward库,觉得好用请点星啊,400星公布快速人脸检测模型,500星公布106点landmark,600星公布人头检测模型,700星公布人脸检测套餐(六种pnet,两种rnet随意混合使用满足各种速度/精度要求),800星公布更准的106点模型

License:MITStargazers:0Issues:0Issues:0