maoxin7676's repositories

Speech-Transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Language:PythonStargazers:1Issues:1Issues:0

AEC_DeepModel

基于深度学习的声学回声消除基线代码

Language:PythonStargazers:0Issues:0Issues:0

algo

数据结构和算法必知必会的50个代码实现

Language:CLicense:Apache-2.0Stargazers:0Issues:0Issues:0

asteroid

The PyTorch-based audio source separation toolkit for researchers || Current highlight : we got our WHAMR results check it out here !

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Audio-Classification

Code for YouTube series: Deep Learning for Audio Classification

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

Comparison-of-Blind-Source-Separation-techniques

Compare AIRES BSS with ILRMA and AuxIVA

Language:MATLABLicense:MITStargazers:0Issues:0Issues:0

Conv-TasNet

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

Language:PythonStargazers:0Issues:0Issues:0

deeplearning_ai_books

deeplearning.ai(吴恩达老师的深度学习课程笔记及资源)

Language:HTMLStargazers:0Issues:0Issues:0

DeepXi

Deep Xi: A Deep Learning Approach to A Priori SNR Estimation. Used for Speech Enhancement and robust ASR.

License:MPL-2.0Stargazers:0Issues:0Issues:0

DLDL-v2-PyTorch

implementation of DLDL-v2

Language:PythonStargazers:0Issues:1Issues:0

DTLN-aec

This Repostory contains the pretrained DTLN-aec model for real-time acoustic echo cancellation.

License:MITStargazers:0Issues:0Issues:0

figaro

Real-time voice-changer for voice-chat, etc. Will support many different voice-filters and features in the future. 🎵

License:GPL-3.0Stargazers:0Issues:0Issues:0

hangzhou_house_knowledge

2017年买房经历总结出来的买房购房知识分享给大家,希望对大家有所帮助。买房不易,且买且珍惜。Sharing the knowledge of buy an own house that according to the experience at hangzhou in 2017 to all the people. It's not easy to buy a own house, so I hope that it would be useful to everyone.

Language:CSSStargazers:0Issues:0Issues:0

HRNet-Object-Detection

Object detection with multi-level representations generated from deep high-resolution representation learning (HRNetV2h).

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

LPCNet

Efficient neural speech synthesis

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

MASP

Microphone Array Speech Processing

License:MITStargazers:0Issues:0Issues:0

netron

Visualizer for deep learning and machine learning models

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

Nonlinear-System-Identification-with-Wavelet-Discrete-Transform

Nonlinear System Identification with Wavelet Discrete Transform

License:GPL-3.0Stargazers:0Issues:0Issues:0

PercepNet

(Work In Progress) Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

PV_Diesel_Tool_Python

Masterprojekt KI_Betriebsstrategien PV-Diesel Generator

Stargazers:0Issues:0Issues:0
License:BSD-3-ClauseStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

Sound_Localization_Algorithms

Classical algorithms of sound source localization with beamforming, TDOA and high-resolution spectral estimation.

Language:MATLABStargazers:0Issues:0Issues:0

Speech-measure-SDR-SAR-STOI-PESQ

Speech quality measure of SDR、SAR、STOI、ESTOI、PESQ via MATLAB

Stargazers:0Issues:0Issues:0

Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

Stargazers:0Issues:0Issues:0

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Stargazers:0Issues:0Issues:0

TAC

transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.

Stargazers:0Issues:0Issues:0

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

License:Apache-2.0Stargazers:0Issues:0Issues:0