Nitin's repositories

SpeechEnhancement

Combining Weighted Multi-resolution STFT Loss and Distance Fusion to Optimize Speech Enhancement Generative Adversarial Networks

Language: Python | Stargazers: 57 | Issues: 2 | Issues: 0

SpeechEnhancement-Model-s-Deployment

Deployment of the SpeechEnhancement model in C++

Language: C++ | License: GPL-3.0 | Stargazers: 3 | Issues: 0 | Issues: 0

Voice-Preprocessing-Toolkit

A collection of voice preprocessing tools

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

de-ess

De-essing software to reduce sibilance in speech

Language: C | Stargazers: 0 | Issues: 1 | Issues: 0

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

DSP_SibilanceDetection

A DSP algorithm designed to detect sibilance

Language: MATLAB | Stargazers: 0 | Issues: 0 | Issues: 0

fdndlp

A speech dereverberation algorithm, also known as WPE (weighted prediction error)

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

hrbeuthesis

LaTeX thesis template for bachelor's, master's, and doctoral degrees at Harbin Engineering University

Language: TeX | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

nara_wpe

Dereverberation

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

noisereduce

Noise reduction in Python using spectral gating (speech, bioacoustics, audio, time-domain signals)

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

P.808

This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

reverb-algorithms

A set of scripts implementing popular reverberation audio effect algorithms.

Stargazers: 0 | Issues: 0 | Issues: 0

reverse-interview

Questions to ask the company during your interview

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

semetrics

Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)

Stargazers: 0 | Issues: 0 | Issues: 0

speech_dereverbaration_using_lp_residual

A single-channel speech dereverberation method based on DOI: 10.1109/TSA.2005.858066, implemented in MATLAB

Language: MATLAB | Stargazers: 0 | Issues: 1 | Issues: 0

spleeter

Deezer source separation library including pretrained models.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

SRGAN

A PyTorch implementation of SRGAN based on the CVPR 2017 paper "Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network"

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

steerable-nafx

Steerable discovery of neural audio effects

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

TFG-PitchCorrection

This repository contains all the materials produced for my end-of-studies project.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

uavs3e

An AVS3 encoder that supports the AVS3-P2 baseline profile.

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0