Nitin's repositories
SpeechEnhancement
Combining Weighted Multi-resolution STFT Loss and Distance Fusion to Optimize Speech Enhancement Generative Adversarial Networks
SpeechEnhancement-Model-s-Deployment
SpeechEnhancement model deployment with C++
Voice-Preprocessing-Toolkit
some voice preprocessing tools in it
basic-pitch
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
DSP_SibilanceDetection
A DSP algorithm designed to detect sibilance
hrbeuthesis
哈尔滨工程大学本硕博学位论文LaTex模板
nara_wpe
去混响
noisereduce
Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)
P.808
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).
reverb-algorithms
A set of scripts implementing popular reverberation audio effect algorithms.
reverse-interview
Questions to ask the company during your interview
semetrics
Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)
speech_dereverbaration_using_lp_residual
This is a single channel speech dereverberation method based on DOI: 10.1109/TSA.2005.858066; implemented in MATLAB
spleeter
Deezer source separation library including pretrained models.
SRGAN
A PyTorch implementation of SRGAN based on CVPR 2017 paper "Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network"
steerable-nafx
Steerable discovery of neural audio effects
TFG-PitchCorrection
This repository contains all the materials generated for my end of studies project.
uavs3e
AVS3 encoder which supports AVS3-P2 baseline profile.
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit