chenxinglili's repositories
Two-dimensional-Self-attention-based-Speech-Enhancement
A 2-dimensional Self-attention-based Solution with Cooperative Gated Convolutional Modules for Speech Enhancement
asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
av-se
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
bark
🔊 Text-Prompted Generative Audio Model
DARCN
The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"
DCUNetTorchSound
Implementation of Phase-aware speech enhancement with deep complex U-Net
ganhacks
starter from "How to Train a GAN?" at NIPS2016
KAIR
Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN
Listening-to-Sound-of-Silence-for-Speech-Denoising
[NeurIPS 2020] Official repository for the project "Listening to Sound of Silence for Speech Denoising"
MSNet
Multi-scale speech enhancement
performer-pytorch
An implementation of Performer, a linear attention-based transformer, in PyTorch
pika
A lightweight speech processing toolkit based on PyTorch and (Py)Kaldi
python-pesq
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)
pytorch-optimizer
torch-optimizer: a collection of optimizers for PyTorch
pytorch_cpp
Deep Learning sample programs using PyTorch in C++
recommended-books
Recommended classic computer science books; PDF downloads are provided for some of them
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
SDNet
Speaker and Direction Inferred Dual-channel Speech Separation
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
singing_transcription_ICASSP2021
The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"
sms_wsj
SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition
SpeechTransProgress
Tracking the progress in end-to-end speech translation
Subband-Music-Separation
PyTorch: Channel-wise subband input for better voice and accompaniment separation
traditional-speech-enhancement
Traditional speech enhancement methods