chenxinglili

followers

following

stars

CASIA

Beijing

chenxinglili's repositories

Two-dimensional-Self-attention-based-Speech-Enhancement

A 2-dimensional Self-attention-based Solution with Cooperative Gated Convolutional Modules for Speech Enhancement

Language:Python200

asteroid

The PyTorch-based audio source separation toolkit for researchers || Pretrained models available

Language:PythonMIT000

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Language:PythonNOASSERTION000

av-se

Deep-Learning-Based Audio-Visual Speech Enhancement and Separation

000

awesome-audio-visual

A curated list of different papers and datasets in various areas of audio-visual processing

000

bark

🔊 Text-Prompted Generative Audio Model

NOASSERTION000

DARCN

The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"

Language:Python000

DCUNetTorchSound

Implementation of Phase-aware speech enhancement with deep complex U-Net

000

DeepComplexCRN

Apache-2.0000

ganhacks

starter from "How to Train a GAN?" at NIPS2016

000

GC3

Language:PythonMIT000

KAIR

Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN

MIT000

Listening-to-Sound-of-Silence-for-Speech-Denoising

[NeurIPS 2020] Official repository for the project "Listening to Sound of Silence for Speech Denoising"

000

MSNet

Multi-scale speech enhancement

000

performer-pytorch

An implementation of Performer, a linear attention-based transformer, in Pytorch

MIT000

pika

a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi

Apache-2.0000

python-pesq

PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)

Language:CMIT010

pytorch-optimizer

torch-optimizer -- collection of optimizers for Pytorch

Apache-2.0000

pytorch_cpp

Deep Learning sample programs using PyTorch in C++

MIT000

recommended-books

计算机经典书籍推荐部分书籍提供PDF下载

MIT000

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

MIT000

SDNet

Speaker and Direction Inferred Dual-channel Speech Separation

000

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Apache-2.0000

singing_transcription_ICASSP2021

The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"

000

sms_wsj

SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition

MIT000

SpeechTransProgress

Tracking the progress in end-to-end speech translation

CC0-1.0000

spleeter

Deezer source separation library including pretrained models.

Language:PythonMIT010

Subband-Music-Separation

Pytorch: Channel-wise subband input for better voice and accompaniment separation

000

traditional-speech-enhancement

语音增强传统方法

MIT000

WeTS

A benchmark for the task of translation suggestion

Language:MaskUnlicense010