chenxinglili

chenxinglili

Geek Repo

Company:CASIA

Location:Beijing

Github PK Tool:Github PK Tool

chenxinglili's repositories

Two-dimensional-Self-attention-based-Speech-Enhancement

A 2-dimensional Self-attention-based Solution with Cooperative Gated Convolutional Modules for Speech Enhancement

Language:PythonStargazers:2Issues:0Issues:0

asteroid

The PyTorch-based audio source separation toolkit for researchers || Pretrained models available

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

av-se

Deep-Learning-Based Audio-Visual Speech Enhancement and Separation

Stargazers:0Issues:0Issues:0

awesome-audio-visual

A curated list of different papers and datasets in various areas of audio-visual processing

Stargazers:0Issues:0Issues:0

bark

🔊 Text-Prompted Generative Audio Model

License:NOASSERTIONStargazers:0Issues:0Issues:0

DARCN

The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"

Language:PythonStargazers:0Issues:0Issues:0

DCUNetTorchSound

Implementation of Phase-aware speech enhancement with deep complex U-Net

Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

ganhacks

starter from "How to Train a GAN?" at NIPS2016

Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

KAIR

Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN

License:MITStargazers:0Issues:0Issues:0

Listening-to-Sound-of-Silence-for-Speech-Denoising

[NeurIPS 2020] Official repository for the project "Listening to Sound of Silence for Speech Denoising"

Stargazers:0Issues:0Issues:0

MSNet

Multi-scale speech enhancement

Stargazers:0Issues:0Issues:0

performer-pytorch

An implementation of Performer, a linear attention-based transformer, in Pytorch

License:MITStargazers:0Issues:0Issues:0

pika

a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi

License:Apache-2.0Stargazers:0Issues:0Issues:0

python-pesq

PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)

Language:CLicense:MITStargazers:0Issues:1Issues:0

pytorch-optimizer

torch-optimizer -- collection of optimizers for Pytorch

License:Apache-2.0Stargazers:0Issues:0Issues:0

pytorch_cpp

Deep Learning sample programs using PyTorch in C++

License:MITStargazers:0Issues:0Issues:0

recommended-books

计算机经典书籍推荐 部分书籍提供PDF下载

License:MITStargazers:0Issues:0Issues:0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

License:MITStargazers:0Issues:0Issues:0

SDNet

Speaker and Direction Inferred Dual-channel Speech Separation

Stargazers:0Issues:0Issues:0

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

License:Apache-2.0Stargazers:0Issues:0Issues:0

singing_transcription_ICASSP2021

The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"

Stargazers:0Issues:0Issues:0

sms_wsj

SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition

License:MITStargazers:0Issues:0Issues:0

SpeechTransProgress

Tracking the progress in end-to-end speech translation

License:CC0-1.0Stargazers:0Issues:0Issues:0

spleeter

Deezer source separation library including pretrained models.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

Subband-Music-Separation

Pytorch: Channel-wise subband input for better voice and accompaniment separation

Stargazers:0Issues:0Issues:0

traditional-speech-enhancement

语音增强传统方法

License:MITStargazers:0Issues:0Issues:0

WeTS

A benchmark for the task of translation suggestion

Language:MaskLicense:UnlicenseStargazers:0Issues:1Issues:0