Audio-WestlakeU

Audio Signal and Information Processing Lab at Westlake University

Location: Hangzhou

Home Page: https://audio.westlake.edu.cn/

Audio-WestlakeU's repositories

FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Language: Python | License: MIT | Stargazers: 511 | Issues: 10 | Issues: 60

NBSS

The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation

Language: Python | License: MIT | Stargazers: 163 | Issues: 6 | Issues: 30

McNet

The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023

audiossl

A library for easier audio self-supervised training and downstream task evaluation

Language: Python | License: NOASSERTION | Stargazers: 66 | Issues: 5 | Issues: 8

FN-SSL

The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization

FS-EEND

The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]

Language: Python | License: MIT | Stargazers: 61 | Issues: 3 | Issues: 11

ATST-SED

This repo contains the official implementation of "Fine-tune the pretrained ATST model for sound event detection".

Language: Jupyter Notebook | License: MIT | Stargazers: 50 | Issues: 3 | Issues: 11

RVAE-EM

Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP 2024]

Language: Python | License: MIT | Stargazers: 31 | Issues: 3 | Issues: 4

pytorch_lightning_template_for_beginners

A PyTorch template for beginners, based on pytorch_lightning

Language: Python | Stargazers: 28 | Issues: 3 | Issues: 0

RCT

This repo contains the official implementation of RCT.

Language: Python | Stargazers: 12 | Issues: 3 | Issues: 0

UMA-ASR

This repository is the official implementation of "Unimodal Aggregation for CTC-based Speech Recognition".

Language: Python | Stargazers: 8 | Issues: 3 | Issues: 0
Language: MATLAB | Stargazers: 5 | Issues: 2 | Issues: 0

Audio-WestlakeU.github.io

The Audio Signal and Information Processing Lab at Westlake University focuses on speech processing algorithms

License: MIT | Stargazers: 3 | Issues: 2 | Issues: 0

ATST-RCT

ATST-RCT model for DCASE 2022 task4.

Language: MATLAB | Stargazers: 0 | Issues: 2 | Issues: 0