Yiwen Wang's starred repositories

neural-IIR-field

Neural IIR Filter Field for HRTF Upsampling and Personalization

Language:PythonLicense:AGPL-3.0Stargazers:14Issues:0Issues:0

MossFormer2

This is the audio sample repository for speech separation model "MossFormer2".

Language:PythonLicense:MITStargazers:68Issues:0Issues:0
Language:PythonStargazers:141Issues:0Issues:0

GenerativeSourceSeparation

Open source code for the paper 'Music Source Separation with Generative Flow'

Language:Jupyter NotebookLicense:MITStargazers:20Issues:0Issues:0

llm-tse

Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)

Language:JavaScriptStargazers:30Issues:0Issues:0

mamba

Mamba SSM architecture

Language:PythonLicense:Apache-2.0Stargazers:12016Issues:0Issues:0

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:34083Issues:0Issues:0

CLAPSep

Query-conditioned target sound extraction model

Language:PythonStargazers:11Issues:0Issues:0

U-Mamba

U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation

Language:PythonLicense:Apache-2.0Stargazers:598Issues:0Issues:0

DNN-based_source_separation

A PyTorch implementation of DNN-based source separation.

Language:PythonStargazers:280Issues:0Issues:0
Language:PythonLicense:MITStargazers:70Issues:0Issues:0

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:2481Issues:0Issues:0

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language:PythonLicense:Apache-2.0Stargazers:2683Issues:0Issues:0

ssast

Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

Language:PythonLicense:BSD-3-ClauseStargazers:358Issues:0Issues:0

sound_generation

Code and generated sounds for "Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning", MLSP 2021

Language:PythonStargazers:67Issues:0Issues:0

FN-SSL

The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization

Language:PythonStargazers:69Issues:0Issues:0

room-impulse-responses

A list of publicly available room impulse response datasets and scripts to download them.

Language:ShellStargazers:372Issues:0Issues:0

lgtfb-en

Learnable Gammatone Filterbank (LGTFB) and Equal-loudness Normalization (EN)

Language:PythonStargazers:12Issues:0Issues:0

mp-gtf

Multi-Phase Gammatone Filterbank (MP-GTF) construction for Python

Language:PythonLicense:NOASSERTIONStargazers:46Issues:0Issues:0

ESC-50

ESC-50: Dataset for Environmental Sound Classification

Language:PythonLicense:NOASSERTIONStargazers:1318Issues:0Issues:0

Wave-U-Net

Implementation of the Wave-U-Net for audio source separation

Language:PythonLicense:MITStargazers:813Issues:0Issues:0

fdndlp

A speech dereverberation algorithm, also called wpe

Language:PythonLicense:MITStargazers:146Issues:0Issues:0

TRUNet

unofficial PyTorch implementation of 《REAL-TIME DENOISING AND DEREVERBERATION WTIH TINY RECURRENT U-NET》

Language:PythonStargazers:87Issues:0Issues:0

NBSS

The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation

Language:PythonLicense:MITStargazers:183Issues:0Issues:0
Language:PythonLicense:MITStargazers:29Issues:0Issues:0

seld-dcase2020

Baseline method for sound event localization task of DCASE 2020 challenge

Language:PythonLicense:NOASSERTIONStargazers:52Issues:0Issues:0

icoDOA

Code repository for the paper Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNs

Language:PythonLicense:AGPL-3.0Stargazers:28Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:192Issues:0Issues:0

MelGAN-Pytorch

A Pytorch Implementation of MelGAN

Language:Jupyter NotebookStargazers:67Issues:0Issues:0

UniAudio

The Open Source Code of UniAudio

Language:PythonStargazers:488Issues:0Issues:0