wyw97

Yiwen Wang's starred repositories

neural-IIR-field

Neural IIR Filter Field for HRTF Upsampling and Personalization

Language:PythonAGPL-3.01400

MossFormer2

This is the audio sample repository for speech separation model "MossFormer2".

Language:PythonMIT6800

multi-source-diffusion-models

Language:Python14100

GenerativeSourceSeparation

Open source code for the paper 'Music Source Separation with Generative Flow'

Language:Jupyter NotebookMIT2000

llm-tse

Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)

Language:JavaScript3000

mamba

Mamba SSM architecture

Language:PythonApache-2.01201600

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookMIT3408300

CLAPSep

Query-conditioned target sound extraction model

Language:Python1100

U-Mamba

U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation

Language:PythonApache-2.059800

DNN-based_source_separation

A PyTorch implementation of DNN-based source separation.

Language:Python28000

causal-transformer-decoder

Language:PythonMIT7000

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language:PythonApache-2.0248100

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language:PythonApache-2.0268300

ssast

Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

Language:PythonBSD-3-Clause35800

sound_generation

Code and generated sounds for "Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning", MLSP 2021

Language:Python6700

FN-SSL

The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization

Language:Python6900

room-impulse-responses

A list of publicly available room impulse response datasets and scripts to download them.

Language:Shell37200

lgtfb-en

Learnable Gammatone Filterbank (LGTFB) and Equal-loudness Normalization (EN)

Language:Python1200

mp-gtf

Multi-Phase Gammatone Filterbank (MP-GTF) construction for Python

Language:PythonNOASSERTION4600

ESC-50

ESC-50: Dataset for Environmental Sound Classification

Language:PythonNOASSERTION131800

Wave-U-Net

Implementation of the Wave-U-Net for audio source separation

Language:PythonMIT81300

fdndlp

A speech dereverberation algorithm, also called wpe

Language:PythonMIT14600

TRUNet

unofficial PyTorch implementation of 《REAL-TIME DENOISING AND DEREVERBERATION WTIH TINY RECURRENT U-NET》

Language:Python8700

NBSS

The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation

Language:PythonMIT18300

sgmse-bbed

TODO

Language:PythonMIT2900

seld-dcase2020

Baseline method for sound event localization task of DCASE 2020 challenge

Language:PythonNOASSERTION5200

icoDOA

Code repository for the paper Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNs

Language:PythonAGPL-3.02800

RMVPE

Language:PythonApache-2.019200

MelGAN-Pytorch

A Pytorch Implementation of MelGAN

Language:Jupyter Notebook6700

UniAudio

The Open Source Code of UniAudio

Language:Python48800