Yoshiki Masuyama's repositories
signal-reconstruction-from-mel-spectrogram
Audio demos for "Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase."
asteroid-docker
Docker for Speech Separation and Enhancement by Using Asteroid
AmplitudeMatching
A multizone sound field control method to synthesize a desired amplitude (or magnitude) distributions over a target region with multiple loudspeakers
AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
BS-RoFormer
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs
clarity
Clarity Challenges
dcase2024_task9_baseline
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
demo-page-example
An example for audio demo page
encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
espnet
End-to-End Speech Processing Toolkit
hartufo
A Python toolkit for data-driven HRTF research
LAPChallenge
The LAP Challenge aims at advancing spatial audio technologies through the personalization of HRTFs.
libri_css
Libri-CSS: dataset and evaluation pipeline
MeshRIR
MeshRIR: Dataset of room impulse responses on meshed grid points
nlg-eval
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
paderwasn
Paderwasn is a collection of methods for acoustic signal processing in wireless acoustic sensor networks (WASNs).
pykaldi2
Yet another speech toolkit based on Kaldi and PyTorch
pysepm
Python implementation of performance metrics in Loizou's Speech Enhancement book
Spatial-Audio-Metrics
Spatial Audio Metrics (SAM) is a toolbox to analyse spatial audio and spatial audio perceptual experiments
spear-tools
SPEAR Challenge scripts and tools.
spear-tools-waspaa2023
Multichannel Subband-Fullband Gated Convolutional Recurrent Neural Network For Direction-Based Speech Enhancement With Head-Mounted Microphone Arrays