Yoshiki Masuyama's repositories

asteroid

The PyTorch-based audio source separation toolkit for researchers

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

s3prl

Audio Foundation Models (Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit)

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

signal-reconstruction-from-mel-spectrogram

Audio demos for "Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase."

Language: HTML | Stargazers: 1 | Issues: 1 | Issues: 0
License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

asteroid-docker

Docker for Speech Separation and Enhancement by Using Asteroid

Language: Dockerfile | Stargazers: 0 | Issues: 1 | Issues: 0

AmplitudeMatching

A multizone sound field control method to synthesize a desired amplitude (or magnitude) distribution over a target region with multiple loudspeakers

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

AudioMAE

This repo hosts the code and models of "Masked Autoencoders that Listen".

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

BS-RoFormer

Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0

clarity

Clarity Challenges

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

dcase2024_task9_baseline

Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"

Stargazers: 0 | Issues: 0 | Issues: 0

demo-page-example

An example of an audio demo page

Language: HTML | Stargazers: 0 | Issues: 1 | Issues: 0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

hartufo

A Python toolkit for data-driven HRTF research

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

LAPChallenge

The LAP Challenge aims at advancing spatial audio technologies through the personalization of HRTFs.

Stargazers: 0 | Issues: 0 | Issues: 0

libri_css

Libri-CSS: dataset and evaluation pipeline

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

MeshRIR

MeshRIR: Dataset of room impulse responses on meshed grid points

Language: Jupyter Notebook | License: CC-BY-4.0 | Stargazers: 0 | Issues: 0 | Issues: 0

mimo-iris

Demo page for the integration of speech separation and recognition with self-supervised learning representation

Language: HTML | Stargazers: 0 | Issues: 2 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

nlg-eval

Evaluation code for various unsupervised automated metrics for Natural Language Generation.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

paderwasn

Paderwasn is a collection of methods for acoustic signal processing in wireless acoustic sensor networks (WASNs).

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

pykaldi2

Yet another speech toolkit based on Kaldi and PyTorch

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

pysepm

Python implementation of performance metrics in Loizou's Speech Enhancement book

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Spatial-Audio-Metrics

Spatial Audio Metrics (SAM) is a toolbox to analyse spatial audio and spatial audio perceptual experiments

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

spear-tools

SPEAR Challenge scripts and tools.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

spear-tools-waspaa2023

Multichannel Subband-Fullband Gated Convolutional Recurrent Neural Network For Direction-Based Speech Enhancement With Head-Mounted Microphone Arrays

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0