Beast code in Giters

Yoshiki Masuyama's repositories

asteroid

The PyTorch-based audio source separation toolkit for researchers

Language:PythonMIT100

s3prl

Audio Foundation Models (Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit)

Language:PythonApache-2.0100

signal-reconstruction-from-mel-spectrogram

Audio demos for "Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase."

Language:HTML1 10

SPMamba

Apache-2.0100

asteroid-docker

Docker for Speech Separation and Enhancement by Using Asteroid

Language:Dockerfile010

AmplitudeMatching

A multizone sound field control method to synthesize a desired amplitude (or magnitude) distributions over a target region with multiple loudspeakers

Language:Jupyter NotebookMIT000

asteroid_jaCappella

Language:PythonMIT000

AudioMAE

This repo hosts the code and models of "Masked Autoencoders that Listen".

Language:PythonNOASSERTION000

BS-RoFormer

Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs

MIT000

BSRNN

000

clarity

Clarity Challenges

Language:PythonMIT000

dcase2024_task9_baseline

Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"

000

demo-page-example

An example for audio demo page

Language:HTML010

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language:PythonNOASSERTION000

espnet

End-to-End Speech Processing Toolkit

Language:PythonApache-2.0000

hartufo

A Python toolkit for data-driven HRTF research

MIT000

HRTF-upsampling-with-a-generative-adversarial-network-using-a-gnomonic-equiangular-projection

000

LAPChallenge

The LAP Challenge aims at advancing spatial audio technologies through the personalization of HRTFs.

000

libri_css

Libri-CSS: dataset and evaluation pipeline

Language:PythonNOASSERTION000

MeshRIR

MeshRIR: Dataset of room impulse responses on meshed grid points

Language:Jupyter NotebookCC-BY-4.0000

mimo-iris

Demo page for the integration of speech separation and recognition with self-supervised learning representation

Language:HTML020

mvae-ss

Language:Python000

nlg-eval

Evaluation code for various unsupervised automated metrics for Natural Language Generation.

Language:PythonNOASSERTION000

paderwasn

Paderwasn is a collection of methods for acoustic signal processing in wireless acoustic sensor networks (WASNs).

Language:PythonMIT000

pykaldi2

Yet another speech toolkit based on Kaldi and PyTorch

Language:PythonMIT000

pysepm

Python implementation of performance metrics in Loizou's Speech Enhancement book

Language:PythonGPL-3.0000

Spatial-Audio-Metrics

Spatial Audio Metrics (SAM) is a toolbox to analyse spatial audio and spatial audio perceptual experiments

GPL-3.0000

spear-tools

SPEAR Challenge scripts and tools.

Language:Python000

spear-tools-waspaa2023

Multichannel Subband-Fullband Gated Convolutional Recurrent Neural Network For Direction-Based Speech Enhancement With Head-Mounted Microphone Arrays

Language:PythonApache-2.0000

whisper-asr-finetune

Language:PythonMIT000