Yoshiki Masuyama's starred repositories
speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
SciencePlots
Matplotlib styles for scientific plotting
gammachirpy
A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)
ThunderKittens
Tile primitives for speedy kernels
LAPChallenge
The LAP Challenge aims at advancing spatial audio technologies through the personalization of HRTFs.
dcase2024_task9_baseline
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
awesome-whisper
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
seamless_communication_emo
Foundational Models for State-of-the-Art Speech and Text Translation
DTTNet-Pytorch
An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation
Swin-Transformer-1d
PyTorch implementation of Swin Transformer for 1-dimensional data
NLP2024-tutorial-3
NLP2024 チュートリアル3 作って学ぶ日本語大規模言語モデル - 環境構築手順とソースコード / NLP2024 Tutorial 3: Practicing how to build a Japanese large-scale language model - Environment construction and experimental source codes
HiddenMambaAttn
Official PyTorch Implementation of "The Hidden Attention of Mamba Models"
brouhaha-vad
Predicts the level of noise and reverberation on your audiofiles
Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark