Yoshiki Masuyama's starred repositories

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Language:PythonLicense:MITStargazers:854Issues:0Issues:0

PAM

PAM is a no-reference audio quality metric for audio generation tasks

Language:PythonLicense:MITStargazers:29Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:2201Issues:0Issues:0

SciencePlots

Matplotlib styles for scientific plotting

Language:PythonLicense:MITStargazers:6649Issues:0Issues:0
License:MITStargazers:10Issues:0Issues:0

gammachirpy

A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:26Issues:0Issues:0

ThunderKittens

Tile primitives for speedy kernels

Language:CudaLicense:MITStargazers:1257Issues:0Issues:0

KANeRF

KAN-based NeRF

Language:PythonStargazers:131Issues:0Issues:0

EvalAI

:cloud: :rocket: :bar_chart: :chart_with_upwards_trend: Evaluating state of the art in AI

Language:PythonLicense:NOASSERTIONStargazers:1706Issues:0Issues:0
Language:PythonStargazers:16Issues:0Issues:0

agentops

Python SDK for agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks like CrewAI, Langchain, and Autogen

Language:PythonLicense:MITStargazers:875Issues:0Issues:0

LAPChallenge

The LAP Challenge aims at advancing spatial audio technologies through the personalization of HRTFs.

Language:Jupyter NotebookStargazers:8Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6756Issues:0Issues:0

dcase2024_task9_baseline

Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"

Language:PythonStargazers:16Issues:0Issues:0

awesome-whisper

🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI

License:CC0-1.0Stargazers:1064Issues:0Issues:0

pghipy

STFT/ISTFT transforms and phase recovery using phase gradient heap integration

Language:PythonLicense:MITStargazers:6Issues:0Issues:0

hartufo

A Python toolkit for data-driven HRTF research

Language:PythonLicense:MITStargazers:10Issues:0Issues:0

WavCraft

Official repo for WavCraft, an AI agent for audio creation and editing

Language:PythonLicense:NOASSERTIONStargazers:640Issues:0Issues:0

seamless_communication_emo

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2Issues:0Issues:0

DTTNet-Pytorch

An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation

Language:PythonLicense:Apache-2.0Stargazers:61Issues:0Issues:0

Swin-Transformer-1d

PyTorch implementation of Swin Transformer for 1-dimensional data

Language:PythonLicense:MITStargazers:4Issues:0Issues:0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonLicense:MITStargazers:4071Issues:0Issues:0

NLP2024-tutorial-3

NLP2024 チュートリアル3 作って学ぶ日本語大規模言語モデル - 環境構築手順とソースコード / NLP2024 Tutorial 3: Practicing how to build a Japanese large-scale language model - Environment construction and experimental source codes

License:Apache-2.0Stargazers:100Issues:0Issues:0

HiddenMambaAttn

Official PyTorch Implementation of "The Hidden Attention of Mamba Models"

Language:PythonStargazers:161Issues:0Issues:0

sumeval

Well tested & Multi-language evaluation framework for text summarization.

Language:PythonLicense:Apache-2.0Stargazers:602Issues:0Issues:0

brouhaha-vad

Predicts the level of noise and reverberation on your audiofiles

Language:Jupyter NotebookLicense:MITStargazers:118Issues:0Issues:0

Codec-SUPERB

Audio Codec Speech processing Universal PERformance Benchmark

Language:PythonStargazers:171Issues:0Issues:0

sacred

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Language:PythonLicense:MITStargazers:4171Issues:0Issues:0

SoundCard

A Pure-Python Real-Time Audio Library

Language:PythonLicense:BSD-3-ClauseStargazers:648Issues:0Issues:0