hiranoyu0830's starred repositories
whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
layerwise-analysis
Layer-wise analysis of self-supervised pre-trained speech representations
claude.vim
Claude vim plugin for AI pair programming - a hacker's gateway to LLMs
CSEnet-ASR
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
pydiardecode
A fast and lightweight python-based CTC beam search decoder for speech recognition.
llm_speaker_tagging
SLT 2024 Challenge: Post-ASR-Speaker-Tagging
chime-utils
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
EEND-vector-clustering
This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-clustering.
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
awesome-speech-enhancement
speech enhancement\speech seperation\sound source localization
C8DASR-Baseline-NeMo
NeMo: a toolkit for conversational AI
modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
Auto-Tuning-Spectral-Clustering
This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"