hiranoyu0830

hiranoyu0830

Geek Repo

Location:Japan

Github PK Tool:Github PK Tool

hiranoyu0830's starred repositories

Language:PythonLicense:MITStargazers:5Issues:0Issues:0

spyder

Simple Python package for fast DER computation

Language:C++License:MITStargazers:31Issues:0Issues:0

pybind11

Seamless operability between C++11 and Python

Language:C++License:NOASSERTIONStargazers:15549Issues:0Issues:0

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Language:Jupyter NotebookLicense:BSD-2-ClauseStargazers:3373Issues:0Issues:0

layerwise-analysis

Layer-wise analysis of self-supervised pre-trained speech representations

Language:PythonStargazers:90Issues:0Issues:0

claude.vim

Claude vim plugin for AI pair programming - a hacker's gateway to LLMs

Language:Vim ScriptLicense:MITStargazers:168Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:31Issues:0Issues:0

al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

Language:HTMLLicense:MITStargazers:10678Issues:0Issues:0
License:Apache-2.0Stargazers:48Issues:0Issues:0

CSEnet-ASR

Cross-Speaker Encoding Network for Multi-talker Speech Recognition

Language:PythonLicense:MITStargazers:10Issues:0Issues:0

pydiardecode

A fast and lightweight python-based CTC beam search decoder for speech recognition.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

llm_speaker_tagging

SLT 2024 Challenge: Post-ASR-Speaker-Tagging

Language:PythonLicense:Apache-2.0Stargazers:13Issues:0Issues:0

chime-utils

Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.

Language:PythonLicense:MITStargazers:20Issues:0Issues:0

stable-ts

Transcription, forced alignment, and audio indexing with OpenAI's Whisper

Language:PythonLicense:MITStargazers:1498Issues:0Issues:0

Awesome-Speaker-Diarization

Some comprehensive papers about speaker diarization

Stargazers:197Issues:0Issues:0

sms_wsj

SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition

Language:PythonLicense:MITStargazers:102Issues:0Issues:0

MISOnet

Unofficial Multi-microphone complex spectral mapping for utterance-wise and continuous speech separation(MISO-BF-MISO)

Language:PythonLicense:MITStargazers:51Issues:0Issues:0

EEND-vector-clustering

This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-clustering.

Language:PythonLicense:NOASSERTIONStargazers:70Issues:0Issues:0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonLicense:MITStargazers:4060Issues:0Issues:0

meeteval

MeetEval - A meeting transcription evaluation toolkit

Language:PythonLicense:MITStargazers:75Issues:0Issues:0

gss

A simple package for Guided source separation (GSS)

Language:PythonLicense:MITStargazers:104Issues:0Issues:0

awesome-speech-enhancement

speech enhancement\speech seperation\sound source localization

License:GPL-2.0Stargazers:1008Issues:0Issues:0

C8DASR-Baseline-NeMo

NeMo: a toolkit for conversational AI

Language:PythonLicense:Apache-2.0Stargazers:12Issues:0Issues:0

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language:PythonLicense:Apache-2.0Stargazers:6852Issues:0Issues:0

dotfiles

My dotfiles (zsh + tmux 2.6 + vim 8 / nvim)

Language:LuaStargazers:7Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:20Issues:0Issues:0

EEND

End-to-End Neural Diarization

Language:PythonLicense:MITStargazers:367Issues:0Issues:0

Auto-Tuning-Spectral-Clustering

This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"

Language:PythonLicense:MITStargazers:105Issues:0Issues:0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:PythonLicense:NOASSERTIONStargazers:82521Issues:0Issues:0