RioLLee

Yue Li's starred repositories

NOTSOFAR1-Challenge

NOTSOFAR-1 Challenge: Distant Diarization and ASR

Language:PythonMIT4200

jsalt2020_simulate

Training data simulation

Language:PythonApache-2.04100

wvmos

MOS score prediction by fine-tuned wav2vec2.0 model

Language:Python13600

PixIT

Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings" published at Odyssey 2024

Language:Python2700

neural-fcasa

This is a repository of neural full-rank spatial covariance analysis with speaker activity (neural FCASA).

Language:PythonMIT2300

maskgit

Official Jax Implementation of MaskGIT

Language:Jupyter NotebookApache-2.043200

DEQDet

[ICCV 2023] Deep Equilibrium Object Detection

Language:Jupyter Notebook2300

Campus2025

2025届互联网校招信息汇总

74200

SSGD

Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"

Language:PythonApache-2.01300

gss

A simple package for Guided source separation (GSS)

Language:PythonMIT10500

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonMIT413900

LLM-Diarize-ASR-Agnostic

Repository for "LLM-based speaker diarization correction: A generalizable approach" paper

Language:Jupyter Notebook1000

llm_speaker_tagging

SLT 2024 Challenge: Post-ASR-Speaker-Tagging

Language:PythonApache-2.01300

AMI-diarization-setup

Apache-2.04800

FS-EEND

The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]

Language:PythonMIT7600

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Language:Jupyter NotebookBSD-2-Clause348000

nanodrz

Speaker Diarization with Transformers

Language:Jupyter NotebookNOASSERTION5800

3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Language:PythonApache-2.0113600

mms_msg

Multipurpose Multi Speaker Mixture Signal Generator

Language:Python4300

Awesome-Speaker-Diarization

Some comprehensive papers about speaker diarization

20200

SSR-V2ray-Trojan

机场推荐与机场评测

365600

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.013326000

NSD-MS2S

CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture

Language:Shell6300

dihard3_baseline

Language:PerlBSD-2-Clause2700

EEND_dataprep

Language:Shell4700

NSD-MA-MSE

A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"

Language:Shell4300

EEND

End-to-End Neural Diarization

Language:PythonMIT36800

EEND

Language:Python7100

enc_EEND

Implementation of the paper "End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors" by Shota Horiguchi et al.

Language:Python300

SSL_Anti-spoofing

This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".

Language:PythonMIT10100