Yue Li (RioLLee)

RioLLee

Geek Repo

Company:Northwestern Polytechnical University

Github PK Tool:Github PK Tool

Yue Li's starred repositories

NOTSOFAR1-Challenge

NOTSOFAR-1 Challenge: Distant Diarization and ASR

Language:PythonLicense:MITStargazers:42Issues:0Issues:0

jsalt2020_simulate

Training data simulation

Language:PythonLicense:Apache-2.0Stargazers:41Issues:0Issues:0

wvmos

MOS score prediction by fine-tuned wav2vec2.0 model

Language:PythonStargazers:136Issues:0Issues:0

PixIT

Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings" published at Odyssey 2024

Language:PythonStargazers:27Issues:0Issues:0

neural-fcasa

This is a repository of neural full-rank spatial covariance analysis with speaker activity (neural FCASA).

Language:PythonLicense:MITStargazers:23Issues:0Issues:0

maskgit

Official Jax Implementation of MaskGIT

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:432Issues:0Issues:0

DEQDet

[ICCV 2023] Deep Equilibrium Object Detection

Language:Jupyter NotebookStargazers:23Issues:0Issues:0

Campus2025

2025届互联网校招信息汇总

Stargazers:742Issues:0Issues:0

SSGD

Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"

Language:PythonLicense:Apache-2.0Stargazers:13Issues:0Issues:0

gss

A simple package for Guided source separation (GSS)

Language:PythonLicense:MITStargazers:105Issues:0Issues:0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonLicense:MITStargazers:4139Issues:0Issues:0

LLM-Diarize-ASR-Agnostic

Repository for "LLM-based speaker diarization correction: A generalizable approach" paper

Language:Jupyter NotebookStargazers:10Issues:0Issues:0

llm_speaker_tagging

SLT 2024 Challenge: Post-ASR-Speaker-Tagging

Language:PythonLicense:Apache-2.0Stargazers:13Issues:0Issues:0
License:Apache-2.0Stargazers:48Issues:0Issues:0

FS-EEND

The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]

Language:PythonLicense:MITStargazers:76Issues:0Issues:0

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Language:Jupyter NotebookLicense:BSD-2-ClauseStargazers:3480Issues:0Issues:0

nanodrz

Speaker Diarization with Transformers

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:58Issues:0Issues:0

3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Language:PythonLicense:Apache-2.0Stargazers:1136Issues:0Issues:0

mms_msg

Multipurpose Multi Speaker Mixture Signal Generator

Language:PythonStargazers:43Issues:0Issues:0

Awesome-Speaker-Diarization

Some comprehensive papers about speaker diarization

Stargazers:202Issues:0Issues:0

SSR-V2ray-Trojan

机场推荐与机场评测

Stargazers:3656Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:133260Issues:0Issues:0

NSD-MS2S

CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture

Language:ShellStargazers:63Issues:0Issues:0
Language:PerlLicense:BSD-2-ClauseStargazers:27Issues:0Issues:0
Language:ShellStargazers:47Issues:0Issues:0

NSD-MA-MSE

A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"

Language:ShellStargazers:43Issues:0Issues:0

EEND

End-to-End Neural Diarization

Language:PythonLicense:MITStargazers:368Issues:0Issues:0
Language:PythonStargazers:71Issues:0Issues:0

enc_EEND

Implementation of the paper "End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors" by Shota Horiguchi et al.

Language:PythonStargazers:3Issues:0Issues:0

SSL_Anti-spoofing

This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".

Language:PythonLicense:MITStargazers:101Issues:0Issues:0