RioLLee

Yue Li's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.0131792 1117 15645

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonBSD-2-Clause11306 132 677

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookMIT5868 70 986

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonNOASSERTION5837 56 1065

ConvNeXt

Code release for ConvNeXt model

Language:PythonMIT5699 32 130

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonMIT3935 49 226

SSR-V2ray-Trojan

机场推荐与机场评测

3334 430

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Language:Jupyter NotebookBSD-2-Clause3288 46 161

wisper

A micro library providing Ruby objects with Publish-Subscribe capabilities

Language:Ruby3259 49 100

ConvNeXt-V2

Code release for ConvNeXt V2 model

Language:PythonNOASSERTION1460 7 70

3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Language:PythonApache-2.01064 17 90

FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

Language:PythonMIT584 19 84

tinydiarize

Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens

Language:PythonMIT417 24 15

EEND

End-to-End Neural Diarization

Language:PythonMIT366 17 46

Awesome-Speaker-Diarization

Some comprehensive papers about speaker diarization

187 100

aasist

Official PyTorch implementation of "AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks"

Language:PythonMIT160 7 9

SSL_Anti-spoofing

This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".

Language:PythonMIT92 4 5

FS-EEND

The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]

Language:PythonMIT75 3 11

EEND

Language:Python70 8 8

NSD-MS2S

CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture

Language:Shell60 3 8

nanodrz

Speaker Diarization with Transformers

Language:Jupyter NotebookNOASSERTION56 6 3

RawBoost-antispoofing

This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Speaker Verification Anti-Spoofing".

Language:PythonMIT49 1 5

AMI-diarization-setup

Apache-2.047 50

EEND_dataprep

Language:Shell47 5 8

mms_msg

Multipurpose Multi Speaker Mixture Signal Generator

Language:Python42 60

NSD-MA-MSE

A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"

Language:Shell40 4 2

dihard3_baseline

Language:PerlBSD-2-Clause27 3 1

llm_speaker_tagging

SLT 2024 Challenge: Post-ASR-Speaker-Tagging

Language:PythonApache-2.011 2 1

LLM-Diarize-ASR-Agnostic

Repository for "LLM-based speaker diarization correction: A generalizable approach" paper

Language:Jupyter Notebook6 10

enc_EEND

Implementation of the paper "End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors" by Shota Horiguchi et al.

Language:Python300