Yue Li (RioLLee)

RioLLee

Geek Repo

Company:Northwestern Polytechnical University

Github PK Tool:Github PK Tool

Yue Li's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:131792Issues:1117Issues:15645

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-2-ClauseStargazers:11306Issues:132Issues:677

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookLicense:MITStargazers:5868Issues:70Issues:986

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonLicense:NOASSERTIONStargazers:5837Issues:56Issues:1065

ConvNeXt

Code release for ConvNeXt model

Language:PythonLicense:MITStargazers:5699Issues:32Issues:130

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonLicense:MITStargazers:3935Issues:49Issues:226

SSR-V2ray-Trojan

机场推荐与机场评测

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Language:Jupyter NotebookLicense:BSD-2-ClauseStargazers:3288Issues:46Issues:161

wisper

A micro library providing Ruby objects with Publish-Subscribe capabilities

ConvNeXt-V2

Code release for ConvNeXt V2 model

Language:PythonLicense:NOASSERTIONStargazers:1460Issues:7Issues:70

3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Language:PythonLicense:Apache-2.0Stargazers:1064Issues:17Issues:90

FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

Language:PythonLicense:MITStargazers:584Issues:19Issues:84

tinydiarize

Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens

Language:PythonLicense:MITStargazers:417Issues:24Issues:15

EEND

End-to-End Neural Diarization

Language:PythonLicense:MITStargazers:366Issues:17Issues:46

Awesome-Speaker-Diarization

Some comprehensive papers about speaker diarization

aasist

Official PyTorch implementation of "AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks"

Language:PythonLicense:MITStargazers:160Issues:7Issues:9

SSL_Anti-spoofing

This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".

Language:PythonLicense:MITStargazers:92Issues:4Issues:5

FS-EEND

The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]

Language:PythonLicense:MITStargazers:75Issues:3Issues:11

NSD-MS2S

CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture

nanodrz

Speaker Diarization with Transformers

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:56Issues:6Issues:3

RawBoost-antispoofing

This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Speaker Verification Anti-Spoofing".

Language:PythonLicense:MITStargazers:49Issues:1Issues:5

mms_msg

Multipurpose Multi Speaker Mixture Signal Generator

Language:PythonStargazers:42Issues:6Issues:0

NSD-MA-MSE

A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"

Language:PerlLicense:BSD-2-ClauseStargazers:27Issues:3Issues:1

llm_speaker_tagging

SLT 2024 Challenge: Post-ASR-Speaker-Tagging

Language:PythonLicense:Apache-2.0Stargazers:11Issues:2Issues:1

LLM-Diarize-ASR-Agnostic

Repository for "LLM-based speaker diarization correction: A generalizable approach" paper

Language:Jupyter NotebookStargazers:6Issues:1Issues:0

enc_EEND

Implementation of the paper "End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors" by Shota Horiguchi et al.

Language:PythonStargazers:3Issues:0Issues:0