JFZhouuu's repositories
asv-subtools
Open-source tools for speaker recognition
AutoSpeech
[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
DCA-PLDA
Discriminative Condition-Aware PLDA
ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)
EEND
End-to-End Neural Diarization
k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
kaldi-io-for-python
Python functions for reading kaldi data formats. Useful for rapid prototyping with python.
online_speaker_change_detector
Online streaming speaker change detection model in PyTorch
pytorch-pcen
PyTorch reimplementation of per-channel energy normalization for audio.
speechbrain
A PyTorch-based Speech Toolkit
spleeter
Deezer source separation library including pretrained models.
SSR
(NeurIPS 2021) PyTorch implementation of the paper "Re-ranking for image retrieval and transductive few-shot classification"
VGGSound
VGGSound: A Large-scale Audio-Visual Dataset
voxceleb_trainer
In defence of metric learning for speaker recognition
wespeaker
Production First and Production Ready Speaker Recognition Toolkit