JFZhouuu

JFZhouuu

Geek Repo

Github PK Tool:Github PK Tool

JFZhouuu's repositories

asv-subtools

An Open Source Tools for Speaker Recognition

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

AutoSpeech

[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

License:Apache-2.0Stargazers:0Issues:0Issues:0

DCA-PLDA

Discriminative Condition-Aware PLDA

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

Language:PythonStargazers:0Issues:0Issues:0

EEND

End-to-End Neural Diarization

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Language:CudaLicense:NOASSERTIONStargazers:0Issues:0Issues:0

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language:ShellLicense:NOASSERTIONStargazers:0Issues:0Issues:0

kaldi-io-for-python

Python functions for reading kaldi data formats. Useful for rapid prototyping with python.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

online_speaker_change_detector

Online streaming speaker change detection model in Pytorch

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

pytorch-pcen

PyTorch reimplementation of per-channel energy normalization for audio.

License:MITStargazers:0Issues:0Issues:0

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

spleeter

Deezer source separation library including pretrained models.

License:MITStargazers:0Issues:0Issues:0

SSR

(NeurIPS 2021) Pytorch implementation of paper "Re-ranking for image retrieval and transductive few-shot classification"

License:MITStargazers:0Issues:0Issues:0

VGGSound

VGGSound: A Large-scale Audio-Visual Dataset

License:NOASSERTIONStargazers:0Issues:0Issues:0

voxceleb_trainer

In defence of metric learning for speaker recognition

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

wespeaker

Production First and Production Ready Speaker Recognition Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0