zabir-nabil / awesome-speaker-recognition-verification

A curated list of awesome speaker recognition/verification papers, projects, datasets, and competitions.

speaker-recognition speaker-verification speaker-identification speaker speaker-embedding deep-learning awesome-list machine-learning

A curated list of awesome speaker recognition/verification/identification papers, projects, datasets, and competitions.

Table of Contents

Books
Videos and Lectures
Tutorials
Papers with Code
Pretrained models/embeddings
Papers
Github Repositories
Datasets
Conferences
Competitions
Frameworks
Tools
Miscellaneous
Contributing

Books

Fundamentals of Speaker Recognition by Beigi, Homayoon
Machine Learning for Speaker Recognition by Jen-Tzung Chien and Man-Wai Mak

Videos and Lectures

Speaker Verification - The present and future of voiceprint based security by Professor Eliathamby Ambikairajah
Identify Speaker Voice Machine learning model Neural Networks in Keras/TensorFlow
X-vectors: Robust DNN embeddings for speaker recognition
A brief Introduction to SincNet

Papers with Code

Papers

Speech and Speaker Recognition from Raw Waveform with SincNet (CNN, speech + speaker)
Deep Neural Network Embeddings for Text-Independent Speaker Verification (x-vector)
How to train your speaker embeddings extractor (VAD + speaker embeddings)

Github Repositories

https://github.com/WeidiXie/VGG-Speaker-Recognition (python 2 + tensorflow 1.x)
https://github.com/zabir-nabil/tf2-speaker-recognition (python 3 + tensorflow 2.x)
https://github.com/mravanelli/SincNet (python 3 + pytorch)

Pretrained models/embeddings

deep-speaker [softmax + triplet works best, clean audio]
meta-SR [pytorch, short utterances]

Datasets

VoxCeleb mirror
CN-Celeb
ST Chinese Mandarin Corpus
AIF [not public]
MLS [large, multilingual]

Conferences

ICASSP - IEEE International Conference on Acoustics, Speech and Signal Processing

Competitions

Frameworks

speechbrain

Tools

Kaldi Speech Recognition Toolkit - x-vector extraction
PLDA/LDA from enrollment using Kaldi - PLDA scoring
Neural PLDA - Neural PLDA, Kaldi

Miscellaneous

Awesome speaker recognition

Contributing

Have anything in mind that you think is awesome and would fit in this list? Feel free to send a pull request.

License

To the extent possible under law, Zabir Al Nazi has waived all copyright and related or neighboring rights to this work.

About

MIT License