There are 31 repositories under the speaker-identification topic.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
SincNet is a neural architecture for efficiently processing raw audio samples.
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, including speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Speaker Identification System (up to 100% accuracy); built using Python 2.7 and the python_speech_features library
Simple d-vector based speaker recognition (verification and identification) using PyTorch
Identifying people from small audio fragments
Deep learning: one-shot learning for speaker recognition using filter banks
Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"
A lightweight neural speaker embedding extractor based on Kaldi and PyTorch.
[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1
Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)
Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
A tool for summarizing dialogues from videos or audio
Building the simplest TTS front-end collection and the simplest audiobook production workflow. Novels are split into sentences with regex rules, and dialogue in the novel is attributed to speakers with RoBERTa, enabling one-click generation of multi-speaker audiobooks. Multi-speaker speech synthesis for high-quality audiobook production.
Keras implementation of DeepMind's WaveNet for supervised learning tasks
Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO
Speakerbox: Fine-tune Audio Transformers for speaker identification.
Voiceprint Recognition (VPR), also known as Speaker Recognition, comes in two types: Speaker Identification and Speaker Verification.
:sound: :boy: :girl: :woman: :man: Speaker identification using voice MFCCs and GMM
This master's thesis project is based on OpenAI Whisper, with the goal of transcribing interviews
Kaldi-based speaker verification
Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in PyTorch
VoxCeleb1 i-vector based speaker recognition system
⇨ The speaker recognition system consists of two phases, feature extraction and recognition. ⇨ In the extraction phase, the speaker's voice is recorded and a set of characteristic features is extracted to form a model. ⇨ During the recognition phase, a speech sample is compared against a previously created voiceprint stored in the database. ⇨ The highlight of the system is that it can also identify the speaker's voice in a multi-speaker environment. ⇨ A multi-layer perceptron (MLP) neural network trained with error back-propagation was used to build and evaluate the system. ⇨ The system response time was 74 µs with an average efficiency of 95%. (A minimal sketch of this two-phase pipeline appears after this list.)
Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.
Neural speaker recognition/verification system based on Kaldi and TensorFlow
Keras + PyTorch implementation of "Deep Learning & 3D Convolutional Neural Networks for Speaker Verification"
Speaker embedding for VI-SVC and VI-SVS, also for VITS; use this to replace the speaker ID to implement voice cloning.
Source Code for 'SECurity evaluation platform FOR Speaker Recognition' released in 'Defending against Audio Adversarial Examples on Speaker Recognition Systems'
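The two-phase pipeline described in the MLP-based entry above (feature extraction followed by recognition) is simple enough to sketch. The example below is a minimal illustration, not the repository's own code: it assumes librosa for MFCC extraction and scikit-learn's MLPClassifier as the back-propagation-trained recognizer, and all file names and speaker labels are hypothetical.

```python
# Minimal sketch of a two-phase speaker identification pipeline:
# phase 1 extracts features, phase 2 recognizes the speaker with an MLP.
# Library choices (librosa, scikit-learn) and file paths are assumptions.
import numpy as np
import librosa
from sklearn.neural_network import MLPClassifier

def extract_features(wav_path, sr=16000, n_mfcc=20):
    """Phase 1: load an utterance and reduce it to a fixed-size vector
    by averaging MFCCs over time (one common, simple choice)."""
    audio, _ = librosa.load(wav_path, sr=sr)
    mfcc = librosa.feature.mfcc(y=audio, sr=sr, n_mfcc=n_mfcc)  # (n_mfcc, frames)
    return mfcc.mean(axis=1)                                    # (n_mfcc,)

# Hypothetical enrollment data: a few utterances per known speaker.
enrollment = {
    "alice": ["alice_01.wav", "alice_02.wav"],
    "bob":   ["bob_01.wav", "bob_02.wav"],
}

X = np.stack([extract_features(p) for spk in enrollment for p in enrollment[spk]])
y = [spk for spk in enrollment for _ in enrollment[spk]]

# Phase 2: an MLP trained with back-propagation acts as the recognizer.
clf = MLPClassifier(hidden_layer_sizes=(64,), max_iter=2000, random_state=0)
clf.fit(X, y)

# Identification: compare a new sample against the enrolled voiceprints.
test_vec = extract_features("unknown_utterance.wav").reshape(1, -1)
print(clf.predict(test_vec)[0], clf.predict_proba(test_vec).max())
```

Averaging MFCCs over time discards temporal structure; systems like those listed above typically use richer representations (GMM supervectors, i-vectors, or learned d-vector/x-vector embeddings), but the enrollment-then-compare structure stays the same.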