speaker-recognition

There are 69 repositories under the speaker-recognition topic.

NVIDIA-NeMo / NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
asr deeplearning generative-ai large-language-models machine-translation multimodal neural-networks speaker-diariazation speaker-recognition speech-synthesis speech-translation tts
Language:Python 16057
speechbrain / speechbrain
A PyTorch-based Speech Toolkit
asr audio audio-processing deep-learning huggingface language-model pytorch speaker-diarization speaker-recognition speaker-verification speech-enhancement speech-processing speech-recognition speech-separation speech-to-text speech-toolkit speechrecognition spoken-language-understanding transformers voice-recognition
Language:Python 10746
pyannote / pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
overlapped-speech-detection pretrained-models pytorch speaker-change-detection speaker-diarization speaker-embedding speaker-recognition speaker-verification speech-activity-detection speech-processing voice-activity-detection
Language:Jupyter Notebook 8651
google / uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
speaker-diarization uis-rnn speaker-recognition supervised-learning clustering supervised-clustering machine-learning
Language:Python 1586
mravanelli / SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
deep-learning audio waveform filtering cnn convolutional-neural-networks speaker-recognition speaker-verification speaker-identification speech-recognition asr audio-processing speech-processing digital-signal-processing signal-processing neural-networks artificial-intelligence timit pytorch python
Language:Python 1202
clovaai / voxceleb_trainer
In defence of metric learning for speaker recognition
metric-learning speaker-recognition speaker-verification voxceleb
Language:Python 1143
yeyupiaoling / VoiceprintRecognition-Pytorch
This project uses a variety of advanced voiceprint recognition models, such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and more models may be supported in the future. It also supports the MelSpectrogram and Spectrogram data preprocessing methods.
arcface ecapa-tdnn pytorch speaker-recognition voice-recognition
Language:Python 1122
wenet-e2e / wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
asv campplus cnceleb dino ecapa-tdnn eres2net nist-sre plda production-ready pytorch redimnet repvgg resnet speaker-diarization speaker-recognition speaker-verification ssl voxceleb wavlm xvector
Language:Python 1074
athena-team / athena
An open-source implementation of a sequence-to-sequence based speech processing engine
speech-recognition asr transformer tensorflow ctc unsupervised-learning sequence-to-sequence deployment wfst speaker-recognition tts speech-synthesis
Language:C++ 963
FluidInference / FluidAudio
Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.
coreml ios macos speaker-diarization speaker-embedding speaker-identification speaker-recognition swift audio avfoundation real-time vad voice-activity-detection asr automatic-speech-recognition speech-to-text parakeet ane nvidia
Language:Swift 843
astorfi / 3D-convolutional-speaker-recognition
Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
3d convolutional-neural-networks deep-learning speaker-recognition
Language:Python 788
TaoRuijie / ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)
ecapa-tdnn speaker-recognition speaker-verification voxceleb1 voxceleb2
Language:Python 747
cvqluu / Angular-Penalty-Softmax-Losses-Pytorch
Angular penalty loss functions in Pytorch (ArcFace, SphereFace, Additive Margin, CosFace)
metric-learning pytorch loss-functions loss-function embedding face-verification fashion-mnist fmnist-dataset face-recognition speaker-recognition sphereface arcface normface am-softmax
Language:Python 494
taylorlu / Speaker-Diarization
speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
uis-rnn vgg-speaker-recognition ghostvlad speaker-diarization speaker-recognition
Language:Python 494
google / speaker-id
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
speaker-recognition source-separation speaker-diarization speaker-verification speaker-identification
Language:Python 434
nuaazs / VAF_2
Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.
antifraud microservices speaker-diarization speaker-recognition speech-recognition
Language:Python 395
speechbrain / speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
deep-learning speech-recognition speech-to-text speech speech-processing speaker-recognition speaker-verification speaker-identification speech-separation speech-emotion-recognition speechrecognition speech-recognizer deeplearning neural-network neural-networks beamforming timit librispeech speech-analysis speech-api
Language:HTML 371
SamirPaulb / real-time-voice-translator
A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.
deep-translator final-year-project googletranslator gtts gui linguasync machine-learning ml playsound python real-time-transcription speaker-recognition speech-to-speech speech-to-text speechrecognition text-to-speech tkinter translates-audio translation voice-translator
Language:Tcl 349
yeyupiaoling / VoiceprintRecognition-Tensorflow
Voiceprint recognition implemented with TensorFlow
tensorflow voice-recognition arcface speaker-recognition
Language:Python 324
manojpamk / pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
speaker-embeddings speaker-verification speaker-recognition speaker-diarization
Language:Python 320
yeyupiaoling / VoiceprintRecognition-PaddlePaddle
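This project uses a variety of advanced voiceprint recognition models, such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and also supports multiple data preprocessing methods, including MelSpectrogram, Spectrogram, MFCC, and Fbank.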
paddlepaddle voice-recognition arcface speaker-recognition ecapa-tdnn
Language:Python 288
crouchred / speaker-recognition-py3
Based on MFCC and GMM (speech recognition based on MFCC and Gaussian mixture models)
speaker-recognition voice-recognition machine-learning
Language:Python 252
Walleclipse / Deep_Speaker-speaker_recognition_system
Keras implementation of "Deep Speaker: an End-to-End Neural Speaker Embedding System" (speaker recognition)
keras speaker-embedding speaker-recognition speech triplet-loss
Language:Python 252
NavodPeiris / speechlib
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names.
ai automatic-speech-recognition faster-whisper speaker-diarization speaker-recognition speaker-verification transcription whisper-ai
Language:Python 239
Atul-Anand-Jha / Speaker-Identification-Python
Speaker Identification System (up to 100% accuracy); built using Python 2.7 and python_speech_features library
python-2 speaker-recognition speaker-identification
Language:Python 211
jymsuper / SpeakerRecognition_tutorial
Simple d-vector based Speaker Recognition (verification and identification) using Pytorch
deep-learning pytorch speaker-identification speaker-recognition speaker-verification
Language:Python 211
VITA-Group / AutoSpeech
[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang
automl autospeech neural-architecture-search speaker-recognition pytorch
Language:Python 208
cvqluu / TDNN
Time delay neural network (TDNN) implementation in Pytorch using unfold method
asr pytorch speaker-diarization speaker-recognition speaker-verification speech-processing speech-recognition tdnn x-vector
Language:Python 203
IBM-Cloud / chatbot-watson-android
An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.
watson android chatbot conversation-service ibm-watson-services intent entity speaker-recognition speaker-diarization watson-services android-studio conversation speech dialog ibm-cloud ibm-watson workspace java ibm-cloud-solutions
Language:Java 199
lihanghang / CASR-DEMO
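A Flask web-based demo system for Chinese automatic speech recognition, including speech recognition, speech synthesis, and speaker recognition (a voiceprint recognition task).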
flask-application speech-to-text ctc baidu-aip pyaudio speaker-recognition gmm casr-demo
Language:CSS 171
oscarknagg / voicemap
Identifying people from small audio fragments
machine-learning speaker-identification speaker-recognition convolutional-neural-networks
Language:Python 170
Speaker-Identification / You-Only-Speak-Once
Deep Learning - one shot learning for speaker recognition using Filter Banks
triplet-loss siamese-networks speaker-recognition voice-authentication neural-network one-shot-learning audio speech deep-speaker speaker-identification deep-learning
Language:Jupyter Notebook 170
cvqluu / Factorized-TDNN
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
kaldi tdnn tdnn-f pytorch speech-recognition speaker-recognition acoustic-model neural-network neural-networks speaker-diarization speaker-verification x-vector embedding factorized-tdnn acoustic-models
Language:Python 149
yeyupiaoling / VoiceprintRecognition-Keras
Voiceprint recognition model implemented with Keras
speaker-recognition kersa tensorflow deep-learning voice-recognition
Language:Python 147
jefflai108 / pytorch-kaldi-neural-speaker-embeddings
Lightweight neural speaker embedding extraction based on Kaldi and PyTorch.
speaker-verification speaker-recognition speech-processing speaker-identification pytorch kaldi learnable-dictionary-encoding
Language:Perl 136
Anwarvic / Speaker-Recognition
This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1
sidekit speaker-recognition speaker-verification speaker-identification gmm gmm-ubm i-vector ubm identity-verification identity-vector
Language:Python 114
