speaker-embedding

There are 9 repositories under speaker-embedding topic.

pyannote / pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
overlapped-speech-detection pretrained-models pytorch speaker-change-detection speaker-diarization speaker-embedding speaker-recognition speaker-verification speech-activity-detection speech-processing voice-activity-detection
Language:Jupyter Notebook 8652
diart
juanmc2005 / diart
A python package to build AI-powered real-time audio applications
deep-learning real-time speaker-diarization speaker-embedding streaming-audio transcription voice-activity-detection
Language:Python 1502
FluidAudio
FluidInference / FluidAudio
Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.
ane asr audio automatic-speech-recognition avfoundation coreml ios macos nvidia parakeet real-time speaker-diarization speaker-embedding speaker-identification speaker-recognition speech-to-text swift vad voice-activity-detection
Language:Swift 843
yistLin / dvector
Speaker embedding (d-vector) trained with GE2E loss
speaker-embedding ge2e pytorch dvector speaker-verification speaker-encoder torchscript
Language:Python 287
Walleclipse / Deep_Speaker-speaker_recognition_system
Keras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)
keras speaker-embedding speaker-recognition speech triplet-loss
Language:Python 252
Chris10M / Lip2Speech
A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.
deep-learning lip-reading speech-synthesis speaker-embedding lipreading real-time pytorch liptospeech
Language:Python 93
yuyq96 / D-TDNN
PyTorch implementation of Densely Connected Time Delay Neural Network
speech speaker-recognition speaker-verification speaker-embedding speaker-diarization speaker-adaptation time-delay-neural-network temporal-convolutional-network d-tdnn
Language:Python 89
juanmc2005 / SpeakerEmbeddingLossComparison
Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP 2020
speaker-verification additive-angular-margin-loss metric-learning end-to-end-machine-learning speaker-embedding sincnet x-vector pytorch
Language:Jupyter Notebook 60
ranchlai / awesome-speaker-embedding
A curated list of speaker-embedding speaker-verification, speaker-identification resources.
speaker-recognition speaker-verification speaker-embedding
51
swshon / voxceleb-ivector
Voxceleb1 i-vector based speaker recognition system
voxceleb voxceleb1 i-vector speaker-embedding speaker-recognition speaker-verification speaker-identification kaldi
Language:Perl 44
maxhollmann / voxceleb-luigi
Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments
voxceleb luigi speaker-verification speaker-embedding speaker-recognition
Language:Python 43
Picovoice / eagle
On-device speaker recognition engine powered by deep learning
speaker-identification speaker-recognition speaker-embedding
Language:Python 37
PiotrTa / Huawei-Challenge-Speaker-Identification
Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.
speaker-embedding speaker-identification speaker-verification speaker-recognition speech-processing voice-recognition voice-activity-detection
Language:Jupyter Notebook 36
iPRoBe-lab / 1D-Triplet-CNN
PyTorch implementation of the 1D-Triplet-CNN neural network model described in Fusing MFCC and LPC Features using 1D Triplet CNN for Speaker Recognition in Severely Degraded Audio Signals by A. Chowdhury, and A. Ross.
convolutional-neural-networks deep-learning speaker-embedding speaker-recognition speaker-verification
Language:Python 31
PlayVoice / VI-Speaker
Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.
vits speaker-embedding speaker-identification voice-clone
Language:Python 30
cvqluu / dropclass_speaker
DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
representation-learning speaker-recognition speaker-verification speaker-identification speaker-embedding dropout kaldi speaker-adaptation metalearning meta-learning machine-learning
Language:Python 22
bunyaminergen / awesome-speech-dataset
Awesome Speech Dataset, including download links and a brief explanation for each resource. These datasets provide diverse and high-quality speech data covering various domains such as conversational, academic, political, and more.
speech-processing speaker-diarization speaker-embedding speaker-identification speaker-recognition speaker-verification speech speech-analysis speech-emotion-recognition speech-enhancement speech-recognition speech-to-text text-to-speech
18
ductuantruong / enskd
Official implementation of the ICASSP 2024 paper: Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification
speaker-verification voxceleb speaker-embedding speaker-identification speaker-recognition
Language:Python 16
zabir-nabil / awesome-speaker-recognition-verification
A curated list of awesome speaker recognition/verification papers, projects, datasets, and competition.
speaker-recognition speaker-verification speaker-identification speaker speaker-embedding deep-learning awesome-list machine-learning
15
SEERNET / Voice-Prints
Create speaker voiceprints from a few seconds of audio. And, identify individuals in real-time streaming or recorded conversations.
speaker-verification speaker-identification speaker-recognition speaker-embedding voice-authentication
14
arhtur007 / Angular-Triplet-Center-Loss
Angular triplet center loss implementation in Pytorch.
pytorch loss-functions loss-function face-recognition face-verification speaker-recognition speaker-verification speaker-embedding speaker-embeddings metric-learning
Language:Python 13
Chaanks / stklia
simple version of our torch kaldi toolkit, developed at the LIA by 2 apprentices. (@Chaanks & @vbrignatz)
representation-learning speaker-recognition speaker-verification speaker-identification speaker-embedding kaldi kaldi-asr deep-learning pytorch resnet
Language:Python 10
warisqr007 / vq-ppg-vc
Vector Quantized PPGs based Voice conversion
prosody prosody-transfer speaker-embedding voice-conversion acoustic-model phonetic-posteriorgram transformer
Language:Jupyter Notebook 8
xx205 / voxsrc2020_speaker_verification
This project partially embodies the state-of-the-art practices in speaker verification technology up until 2020, while attaining the state-of-the-art performance on the VoxCeleb1 test sets.
deep-learning dpn res2net speaker-embedding speaker-verification tdnn tensorflow voxceleb voxceleb1 voxceleb2 voxsrc
Language:Python 7
satvik-dixit / MFCon
Code for the paper: Improving Speaker Representations Using Contrastive Losses on Multi-scale Features
contrastive-loss speaker-embedding speaker-verification multiscale-feature-aggregation
Language:Python 6
ZhaZhaFon / repo_voxcelebtrainer
说话人识别仓库-说话人表征-ResNet/VGGVox || a ready-to-use repo for Speaker Verification / Speaker Embedding with xvector
speaker-representatoin speaker-verification metric-learning rvector xvector speaker-embedding voxceleb
Language:Python 5
deep-privacy / sidekit
For further release go to: https://git-lium.univ-lemans.fr/speaker/sidekit
speaker-recognition speaker-verification speaker-identification speaker-embedding xvector automatic-speaker-verification asv
Language:Python 2
tzhengtek / saute
SAUTE is a lightweight transformer-based architecture adapted for dialog modeling
linear-attention machine-learning masked-language-models speaker-embedding transformers tsinghua-university
Language:Python 2
bghorvath / fastClusteringDiarizer
Fast clustering of speaker embeddings for multifile speaker diarization with reappearing speakers
agglomerative-clustering clustering knn-classification speaker-diarization speaker-embedding
Language:Jupyter Notebook 1
hujinsen / PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
speaker-verification speaker-embedding
Language:Python 1
Noah4575 / ai-debate-analysis
End-of-studies group project : Pipeline for analyzing political debates with speaker diarization, overlap detection, transcription, speaker identification, and playback. I worked on the speaker identificatio, the pipeline and playback.
deep-learning signal-processing speaker-diarization speaker-embedding speaker-identification speech-to-text
Language:Jupyter Notebook 1
z3lx / speaker-identification
Speaker identification on audio files using the pyannote/embedding model.
python speaker-embedding speaker-identification speech-processing
Language:Python 0
Ashly1991 / ecapa-ig-speaker-anonymization
ECAPA-TDNN + Integrated Gradients to explain speaker verification and the impact of pitch-shift anonymization on LibriSpeech (with EER and IG heatmaps)
ecapa-tdnn machine-learning mel-spectrogram privacy speaker-anonymization speaker-embedding speaker-recognition speaker-verification speechbrain voxceleb captum explainable-ai integrated-gradients librispeech pytorch speech-processing
Language:Jupyter Notebook
inria-defense / balr
Binary-Attribute based likelihood ratio estimation for explainable speaker recognition
binary-auto-encoder deep-neural-networks explainable-ai pytorch speaker-attributes speaker-embedding speaker-recognition
Language:Python
wa3dbk / wespeaker-finetuning
Fine-tuning scripts for WeSpeaker models (Speaker Verification, Recognition and Diarization Toolkit)
fine-tuning finetuning speaker-embedding speaker-identification speaker-recognition speaker-verification
Language:Python
ZhaZhaFon / repo_dvector
说话人识别仓库-说话人表征-dvector || a ready-to-use repo for Speaker Verification / Speaker Embedding with dvector
speaker-representation speaker-verification dvector speaker-embedding
Language:Python

speaker-embedding

pyannote / pyannote-audio

juanmc2005 / diart

FluidInference / FluidAudio

yistLin / dvector

Walleclipse / Deep_Speaker-speaker_recognition_system

Chris10M / Lip2Speech

yuyq96 / D-TDNN

juanmc2005 / SpeakerEmbeddingLossComparison

ranchlai / awesome-speaker-embedding

swshon / voxceleb-ivector

maxhollmann / voxceleb-luigi

Picovoice / eagle

PiotrTa / Huawei-Challenge-Speaker-Identification

iPRoBe-lab / 1D-Triplet-CNN

PlayVoice / VI-Speaker

cvqluu / dropclass_speaker

bunyaminergen / awesome-speech-dataset

ductuantruong / enskd

zabir-nabil / awesome-speaker-recognition-verification

SEERNET / Voice-Prints

arhtur007 / Angular-Triplet-Center-Loss

Chaanks / stklia

warisqr007 / vq-ppg-vc

xx205 / voxsrc2020_speaker_verification

satvik-dixit / MFCon

ZhaZhaFon / repo_voxcelebtrainer

deep-privacy / sidekit

tzhengtek / saute

bghorvath / fastClusteringDiarizer

hujinsen / PyTorch_Speaker_Verification

Noah4575 / ai-debate-analysis

z3lx / speaker-identification

Ashly1991 / ecapa-ig-speaker-anonymization

inria-defense / balr

wa3dbk / wespeaker-finetuning

ZhaZhaFon / repo_dvector