There are 5 repositories under speaker-embedding topic.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A python package to build AI-powered real-time audio applications
Keras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)
A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.
Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP 2020
A curated list of speaker-embedding speaker-verification, speaker-identification resources.
Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments
Voxceleb1 i-vector based speaker recognition system
Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.
Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.
PyTorch implementation of the 1D-Triplet-CNN neural network model described in Fusing MFCC and LPC Features using 1D Triplet CNN for Speaker Recognition in Severely Degraded Audio Signals by A. Chowdhury, and A. Ross.
DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
Create speaker voiceprints from a few seconds of audio. And, identify individuals in real-time streaming or recorded conversations.
Angular triplet center loss implementation in Pytorch.
A curated list of awesome speaker recognition/verification papers, projects, datasets, and competition.
This project partially embodies the state-of-the-art practices in speaker verification technology up until 2020, while attaining the state-of-the-art performance on the VoxCeleb1 test sets.
Vector Quantized PPGs based Voice conversion
说话人识别仓库-说话人表征-ResNet/VGGVox || a ready-to-use repo for Speaker Verification / Speaker Embedding with xvector
Fast clustering of speaker embeddings for multifile speaker diarization with reappearing speakers
For further release go to: https://git-lium.univ-lemans.fr/speaker/sidekit
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
说话人识别仓库-说话人表征-dvector || a ready-to-use repo for Speaker Verification / Speaker Embedding with dvector