Quan Wang's repositories
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
SpectralCluster
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
VoiceIdentityBook
《声纹技术:从核心算法到工程实践》
SpeakerRecognitionFromScratch
Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家
video-average-frame
Python tool to compute the average frame of a video.
CurriculumVitae
Curriculum Vitae of Quan Wang
VB_diarization
VB Diarization with Eigenvoice and HMM Priors, refactored
HMRF-EM-image
Implementation of the Hidden Markov Random Field Model and its Expectation-Maximization Algorithm
SpeakerVerSim
Python-based simulation framework for different version control strategies of speaker recognition systems.
dynamic_time_warping
This package implements Dynamic Time Warping (DTW).
FlaxSpeaker
Speaker recognition in Flax
DecisionForest
Decision Tree and Decision Forest for Matlab
my_linux_config
My configs for Linux
colortimelog
Logging elapsed time and errors in colors
d2
D2 is a modern diagram scripting language that turns text to diagrams.
Fast-Gradient-Vector-Flow
This package implements the Gradient Vector Flow (GVF) in C++/MEX.
lingvo
Lingvo
minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
SimpleMatrix
SimpleMatrix is an extremely lightweight matrix library, containing a single header file.
speaker-id
This repository contains audio samples and supplementary materials accompanying publications related to the speaker-id team at Google.
word_levenshtein
Levenshtein algorithm in C++ for Python