wy192's repositories
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
cleanlab
The standard package for machine learning with noisy labels and finding mislabeled data. Works with most datasets and models.
CTranslate2
Fast inference engine for Transformer models
D-TDNN
PyTorch implementation of Densely Connected Time Delay Neural Network
DecryptLogin
DecryptLogin: APIs for loginning some websites by using requests.
deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System.
ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
espnet
End-to-End Speech Processing Toolkit
ETDNN
Code for paper in "ECAPA-TDNN Based Depression Detection from Clinical Speech"
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Leaderboard
SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.
mica-speech-activity-detection
Robust Speech Activity Detection (SAD) in movie audio
PaSST
Efficient Training of Audio Transformers with Patchout
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
pykaldi
A Python wrapper for Kaldi
pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Pytorch-TestRankIQA
RankIQA model files in Pytorch. Test RankIQA on TID2013 or LIVE dataset in Pytorch.
PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Resemblyzer
A python package to analyze and compare voices with deep learning
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity and Number Detector
SpeechAlgorithms
Speech Algorithms Collections
speechbrain
A PyTorch-based Speech Toolkit
spiders
Python爬虫,返回一定格式的信息,下载,使用flask提供简易api。抖音无水印、皮皮虾、快手、网易云音乐、qq音乐、咪咕音乐、荔枝FM音频、知乎视频、最右语音、视频、微博......
TCN
Sequence modeling benchmarks and temporal convolutional networks
voice_activity_detection
Voice Activity Detection based on Deep Learning & TensorFlow
wer_are_we
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.