Sheng Zhao's starred repositories
py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
CaptainBlackboard
船长关于机器学习、计算机视觉和工程技术的总结和分享
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
RESTFUL_ASR
基于wenet的短时在线语音识别服务
voxceleb_trainer
In defence of metric learning for speaker recognition
speaker-verification
Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN
MedSegDiff
Medical Image Segmentation with Diffusion Model
speechbrain
A PyTorch-based Speech Toolkit
PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
wordpress-github-sync
A WordPress plugin to sync content with a GitHub repository (or Jekyll site)
git-it-write
A WordPress plugin to publish markdown files present in a Github repository as posts to WordPress automatically.
inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
Deep-Learning-Papers-Reading-Roadmap
Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!