Hu Hengyu's starred repositories
pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
MossFormer2
This is the audio sample repository for speech separation model "MossFormer2".
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
NSD-MA-MSE
A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"
LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
self-refine
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
FastGPT
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.
chaoxing-sign-cli
超星学习通签到:支持普通签到、拍照签到、手势签到、位置签到、二维码签到,支持自动监测、QQ机器人签到与推送。
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
project-based-learning
Curated list of project-based tutorials