xiaohou's repositories
KWS_Max-pooling_RHE
Mining effective negative training samples for keyword spotting (PyTorch)
Audiomer-PyTorch
A Convolutional Transformer for Keyword Spotting
SenseVoice
Multilingual Voice Understanding Model
ChatLaw
中文法律大模型
chinese_speech_pretrain
chinese speech pretrained models
ChineseLyrics
10W首中文歌词数据库
espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
phonemizer
Simple text to phones converter for multiple languages
Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
speechbrain
A PyTorch-based Speech Toolkit
tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
whisper.cpp
Port of OpenAI's Whisper model in C/C++