zakahan's starred repositories
SenseVoice
Multilingual Voice Understanding Model
BlindWatermark
使用盲水印保护创作者的知识产权using invisible watermark to protect creator's intellectual property
midieditor
Provides an interface to edit, record, and play Midi data
TableQAKit
A Toolkit for Table-based Question Answering
unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Awesome-Tabular-LLMs
We collect papers about "large language models (LLM) for table-related tasks", e.g., using LLM for Table QA task. “表格+LLM”相关论文整理
python-pinyin
汉字转拼音(pypinyin)
CharsiuG2P
Multilingual G2P in 100 languages
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
stable-diffusion-webui
Stable Diffusion web UI