Ma-Dan's starred repositories
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
everyone-can-use-english
人人都能用英语
latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
pycorrector
pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,LLaMA等模型应用在纠错场景,开箱即用。
GeminiProChat
Minimal web UI for GeminiPro.
ml-interviews-book
https://huyenchip.com/ml-interviews-book/
gowebsocket
golang基于websocket单台机器支持百万连接分布式聊天(IM)系统
KuiperInfer
带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step
KG-demo-for-movie
从无到有构建一个电影知识图谱,并基于该KG,开发一个简易的KBQA程序。
vocal-separate
an extremely simple tool for separating vocals and background music, completely localized for web operation, using 2stems/4stems/5stems models 这是一个极简的人声和背景音乐分离工具,本地化网页操作,无需连接外网
KnowledgeGraph
史上最大规模1.4亿知识图谱数据免费下载,知识图谱,通用知识图谱,融合了两千五百多万的实体,拥有亿级别的实体属性关系。
ChatLM-mini-Chinese
中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。
3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
awesome-drones-zh
无人机资源汇总
whisper-trtllm
Whisper in TensorRT-LLM
Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.