Ma-Dan's repositories
llama2.c-to-ncnn
A converter for llama2.c legacy models to ncnn models.
ChatLM-mini-Chinese
中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调。
CUDA-Learn-Note
🎉CUDA 笔记 / 大模型手撕CUDA / C++笔记,个人笔记,更新随缘: flash_attn、sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.
ChatGLM-MNN
Pure C++, Easy Deploy ChatGLM-6B.
ColossalAI
Making large AI models cheaper, faster and more accessible
DAIL-SQL
A efficient and effective few-shot NL2SQL method on GPT-4.
DB-GPT
Revolutionizing Database Interactions with Private LLM Technology
FastChat
The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"
fastllm
纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行
Fooocus
Focus on prompting and generating
InferLLM
a lightweight LLM model inference framework
k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
llama2.c
Inference Llama 2 in one file of pure C
ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
Open-Sora-Plan
This project aim to reproducing Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
PythonRobotics
Python sample codes for robotics algorithms.
QAnything
Question and Answer based on Anything.
RTK
Reconstruction Toolkit
sherpa
Speech-to-text server framework with next-gen Kaldi
sherpa-ncnn
Real-time speech recognition using next-gen Kaldi with ncnn
TNN
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and per
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
WeTextProcessing
Text Normalization & Inverse Text Normalization
whisper
Robust Speech Recognition via Large-Scale Weak Supervision