dongqi shen's repositories
gemini2openai
This project converts the Gemini Embedding API into a format compatible with OpenAI's API and deploys it on Cloudflare, so it can be used free of charge through the OpenAI Python library.
AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
awesome-cheatsheets
👩‍💻👨‍💻 Awesome cheatsheets for popular programming languages, frameworks and development tools. They include everything you should know in one single file.
dongqishen.github.io
Dongqi's Leisure Time
dongqishen.github.io_2
Dongqi's leisure time.
mlx-examples
Examples in the MLX framework
ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). Get your own cross-platform ChatGPT/Gemini app with one click.
edgetunnel
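A modification of the original that displays VLESS configuration information converted into subscription content. With this script, you can easily use an online configuration converter to bring VLESS configuration information into tools such as Clash or Singbox.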
fastllm
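A pure C++ cross-platform LLM acceleration library that can be called from Python. ChatGLM-6B-class models can reach 10000+ tokens/s on a single GPU. Supports GLM, LLaMA, and MOSS base models and runs smoothly on mobile devices.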
FlashAttention-PyTorch
Implementation of FlashAttention in PyTorch
ggml
Tensor library for machine learning
gptpdf
Using GPT to parse PDF
gradio
Create UIs for your machine learning model in Python in 3 minutes
igemm
igemm tutorial
iLLM
Implementing an LLM from scratch. (In development...)
ips
Preferred IPs
llama.cpp
Port of Facebook's LLaMA model in C/C++
llm.c
LLM training in simple, raw C/CUDA
llm_interview_note
Notes mainly covering the knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers
llvm-tutor
A collection of out-of-tree LLVM passes for teaching and learning
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
one-api
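OpenAI API management and distribution system supporting Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot, iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It can be used to manage and redistribute keys, uses a single API for all LLMs, ships as a single executable with a prebuilt Docker image for one-click, out-of-the-box deployment, and features an English UI.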
onnxruntime-inference-examples
Examples for using ONNX Runtime for machine learning inferencing.
tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
tvm-cn
TVM documentation in Simplified Chinese
WorkerVless2sub
A subscription generator that pairs Cloudflare Workers - VLESS with self-hosted preferred domains