Cao Lijun's starred repositories
PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
byte-unixbench
Automatically exported from code.google.com/p/byte-unixbench
libnvidia-container
NVIDIA container runtime library
so-large-lm
大模型基础: 一文了解大模型基础知识
dive-into-llms
《动手学大模型Dive into LLMs》系列编程实践教程
LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
llms-from-scratch-cn
仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理
it-tools-zh_CN
为开发人员提供的方便的在线工具集合,具有出色的用户体验。【深度适配简体中文】
Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc. 天工系列模型在3.2TB高质量多语言和代码数据上进行预训练。我们开源了模型参数,训练数据,评估数据,评估方法。
opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
chinese-llm-benchmark
中文大模型能力评测榜单:目前已囊括106个大模型,覆盖chatgpt、gpt4o、百度文心一言、阿里通义千问、讯飞星火、商汤senseChat、minimax等商用模型, 以及百川、qwen2、glm4、yi、书生internLM2、llama3等开源大模型,多维度能力评测。不仅提供能力评分排行榜,也提供所有模型的原始输出结果!
Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
llm-inference-benchmark
LLM Inference benchmark