陈帅霖's starred repositories
json_repair
A python module to repair invalid JSON, commonly used to parse the output of LLMs
data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
Microsoft-Activation-Scripts
A Windows and Office activator using HWID / Ohook / KMS38 / Online KMS activation methods, with a focus on open-source code and fewer antivirus detections.
LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
Awesome-Reasoning-Foundation-Models
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
GPT-Fathom
GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well as OpenAI's earlier models on 20+ curated benchmarks under aligned settings.
alignment-handbook
Robust recipes to align language models with human and AI preferences
hallucination-leaderboard
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
awesome-hallucination-detection
List of papers on hallucination detection in LLMs.
Awesome-LLM-hallucination
LLM hallucination paper list
Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc. 天工系列模型在3.2TB高质量多语言和代码数据上进行预训练。我们开源了模型参数,训练数据,评估数据,评估方法。
self-instruct
Aligning pretrained language models with instruction data generated by themselves.
openai-cookbook
Examples and guides for using the OpenAI API
SpeculativeDecodingPapers
📰 Must-read papers and blogs on Speculative Decoding ⚡️