ruihuihou's starred repositories
Chinese-LLaMA-Alpaca-2
Phase 2 of the Chinese LLaMA-2 & Alpaca-2 LLM project, plus 64K long-context models
AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
sft_datasets
A curated collection of open-source SFT datasets, updated on an ongoing basis
vocab-coverage
Analysis of the Chinese-language cognitive abilities of language models
CLUEDatasetSearch
Search all Chinese NLP datasets, with commonly used English NLP datasets included
ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Baichuan-7B
A large-scale 7B pretrained language model developed by BaiChuan-Inc.
Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca LLMs, with local CPU/GPU training and deployment
MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical LLMs, implementing incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.
promptsource
Toolkit for creating, sharing and using natural language prompts.
Prompt-Engineering-Guide
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
ColossalAI
Making large AI models cheaper, faster and more accessible
bert4keras
Keras implementation of transformers for humans
bert4torch
An elegant PyTorch implementation of transformers