Jiaxian Guo's starred repositories
alignment-handbook
Robust recipes to align language models with human and AI preferences
RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
LLaMA2-Accessory
An Open-source Toolkit for LLM Development
LLaVA-Plus-Codebase
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
lm-human-preference-details
RLHF implementation details of OAI's 2019 codebase
Machine-Learning-Interviews
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
LLM-FineTuning-Large-Language-Models
LLM (Large Language Model) FineTuning
ml-engineering
Machine Learning Engineering Open Book
self-instruct
Aligning pretrained language models with instruction data generated by themselves.
KwaiAgents
A generalized information-seeking agent system with Large Language Models (LLMs).
corr2cause
Data and code for the Corr2Cause paper (ICLR 2024)