Lichang Chen's repositories
InstructZero
Official implementation of InstructZero, the first framework to optimize poor prompts for ChatGPT (API LLMs) into good prompts!
claude2-alpaca
First instruction-tuning dataset distilled from Claude2 (52k Alpaca prompts)!
reward-trl
Train transformer language models with reinforcement learning.
Chain-of-ThoughtsPapers
A trend that started with "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
DeepLearning-500-questions
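Deep Learning 500 Questions: explains frequently asked topics in probability, linear algebra, machine learning, deep learning, computer vision, and related areas in question-and-answer form, to help the author and any readers who need it. The book has 18 chapters and nearly 300,000 characters. Given the author's limited expertise, readers are kindly asked to point out any errors in the book. To be continued... For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be prosecuted. Tan 2018.06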
Lichang-Chen.github.io
The GitHub personal webpage for Lichang Chen.
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
LLaVA-RLHF
Aligning LMMs with Factually Augmented RLHF
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
zero_shot_cot
Prod Env
system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.