Lichang Chen's starred repositories
llama_index
LlamaIndex is a data framework for your LLM applications
flash-attention
Fast and memory-efficient exact attention
alignment-handbook
Robust recipes to align language models with human and AI preferences
alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
TruthfulQA
TruthfulQA: Measuring How Models Imitate Human Falsehoods
instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
gpt_paper_assistant
GPT4 based personalized ArXiv paper assistant bot
NeMo-Megatron-Launcher
NeMo Megatron launcher and tools
DHS-LLM-Workshop
DHS 2023 LLM Workshop by Sourab Mangrulkar
HallusionBench
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
InstructZero
Official Implementation of InstructZero; the first framework to optimize bad prompts of ChatGPT(API LLMs) and finally obtain good prompts!
alpaca-qlora
Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA
Reflection_Tuning
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
AlpaGasus2-QLoRA
This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!
claude2-alpaca
First instruction-tuning dataset distilled from Claude2 (52k Alpaca prompts)!
sudo-Boris.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes