Fangkai Jiao's starred repositories
gemma_pytorch
The official PyTorch implementation of Google's Gemma models
alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Awesome-Reasoning-Foundation-Models
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
ChunkLlama
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
zero-bubble-pipeline-parallelism
Zero Bubble Pipeline Parallelism
HallusionBench
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
Q-Instruct
②[CVPR 2024] Low-level visual instruction tuning, with a 200K dataset and a model zoo for fine-tuned checkpoints.
SimulateBench
GPT as Human
dpo-trajectory-reasoning
Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".