Fangkai Jiao's starred repositories
gemma_pytorch
The official PyTorch implementation of Google's Gemma models
DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
ChunkLlama
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
zero-bubble-pipeline-parallelism
Zero Bubble Pipeline Parallelism
HallusionBench
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
Inflection-Benchmarks
Public Inflection Benchmarks
llm-planning-eval
Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
LLMSanitize
An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).
dpo-trajectory-reasoning
Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".
SimulateBench
GPT as Human