Bingchen Zhao's starred repositories
LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
efficient-kan
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
auto-code-rover
A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 15.95% tasks in full SWE-bench
torchtitan
A native PyTorch Library for large model training
agent-protocol
Common interface for interacting with AI agents. The protocol is tech stack agnostic - you can use it with any framework for building agents.
searchformer
Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".
visualwebarena
VisualWebArena is a benchmark for multimodal agents.
frequency_determines_performance
Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance"
llm-scheduling-artifact
Artifact of OSDI '24 paper, ”Llumnix: Dynamic Scheduling for Large Language Model Serving“