Fan's starred repositories
flash-attention
Fast and memory-efficient exact attention
trafilatura
Python & command-line tool to gather text on the Web: Crawling & scraping, content extraction, metadata. TXT, Markdown, CSV & XML output.
DeepSeek-Coder-V2
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Phi-3CookBook
A cookbook for getting started with Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and the next size up across a variety of language, reasoning, coding, and math benchmarks.
vscode-lean4
Visual Studio Code extension for the Lean 4 proof assistant
bigcodebench
BigCodeBench: The Next Generation of HumanEval
OlympicArena
This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI"
agent-attack
[arXiv 2024] Adversarial Attacks on Multimodal Agents
tpu_pod_commander
TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.
Awesome-DataCentric-LLM
Trending projects and awesome papers about data-centric LLM studies.