koalazf99

Fan's starred repositories

aider

aider is AI pair programming in your terminal

Language:PythonApache-2.01503800

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.0454400

MAmmoTH

Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)

Language:Jupyter Notebook30100

LoRA-GA

Language:Python7900

PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Language:PythonApache-2.0352500

CompilerGym

Reinforcement learning environments for compiler and program optimization tasks

Language:PythonMIT88800

Minitron

A family of compressed models obtained via pruning and knowledge distillation

7300

scaling-with-vocab

📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Language:Python3500

persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Language:Python64900

weak-to-strong-reasoning

Language:Python3000

SciCode

A benchmark that challenges language models to code solutions for scientific problems

Language:PythonApache-2.05800

ENVISIONS

A Neural-Symbolic Self-Training Framework

Language:C9100

lean4game

Server to host lean games.

Language:TypeScriptGPL-3.015300

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonApache-2.01336300

OpenWebMath

Language:XSLTApache-2.09900

anole

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Language:Python55700

regmix

🧬 RegMix: Data Mixture as Regression for Language Model Pre-training

Language:Jupyter NotebookMIT5300

gptpdf

Using GPT to parse PDF

Language:PythonMIT251300

Phi-3CookBook

This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.

Language:Jupyter NotebookMIT141100

koalazf99

Fan's starred repositories

aider

seeclick-crawler

segment-anything-2

MAmmoTH

LoRA-GA

PDF-Extract-Kit

CompilerGym

Minitron

scaling-with-vocab

persona-hub

weak-to-strong-reasoning

SciCode

ENVISIONS

lean4game

unsloth

OpenWebMath

anole

regmix

gptpdf

Phi-3CookBook

cambrian

flash-attention

aqt

magentic

remiss-jailbreak

LLM101n

OpenRLHF

DL4TP

trafilatura

Awesome-DataCentric-LLM