Fan (koalazf99)

koalazf99

Geek Repo

Company:Shanghai Jiao Tong University

Location:Shanghai

Home Page:koalazf99.github.io

Twitter:@FaZhou_998

Github PK Tool:Github PK Tool


Organizations
OpenLemur

Fan's starred repositories

aider

aider is AI pair programming in your terminal

Language:PythonLicense:Apache-2.0Stargazers:15038Issues:0Issues:0
Language:PythonStargazers:9Issues:0Issues:0

segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4544Issues:0Issues:0

MAmmoTH

Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)

Language:Jupyter NotebookStargazers:301Issues:0Issues:0
Language:PythonStargazers:79Issues:0Issues:0

PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Language:PythonLicense:Apache-2.0Stargazers:3525Issues:0Issues:0

CompilerGym

Reinforcement learning environments for compiler and program optimization tasks

Language:PythonLicense:MITStargazers:888Issues:0Issues:0

Minitron

A family of compressed models obtained via pruning and knowledge distillation

Stargazers:73Issues:0Issues:0

scaling-with-vocab

📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Language:PythonStargazers:35Issues:0Issues:0

persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Language:PythonStargazers:649Issues:0Issues:0
Language:PythonStargazers:30Issues:0Issues:0

SciCode

A benchmark that challenges language models to code solutions for scientific problems

Language:PythonLicense:Apache-2.0Stargazers:58Issues:0Issues:0

ENVISIONS

A Neural-Symbolic Self-Training Framework

Language:CStargazers:91Issues:0Issues:0

lean4game

Server to host lean games.

Language:TypeScriptLicense:GPL-3.0Stargazers:153Issues:0Issues:0

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:13363Issues:0Issues:0
Language:XSLTLicense:Apache-2.0Stargazers:99Issues:0Issues:0

anole

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Language:PythonStargazers:557Issues:0Issues:0

regmix

🧬 RegMix: Data Mixture as Regression for Language Model Pre-training

Language:Jupyter NotebookLicense:MITStargazers:53Issues:0Issues:0

gptpdf

Using GPT to parse PDF

Language:PythonLicense:MITStargazers:2513Issues:0Issues:0

Phi-3CookBook

This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.

Language:Jupyter NotebookLicense:MITStargazers:1411Issues:0Issues:0

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language:PythonLicense:Apache-2.0Stargazers:1612Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:12669Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:227Issues:0Issues:0

magentic

Seamlessly integrate LLMs as Python functions

Language:PythonLicense:MITStargazers:1847Issues:0Issues:0
Language:PythonStargazers:13Issues:0Issues:0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers:26156Issues:0Issues:0

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language:PythonLicense:Apache-2.0Stargazers:1848Issues:0Issues:0

DL4TP

[COLM 2024] A Survey on Deep Learning for Theorem Proving

License:MITStargazers:98Issues:0Issues:0

trafilatura

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Language:PythonLicense:Apache-2.0Stargazers:3256Issues:0Issues:0

Awesome-DataCentric-LLM

trending projects & awesome papers about data-centric llm studies.

Stargazers:8Issues:0Issues:0