Fan (koalazf99)

koalazf99

Geek Repo

Company:Shanghai Jiao Tong University

Location:Shanghai

Home Page:koalazf99.github.io

Twitter:@FaZhou_998

Github PK Tool:Github PK Tool


Organizations
OpenLemur

Fan's starred repositories

LLM101n

LLM101n: Let's build a Storyteller

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:12347Issues:87Issues:569

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:11892Issues:103Issues:864

trafilatura

Python & command-line tool to gather text on the Web: Crawling & scraping, content extraction, metadata. TXT, Markdown, CSV & XML output.

Language:PythonLicense:Apache-2.0Stargazers:3189Issues:30Issues:336
Language:PythonLicense:Apache-2.0Stargazers:2444Issues:31Issues:23

gptpdf

Using GPT to parse PDF

magentic

Seamlessly integrate LLMs as Python functions

Language:PythonLicense:MITStargazers:1810Issues:13Issues:58

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language:PythonLicense:Apache-2.0Stargazers:1720Issues:22Issues:176

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language:PythonLicense:Apache-2.0Stargazers:1445Issues:17Issues:14

DeepSeek-Coder-V2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Phi-3CookBook

This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.

Language:Jupyter NotebookLicense:MITStargazers:1223Issues:12Issues:22
Language:PythonLicense:Apache-2.0Stargazers:216Issues:6Issues:25

magpie

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing"

Language:PythonLicense:MITStargazers:168Issues:5Issues:11

anole

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Language:PythonStargazers:155Issues:0Issues:0

dclm

DataComp for Language Models

Language:HTMLLicense:MITStargazers:153Issues:26Issues:11

vscode-lean4

Visual Studio Code extension for the Lean 4 proof assistant

Language:TypeScriptLicense:Apache-2.0Stargazers:137Issues:13Issues:193

bigcodebench

BigCodeBench: The Next Generation of HumanEval

Language:PythonLicense:Apache-2.0Stargazers:116Issues:5Issues:9
Language:XSLTLicense:Apache-2.0Stargazers:95Issues:3Issues:2

DL4TP

A Survey on Deep Learning for Theorem Proving

License:MITStargazers:89Issues:5Issues:0

OlympicArena

This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI"

Language:JavaScriptStargazers:69Issues:0Issues:0
Language:PythonLicense:MITStargazers:39Issues:4Issues:0
Language:PythonLicense:Apache-2.0Stargazers:29Issues:0Issues:0

regmix

[arXiv 2024] RegMix: Data Mixture as Regression for Language Model Pre-training

Language:Jupyter NotebookLicense:MITStargazers:28Issues:0Issues:0

agent-attack

[Arxiv 2024] Adversarial Attacks on Multimodal Agents

Language:PythonLicense:MITStargazers:24Issues:0Issues:0

MoPS

[ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation"

Language:Jupyter NotebookStargazers:19Issues:0Issues:0

Spider2-V

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:18Issues:2Issues:0
Language:PythonStargazers:15Issues:2Issues:0

tpu_pod_commander

TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.

Language:PythonLicense:Apache-2.0Stargazers:8Issues:0Issues:0

Awesome-DataCentric-LLM

trending projects & awesome papers about data-centric llm studies.

Stargazers:7Issues:0Issues:0