QinLuo's starred repositories
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
llm-foundry
LLM training code for Databricks foundation models
h2o-llmstudio
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://h2oai.github.io/h2o-llmstudio/
state-of-open-source-ai
:closed_book: Clarity in the current fast-paced mess of Open Source innovation
Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model weights, training data, evaluation data, and evaluation methods.
local-persist
Create named local volumes that persist in the location(s) you want
rl_a3c_pytorch
A3C LSTM for Atari with PyTorch, plus the A3G design
textbook_quality
Generate textbook-quality synthetic LLM pretraining data
adept-inference
Inference code for Persimmon-8B
libgen_to_txt
Convert all of libgen to high-quality Markdown
AttentionIsOFFByOne
Implementation of "Attention Is Off By One" by Evan Miller
multipack_sampler
Multipack distributed sampler for fast padding-free training of LLMs
AutoNetGen
Let AI design AI, let large models help small models evolve, and create magic with magic! Empower Artificial Intelligence to sculpt its own kind, where colossal models gracefully usher the petite ones into evolution, weaving magic to conjure further enchantment!
colorbindiff
A visual and colorized diff for binary files.
tqdm-loggable
Logging-friendly progress messages for tqdm progress bars