lxww302's starred repositories

WildBench

Benchmarking LLMs with Challenging Tasks from Real Users

Language: Python | License: Apache-2.0 | 13900

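Provides a practical interactive interface for GPT, GLM, and other LLMs, optimized in particular for reading, polishing, and writing papers. Modular design; supports custom shortcut buttons and function plugins; project analysis and self-translation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates with Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.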

Language: Python | License: GPL-3.0 | 6228200

llm-foundry

LLM training code for Databricks foundation models

Language: Python | License: Apache-2.0 | 386700

FlagScale

FlagScale is a large model toolkit based on open-sourced projects.

Language: Python | License: NOASSERTION | 10300

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | 2701300

books-7

A book a day, keep stupid away

Language: Python | 11600

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language: Python | License: Apache-2.0 | 179000

fastmoe

A fast MoE impl for PyTorch

Language: Python | License: Apache-2.0 | 147900

TheoremQA

The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)

Language: Python | License: MIT | 1500

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | 588700

dclm

DataComp for Language Models

Language: HTML | License: MIT | 26000

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language: Python | License: GPL-3.0 | 924600

MAP-NEO

Language: Python | 76000

ICU-tokenizer

ICU based universal language tokenizer

Language: Python | License: MIT | 2800

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language: Python | License: Apache-2.0 | 341900

simple-evals

Language: Python | License: MIT | 139600

nanotron

Minimalistic large language model 3D-parallelism training

Language: Python | License: Apache-2.0 | 97800

smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Language: Python | License: MIT | 111000

SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Language: Python | License: MIT | 53200

MS-MARCO-Web-Search

A large-scale information-rich web dataset, featuring millions of real clicked query-document labels

License: MIT | 29300

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language: Jupyter Notebook | License: NOASSERTION | 132400

pyreft

ReFT: Representation Finetuning for Language Models

Language: Python | License: Apache-2.0 | 96500

Phi-3CookBook

This is a Phi-3 book for getting started with Phi-3. Phi-3 is a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and the next size up across a variety of language, reasoning, coding, and math benchmarks.

Language: Jupyter Notebook | License: MIT | 134600
