lxww302

lxww302's starred repositories

Language: MATLAB | License: GPL-3.0 | Stargazers: 9967 | Issues: 0

WildBench

Benchmarking LLMs with Challenging Tasks from Real Users

Language: Python | License: Apache-2.0 | Stargazers: 139 | Issues: 0

gpt_academic

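Provides a practical interaction interface for LLMs such as GPT and GLM, with special optimization for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins, project analysis and self-explanation for Python, C++, and other codebases, PDF/LaTeX paper translation and summarization, parallel querying of multiple LLMs, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.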

Language: Python | License: GPL-3.0 | Stargazers: 62282 | Issues: 0

llm-foundry

LLM training code for Databricks foundation models

Language: Python | License: Apache-2.0 | Stargazers: 3867 | Issues: 0

FlagScale

FlagScale is a large model toolkit based on open-sourced projects.

Language: Python | License: NOASSERTION | Stargazers: 103 | Issues: 0

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 27013 | Issues: 0

books-7

A book a day, keep stupid away

Language: Python | Stargazers: 116 | Issues: 0

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language: Python | License: Apache-2.0 | Stargazers: 1790 | Issues: 0

fastmoe

A fast MoE impl for PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 1479 | Issues: 0

TheoremQA

The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)

Language: Python | License: MIT | Stargazers: 15 | Issues: 0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | Stargazers: 5887 | Issues: 0

dclm

DataComp for Language Models

Language: HTML | License: MIT | Stargazers: 260 | Issues: 0

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language: Python | License: GPL-3.0 | Stargazers: 9246 | Issues: 0

Language: Python | Stargazers: 760 | Issues: 0

ICU-tokenizer

ICU based universal language tokenizer

Language: Python | License: MIT | Stargazers: 28 | Issues: 0

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language: Python | License: Apache-2.0 | Stargazers: 3419 | Issues: 0

Language: Python | License: MIT | Stargazers: 1396 | Issues: 0

nanotron

Minimalistic large language model 3D-parallelism training

Language: Python | License: Apache-2.0 | Stargazers: 978 | Issues: 0

smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Language: Python | License: MIT | Stargazers: 1110 | Issues: 0

SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Language: Python | License: MIT | Stargazers: 532 | Issues: 0

MS-MARCO-Web-Search

A large-scale information-rich web dataset, featuring millions of real clicked query-document labels

License: MIT | Stargazers: 293 | Issues: 0

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 1324 | Issues: 0

pyreft

ReFT: Representation Finetuning for Language Models

Language: Python | License: Apache-2.0 | Stargazers: 965 | Issues: 0

Phi-3CookBook

This is a Phi-3 book for getting started with Phi-3. Phi-3 is a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.

Language: Jupyter Notebook | License: MIT | Stargazers: 1346 | Issues: 0

fp6_llm

An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).

Language: Cuda | License: Apache-2.0 | Stargazers: 160 | Issues: 0

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Language: Python | License: Apache-2.0 | Stargazers: 1778 | Issues: 0

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language: Jupyter Notebook | License: MIT | Stargazers: 11306 | Issues: 0

granite-code-models

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

License: Apache-2.0 | Stargazers: 1019 | Issues: 0

Bend

A massively parallel, high-level programming language

Language: Rust | License: Apache-2.0 | Stargazers: 16836 | Issues: 0

c-style

My favorite C programming practices.

License: NOASSERTION | Stargazers: 1919 | Issues: 0