suolyer's starred repositories
FlashAttention20Triton
Triton implementation of Flash Attention 2.0
Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blog posts on LLM-based long-context modeling 🔥
long-llms-learning
A repository sharing the literature on long-context large language models, including methodologies and evaluation benchmarks
triton_flashv2_alibi
Working repo for Triton-based Flash Attention 2 supporting ALiBi positional embeddings
synthesizer
A multi-purpose LLM framework for RAG and data creation.
textbook_quality
Generate textbook-quality synthetic LLM pretraining data
data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️ 🍸 🍹 🍷
text-dedup
Python package for memory-friendly text de-duplication
LLMDataHub
A quick guide to trending instruction fine-tuning datasets
Arxiv-NLP-Reporter
Automatically crawls the latest NLP papers from arXiv every day
sft_datasets
A curated collection of open-source SFT datasets, updated continuously
Cornucopia-LLaMA-Fin-Chinese
Cornucopia (聚宝盆): a series of open-source, commercially usable Chinese financial LLMs, with an efficient, lightweight training framework for vertical-domain LLMs (pretraining, SFT, RLHF, quantization, etc.)
WanJuan1.0
WanJuan 1.0 multimodal corpus
Finetune_LLAMA
An easy-to-follow guide to fine-tuning LLaMA
literature-books
Books in plain-text (txt) format
High-quality-Chinese-Q-A-dataset
The largest open-source Chinese Q&A dataset, supporting Chinese LLMs
Awesome-Chinese-LLM
A curated list of open-source Chinese large language models, focusing on smaller models that can be privately deployed and trained at low cost, covering base models, vertical-domain fine-tuning and applications, datasets, and tutorials
Open-Llama
Complete training code for the open-source high-performance Llama model, covering the full pipeline from pre-training to RLHF
flash-attention
Fast and memory-efficient exact attention