suolyer

suolyer's starred repositories

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 49109 | Issues: 0

FlashAttention20Triton

Triton implementation of Flash Attention 2.0

Language: Python | License: MIT | Stargazers: 15 | Issues: 0

Awesome-LLM-Long-Context-Modeling

📰 Must-read papers and blogs on LLM-based long-context modeling 🔥

License: MIT | Stargazers: 512 | Issues: 0

long-llms-learning

A repository sharing the literature on long-context large language models, including methodologies and evaluation benchmarks

Language: Jupyter Notebook | Stargazers: 216 | Issues: 0

triton_flashv2_alibi

Working repo for Triton-based Flash2 supporting ALiBi positional embeddings

Language: Python | License: MIT | Stargazers: 1 | Issues: 0

synthesizer

A multi-purpose LLM framework for RAG and data creation.

Language: Python | License: Apache-2.0 | Stargazers: 599 | Issues: 0

textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Language: Python | License: MIT | Stargazers: 458 | Issues: 0

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️ 🍸 🍹 🍷

Language: Python | License: Apache-2.0 | Stargazers: 1656 | Issues: 0

Language: Rust | License: Apache-2.0 | Stargazers: 1038 | Issues: 0

text-dedup

Python package for memory-friendly text de-duplication

Language: Python | License: Apache-2.0 | Stargazers: 6 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 293 | Issues: 0

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

License: MIT | Stargazers: 2168 | Issues: 0

cc_net

Tools to download and clean up Common Crawl data

Language: Python | License: MIT | Stargazers: 927 | Issues: 0

Arxiv-NLP-Reporter

Automatically fetches the latest NLP papers from arXiv every day

Language: Python | Stargazers: 15 | Issues: 0

TianMu

TianMu: A modern AI tool with multi-platform support, Markdown support, multimodal, continuous conversation, and customizable commands. An open-source LLM client: one app that supports ERNIE Bot (文心一言), Tongyi Qianwen (通义千问), LLaMA, ChatGPT, and more.

Stargazers: 87 | Issues: 0

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language: Python | License: Apache-2.0 | Stargazers: 1973 | Issues: 0

sft_datasets

A collection of open-source SFT datasets, updated continually

Stargazers: 380 | Issues: 0

Cornucopia-LLaMA-Fin-Chinese

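Cornucopia (聚宝盆): a series of open-source, commercially usable Chinese financial LLMs, along with an efficient, lightweight training framework for vertical-domain LLMs (pretraining, SFT, RLHF, quantization, etc.)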

Language: Python | License: Apache-2.0 | Stargazers: 565 | Issues: 0

WanJuan1.0

WanJuan 1.0 multimodal corpus

License: CC-BY-4.0 | Stargazers: 429 | Issues: 0

Finetune_LLAMA

A simple, easy-to-follow guide to fine-tuning LLaMA.

Language: Python | Stargazers: 306 | Issues: 0

Stargazers: 740 | Issues: 0

High-quality-Chinese-Q-A-dataset

The largest open-source Chinese Q&A dataset, supporting Chinese LLMs

Language: Python | Stargazers: 7 | Issues: 0

SYSU-Exam

A collection of SYSU final exam papers and study materials

License: MIT | Stargazers: 1703 | Issues: 0

License: Apache-2.0 | Stargazers: 4700 | Issues: 0

Awesome-Chinese-LLM

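A curated list of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, covering base models, vertical-domain fine-tuning and applications, datasets, tutorials, and more.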

Stargazers: 12694 | Issues: 0

M3KE

A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark

Stargazers: 90 | Issues: 0

Open-Llama

The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.

License: MIT | Stargazers: 54 | Issues: 0

UltraChat

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Language: Python | License: MIT | Stargazers: 2146 | Issues: 0

DB-GPT

AI-native data app development framework with AWEL (Agentic Workflow Expression Language) and agents

Language: Python | License: MIT | Stargazers: 12068 | Issues: 0

flash-attention

Fast and memory-efficient exact attention

Language: C++ | License: BSD-3-Clause | Stargazers: 6 | Issues: 0