ShuoTang123's starred repositories

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:38072Issues:397Issues:67

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29424Issues:339Issues:268

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:13428Issues:93Issues:16

WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language:PythonLicense:Apache-2.0Stargazers:7822Issues:97Issues:1595

sglang

SGLang is a fast serving framework for large language models and vision language models.

Language:PythonLicense:Apache-2.0Stargazers:5610Issues:56Issues:555

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

yq

Command-line YAML, XML, TOML processor - jq wrapper for YAML/XML/TOML documents

Language:PythonLicense:Apache-2.0Stargazers:2601Issues:31Issues:157

MoA

Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models

Language:PythonLicense:Apache-2.0Stargazers:2563Issues:35Issues:18

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language:PythonLicense:Apache-2.0Stargazers:2241Issues:21Issues:255

MOSS-RLHF

MOSS-RLHF

Language:PythonLicense:Apache-2.0Stargazers:1279Issues:34Issues:53

dclm

DataComp for Language Models

Language:HTMLLicense:MITStargazers:1133Issues:38Issues:54

persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Language:PythonLicense:Apache-2.0Stargazers:719Issues:7Issues:22

SimPO

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Language:PythonLicense:MITStargazers:675Issues:8Issues:67

chat_templates

Chat Templates for 🤗 HuggingFace Large Language Models

Language:JinjaLicense:MITStargazers:489Issues:7Issues:14

deita

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Language:PythonLicense:Apache-2.0Stargazers:479Issues:6Issues:27

pal

PaL: Program-Aided Language Models (ICML 2023)

Language:PythonLicense:Apache-2.0Stargazers:470Issues:9Issues:14

magpie

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Language:PythonLicense:MITStargazers:441Issues:5Issues:26

SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

AgentGym

Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Language:PythonLicense:MITStargazers:330Issues:4Issues:13

ntscraper

Scrape from Twitter using Nitter instances

Language:PythonLicense:MITStargazers:172Issues:7Issues:70

self-speculative-decoding

Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:132Issues:2Issues:21

ilya-sutskever-recommended-reading

It is said that, Ilya Sutskever gave John Carmack this reading list of ~ 30 research papers on deep learning.

Language:PythonLicense:NOASSERTIONStargazers:60Issues:1Issues:5

TwitterCrawler

抓取twitter数据,可根据时间、话题、用户名等条件抓取数据,twitter爬虫

Language:JavaStargazers:39Issues:0Issues:1
Language:PythonLicense:MITStargazers:13Issues:0Issues:0

social_simulation

website repo for agent-based social movement simulation

Language:JavaScriptLicense:MITStargazers:13Issues:1Issues:0