Beast code in Giters

ShuoTang123's starred repositories

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookApache-2.038072 397 67

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonApache-2.029424 339 268

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookMIT13428 93 16

WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Language:Python9224 114 190

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language:PythonApache-2.07822 97 1595

sglang

SGLang is a fast serving framework for large language models and vision language models.

Language:PythonApache-2.05610 56 555

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

MIT3489 29 84

yq

Command-line YAML, XML, TOML processor - jq wrapper for YAML/XML/TOML documents

Language:PythonApache-2.02601 31 157

MoA

Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models

Language:PythonApache-2.02563 35 18

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language:PythonApache-2.02241 21 255

MOSS-RLHF

Language:PythonApache-2.01279 34 53

dclm

DataComp for Language Models

Language:HTMLMIT1133 38 54

persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Language:Python825 15 7

HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Language:PythonApache-2.0719 7 22

SimPO

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Language:PythonMIT675 8 67

chat_templates

Chat Templates for 🤗 HuggingFace Large Language Models

Language:JinjaMIT489 7 14

deita

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Language:PythonApache-2.0479 6 27

pal

PaL: Program-Aided Language Models (ICML 2023)

Language:PythonApache-2.0470 9 14

magpie

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Language:PythonMIT441 5 26