ShuoTang123's starred repositories

yq

Command-line YAML, XML, TOML processor - jq wrapper for YAML/XML/TOML documents

Language:PythonLicense:Apache-2.0Stargazers:2581Issues:0Issues:0

MOSS-RLHF

MOSS-RLHF

Language:PythonLicense:Apache-2.0Stargazers:1265Issues:0Issues:0

pal

PaL: Program-Aided Language Models (ICML 2023)

Language:PythonLicense:Apache-2.0Stargazers:462Issues:0Issues:0
Language:PythonLicense:MITStargazers:13Issues:0Issues:0

sglang

SGLang is a fast serving framework for large language models and vision language models.

Language:PythonLicense:Apache-2.0Stargazers:5094Issues:0Issues:0

dclm

DataComp for Language Models

Language:HTMLLicense:MITStargazers:1103Issues:0Issues:0

persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Language:PythonStargazers:767Issues:0Issues:0

ntscraper

Scrape from Twitter using Nitter instances

Language:PythonLicense:MITStargazers:168Issues:0Issues:0

HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Language:PythonLicense:Apache-2.0Stargazers:688Issues:0Issues:0

ilya-sutskever-recommended-reading

It is said that, Ilya Sutskever gave John Carmack this reading list of ~ 30 research papers on deep learning.

Stargazers:78Issues:0Issues:0

MoA

Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models

Language:PythonLicense:Apache-2.0Stargazers:2526Issues:0Issues:0

magpie

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Language:PythonLicense:MITStargazers:389Issues:0Issues:0

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

License:MITStargazers:3405Issues:0Issues:0

chat_templates

Chat Templates for 🤗 HuggingFace Large Language Models

Language:JinjaLicense:MITStargazers:454Issues:0Issues:0
Language:PythonStargazers:114Issues:0Issues:0

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:37057Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:56Issues:0Issues:0

AgentGym

Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Language:PythonLicense:MITStargazers:301Issues:0Issues:0

social_simulation

website repo for agent-based social movement simulation

Language:JavaScriptLicense:MITStargazers:12Issues:0Issues:0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29337Issues:0Issues:0

WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Language:PythonStargazers:9193Issues:0Issues:0

deita

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Language:PythonLicense:Apache-2.0Stargazers:465Issues:0Issues:0

SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Language:PythonLicense:MITStargazers:637Issues:0Issues:0

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language:PythonLicense:Apache-2.0Stargazers:2013Issues:0Issues:0

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:13050Issues:0Issues:0

TwitterCrawler

抓取twitter数据,可根据时间、话题、用户名等条件抓取数据,twitter爬虫

Language:JavaStargazers:38Issues:0Issues:0
Language:PythonStargazers:25Issues:0Issues:0

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language:PythonLicense:Apache-2.0Stargazers:7656Issues:0Issues:0

self-speculative-decoding

Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:127Issues:0Issues:0

SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

License:Apache-2.0Stargazers:347Issues:0Issues:0