Yan-Tong Lin's starred repositories
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
BackgroundMusic
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
flash-attention
Fast and memory-efficient exact attention
PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
mesh-transformer-jax
Model parallel transformers in JAX and Haiku
chatglm.cpp
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
PettingZoo
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
llm-reasoners
A library for advanced large language model reasoning
chemcrow-public
Chemcrow
LanguageAgentTreeSearch
Official repository for ICML'24 paper "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"
Awesome-LLM-RL
A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
bitfinex-api-go
BITFINEX Go trading API - Bitcoin, Litecoin, and Ether exchange
miniwob-plusplus
MiniWoB++: a web interaction benchmark for reinforcement learning
stylus-sdk-rs
Rust Smart Contracts on Arbitrum
MicroRTS-Py
A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)
LLM-with-RL-papers
A collection of LLM with RL papers
Reinforcement-Learning-for-Market-Making
Using tabular and deep reinforcement learning methods to infer optimal market making strategies