EazyReal

Yan-Tong Lin's starred repositories

llama

Inference code for LLaMA models

Language:PythonNOASSERTION50895 499 872

LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonApache-2.033920 209 5189

autogen

A programming framework for agentic AI 🤖

Language:Jupyter NotebookCC-BY-4.032877 389 1875

reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Language:Jupyter NotebookMIT20567 861 155

BackgroundMusic

Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.

Language:C++GPL-2.016225 151 662

flash-attention

Fast and memory-efficient exact attention

Language:PythonBSD-3-Clause14109 120 1105

gvm

Go Version Manager

Language:ShellMIT10312 150 327

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Language:PythonMIT7702 144 47

Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Language:PythonMIT7294 44 466

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonMIT6651 65 82

mesh-transformer-jax

Model parallel transformers in JAX and Haiku

Language:PythonApache-2.06291 112 206

exiftool

ExifTool meta information reader/writer

Language:PerlGPL-3.03265 59 241

chatglm.cpp

C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)

Language:C++MIT2935 43 252

PettingZoo

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

Language:PythonNOASSERTION2616 18 374

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Language:PythonApache-2.02202 27 141

TextWorld

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Language:Jupyter NotebookNOASSERTION1221 39 83

llm-reasoners

A library for advanced large language model reasoning

Language:PythonApache-2.0774 14 19

rebel

An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.

Language:C++Apache-2.0653 26 33

chemcrow-public

Chemcrow

Language:PythonMIT616 18 21

LanguageAgentTreeSearch

Official repository for ICML'24 paper "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"

Language:PythonMIT506 9 18

Awesome-LLM-RL

A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.

328 60

bitfinex-api-go

BITFINEX Go trading API - Bitcoin, Litecoin, and Ether exchange

Language:GoMIT310 35 83

eth

Dark Forest contracts

Language:TypeScriptGPL-3.0297 13 2

miniwob-plusplus

MiniWoB++: a web interaction benchmark for reinforcement learning

Language:HTMLMIT284 15 24

stylus-sdk-rs

Rust Smart Contracts on Arbitrum

Language:Rust240 13 36

Caliptra

Caliptra IP and firmware for integrated Root of Trust block

Apache-2.0234 39 94

MicroRTS-Py

A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)

Language:PythonMIT232 11 39

LLM-with-RL-papers

A collection of LLM with RL papers

228 8 3

Reinforcement-Learning-for-Market-Making

Using tabular and deep reinforcement learning methods to infer optimal market making strategies

Language:Jupyter Notebook163 40

RAP

Reasoning with Language Model is Planning with World Model

Language:PDDLMIT144 3 8