Yan-Tong Lin (EazyReal)

EazyReal

Geek Repo

Company:Georgia Tech

Location:Atlanta, GA

Home Page:eazyreal.github.io

Twitter:@tensorfi

Github PK Tool:Github PK Tool

Yan-Tong Lin's starred repositories

llama

Inference code for LLaMA models

Language:PythonLicense:NOASSERTIONStargazers:50895Issues:499Issues:872

LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:33920Issues:209Issues:5189

autogen

A programming framework for agentic AI 🤖

Language:Jupyter NotebookLicense:CC-BY-4.0Stargazers:32877Issues:389Issues:1875

reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Language:Jupyter NotebookLicense:MITStargazers:20567Issues:861Issues:155

BackgroundMusic

Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.

Language:C++License:GPL-2.0Stargazers:16225Issues:151Issues:662

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:14109Issues:120Issues:1105

gvm

Go Version Manager

Language:ShellLicense:MITStargazers:10312Issues:150Issues:327

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Language:PythonLicense:MITStargazers:7702Issues:144Issues:47

Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Language:PythonLicense:MITStargazers:7294Issues:44Issues:466

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonLicense:MITStargazers:6651Issues:65Issues:82

mesh-transformer-jax

Model parallel transformers in JAX and Haiku

Language:PythonLicense:Apache-2.0Stargazers:6291Issues:112Issues:206

exiftool

ExifTool meta information reader/writer

Language:PerlLicense:GPL-3.0Stargazers:3265Issues:59Issues:241

chatglm.cpp

C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)

Language:C++License:MITStargazers:2935Issues:43Issues:252

PettingZoo

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

Language:PythonLicense:NOASSERTIONStargazers:2616Issues:18Issues:374

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Language:PythonLicense:Apache-2.0Stargazers:2202Issues:27Issues:141

TextWorld

​TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1221Issues:39Issues:83

llm-reasoners

A library for advanced large language model reasoning

Language:PythonLicense:Apache-2.0Stargazers:774Issues:14Issues:19

rebel

An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.

Language:C++License:Apache-2.0Stargazers:653Issues:26Issues:33
Language:PythonLicense:MITStargazers:616Issues:18Issues:21

LanguageAgentTreeSearch

Official repository for ICML'24 paper "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"

Language:PythonLicense:MITStargazers:506Issues:9Issues:18

Awesome-LLM-RL

A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.

bitfinex-api-go

BITFINEX Go trading API - Bitcoin, Litecoin, and Ether exchange

Language:GoLicense:MITStargazers:310Issues:35Issues:83

eth

Dark Forest contracts

Language:TypeScriptLicense:GPL-3.0Stargazers:297Issues:13Issues:2

miniwob-plusplus

MiniWoB++: a web interaction benchmark for reinforcement learning

Language:HTMLLicense:MITStargazers:284Issues:15Issues:24

stylus-sdk-rs

Rust Smart Contracts on Arbitrum

Caliptra

Caliptra IP and firmware for integrated Root of Trust block

MicroRTS-Py

A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)

Language:PythonLicense:MITStargazers:232Issues:11Issues:39

LLM-with-RL-papers

A collection of LLM with RL papers

Reinforcement-Learning-for-Market-Making

Using tabular and deep reinforcement learning methods to infer optimal market making strategies

Language:Jupyter NotebookStargazers:163Issues:4Issues:0

RAP

Reasoning with Language Model is Planning with World Model

Language:PDDLLicense:MITStargazers:144Issues:3Issues:8