Beast code in Giters

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Language:PythonApache-2.0106400

parkour

[CoRL 2023] Robot Parkour Learning

Language:PythonMIT53000

AISystem

AISystem 主要是指AI系统，包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Language:Jupyter NotebookApache-2.01060000

OpenGPTS

Language:TypeScript17700

MARL-papers-with-code

Multi-Agent Reinforcement Learning (MARL) papers with code

29800

RL_draw_seabron

Use seaborn to draw RL picture

Language:Jupyter Notebook2400

vhmap

一个简洁易用3D场景创建和控制工具。基于ThreeJS。纯Python接口。它适用于科研、多智能体强化学习领域的3D演示、娱乐等应用。

Language:PythonMIT3300

BehaviorTree.CPP

Behavior Trees Library in C++. Batteries included.

Language:C++MIT293500

D4RL

A collection of reference environments for offline reinforcement learning

Language:PythonApache-2.0129500

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language:PythonApache-2.03320900

awesome-model-based-RL

A curated list of awesome model based RL resources (continually updated)

Apache-2.087700

OfflineRL

A collection of offline reinforcement learning algorithms.

Language:PythonApache-2.015400

Awesome-LLM-Robotics

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

BSD-3-Clause281000

awesome-ml4co

Awesome machine learning for combinatorial optimization papers.

Language:Python163600

Adversarial-Reinforcement-Learning-Papers

Adversarial Reinforcement Learning papers (single-agent setting and multi-agent setting)

5600

MARL-resources-collection

A Collection of Multi-Agent Reinforcement Learning (MARL) Resources

19600

MOBA_RL

Deep Reinforcement Learning for Multiplayer Online Battle Arena

Language:PythonMIT7200

awesome-offline-rl

An index of algorithms for offline reinforcement learning (offline-rl)

90600

为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。

Language:PythonGPL-3.06436400

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonNOASSERTION538200

ElegantRL

Massively Parallel Deep Reinforcement Learning. 🔥

Language:PythonNOASSERTION365800

xuance

XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library

Language:PythonMIT59900

Officer-No1

Officer-No1's starred repositories

imitation

Large-Language-Models-play-StarCraftII

PPOxFamily

ppo-implementation-details

robotics-fm-survey

swarm_ros_bridge

MARLlib

CORL