Officer-No1

Officer-No1's starred repositories

imitation

Clean PyTorch implementations of imitation and reward learning algorithms

Language: Python | License: MIT | Stargazers: 1277 | Issues: 0

Large-Language-Models-play-StarCraftII

TextStarCraft2, a pure-language environment that supports LLMs playing StarCraft II

Language: Python | Stargazers: 192 | Issues: 0

PPOxFamily

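PPO x Family DRL Tutorial Course (an introductory open course on decision intelligence: 8 lessons that help you sort out the algorithm theory, straighten out the code logic, and master practical decision-AI applications)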

Language: Python | License: Apache-2.0 | Stargazers: 1910 | Issues: 0

ppo-implementation-details

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Language: Python | License: NOASSERTION | Stargazers: 618 | Issues: 0

robotics-fm-survey

Survey Paper of foundation models for robotics

Stargazers: 308 | Issues: 0

swarm_ros_bridge

A lightweight middle interface that enables specified ROS message transmission among swarm robots through socket communication

Language: C++ | License: BSD-3-Clause | Stargazers: 62 | Issues: 0

MARLlib

Multi-agent Reinforcement Learning (MARL) Version of RLlib, integrated with Awesome-MCS

Language: Python | License: MIT | Stargazers: 1 | Issues: 0

CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Language: Python | License: Apache-2.0 | Stargazers: 1064 | Issues: 0

parkour

[CoRL 2023] Robot Parkour Learning

Language: Python | License: MIT | Stargazers: 530 | Issues: 0

AISystem

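AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks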

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 10600 | Issues: 0

OpenGPTS

OpenGPTs - Powerful GPTs Copilot | Powerful GPTs browser extension | Multi-window | Batch conversations | ChatGPT 3.5 | ChatGPT 4.0

Language: TypeScript | Stargazers: 177 | Issues: 0

MARL-papers-with-code

Multi-Agent Reinforcement Learning (MARL) papers with code

Stargazers: 298 | Issues: 0

RL_draw_seabron

Use seaborn to draw RL plots

Language: Jupyter Notebook | Stargazers: 24 | Issues: 0

vhmap

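A simple, easy-to-use tool for creating and controlling 3D scenes. Based on ThreeJS, with a pure Python interface. It is suited to 3D demonstrations in scientific research and multi-agent reinforcement learning, as well as entertainment and similar applications.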

Language: Python | License: MIT | Stargazers: 33 | Issues: 0

BehaviorTree.CPP

Behavior Trees Library in C++. Batteries included.

Language: C++ | License: MIT | Stargazers: 2935 | Issues: 0

D4RL

A collection of reference environments for offline reinforcement learning

Language: Python | License: Apache-2.0 | Stargazers: 1295 | Issues: 0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language: Python | License: Apache-2.0 | Stargazers: 33209 | Issues: 0

awesome-model-based-RL

A curated list of awesome model-based RL resources (continually updated)

License: Apache-2.0 | Stargazers: 877 | Issues: 0

OfflineRL

A collection of offline reinforcement learning algorithms.

Language: Python | License: Apache-2.0 | Stargazers: 154 | Issues: 0

Awesome-LLM-Robotics

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

License: BSD-3-Clause | Stargazers: 2810 | Issues: 0

awesome-ml4co

Awesome machine learning for combinatorial optimization papers.

Language: Python | Stargazers: 1636 | Issues: 0

Adversarial-Reinforcement-Learning-Papers

Adversarial Reinforcement Learning papers (single-agent setting and multi-agent setting)

Stargazers: 56 | Issues: 0

MARL-resources-collection

A Collection of Multi-Agent Reinforcement Learning (MARL) Resources

Stargazers: 196 | Issues: 0

MOBA_RL

Deep Reinforcement Learning for Multiplayer Online Battle Arena

Language: Python | License: MIT | Stargazers: 72 | Issues: 0

awesome-offline-rl

An index of algorithms for offline reinforcement learning (offline-rl)

Stargazers: 906 | Issues: 0

gpt_academic

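Provides a practical interactive interface for large language models such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other projects; translation and summarization of PDF/LaTeX papers; parallel queries to multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.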

Language: Python | License: GPL-3.0 | Stargazers: 64364 | Issues: 0

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language: Python | License: NOASSERTION | Stargazers: 5382 | Issues: 0

ElegantRL

Massively Parallel Deep Reinforcement Learning. 🔥

Language: Python | License: NOASSERTION | Stargazers: 3658 | Issues: 0

xuance

XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library

Language: Python | License: MIT | Stargazers: 599 | Issues: 0