xiaoyangyang2

xiaoyangyang2

Geek Repo

Github PK Tool:Github PK Tool

xiaoyangyang2's repositories

AlgoXY

Book of Elementary Algorithms and Data structures

Language:TeXStargazers:0Issues:0Issues:0

AutoCrawler

Official implement of paper "AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

daily_arxiv

Using GitHub Action to collect paper list with publicly available source code in the daily arxiv

Stargazers:0Issues:0Issues:0

Deep-Reinforcement-Learning-Hands-On-Second-Edition

Deep-Reinforcement-Learning-Hands-On-Second-Edition, published by Packt

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

deep-rl-class

This repo contain the syllabus of the Hugging Face Deep Reinforcement Learning Class.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

ElegantRL

Scalable and Elastic Deep Reinforcement Learning Using PyTorch. Please star. 🔥

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

epymarl

An extension of the PyMARL codebase that includes additional algorithms and environment support

License:Apache-2.0Stargazers:0Issues:0Issues:0

gtrick

Bag of Tricks for Graph Neural Networks.

License:MITStargazers:0Issues:0Issues:0

Hands-on-RL

https://hrl.boyuai.com/

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

InforMARL

Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

invalid-action-masking

Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

License:MITStargazers:0Issues:0Issues:0

mader

Trajectory Planner in Multi-Agent and Dynamic Environments

Language:C++License:BSD-3-ClauseStargazers:0Issues:0Issues:0

marl_transfer

Code for paper 'Learning transferable cooperative behaviors in multi-agent teams' (ICML 2019)

License:MITStargazers:0Issues:0Issues:0

Multi-Agent-Constrained-Policy-Optimisation

Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

omnisafe

OmniSafe is an infrastructural framework for accelerating SafeRL research.

License:Apache-2.0Stargazers:0Issues:0Issues:0

on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

panther

Perception-Aware Trajectory Planner in Dynamic Environments

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

PARL

A high-performance distributed training framework for Reinforcement Learning

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PGL

Paddle Graph Learning (PGL) is an efficient and flexible graph learning framework based on PaddlePaddle

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

planning

List of planning algorithms developed at MIT-ACL

Stargazers:0Issues:0Issues:0

PPOxFamily

PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Practical_RL

A course in reinforcement learning in the wild

License:UnlicenseStargazers:0Issues:0Issues:0

privateGPT

Interact privately with your documents using the power of GPT, 100% privately, no data leaks

License:Apache-2.0Stargazers:0Issues:0Issues:0

RACE

(ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolution

Stargazers:0Issues:0Issues:0

Safe-Reinforcement-Learning-Baselines

The repository is for safe reinforcement learning baselines.

Stargazers:0Issues:0Issues:0

TD3

Author's PyTorch implementation of TD3 for OpenAI gym tasks

License:MITStargazers:0Issues:0Issues:0

transferlearning

Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习

License:MITStargazers:0Issues:0Issues:0

uav_bs_ctrl

Code implementation of "Cooperative Trajectory Design of Multiple UAV Base Stations with Heterogeneous Graph Neural Networks".

Language:PythonStargazers:0Issues:0Issues:0

WZU-machine-learning-course

温州大学《机器学习》课程资料(代码、课件等)

Stargazers:0Issues:0Issues:0