maohangyu

maohangyu

Geek Repo

Github PK Tool:Github PK Tool

maohangyu's repositories

TIT_open_source

The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"

marl_demo

demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention Multi-Agent DDPG) and NCC-MARL (Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning).

Language:PythonStargazers:33Issues:1Issues:0

PET-SQL

PET-SQL: A Prompt-enhanced Two-stage Text-to-SQL Framework with Cross-consistency

Language:PythonStargazers:10Issues:0Issues:0

PDiT

PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning. AAMAS 2024 (full paper with oral presentation).

Stargazers:7Issues:0Issues:0

ptde-open

This is codes of PTDE algorithms. Accepted by IJCAI 2024. Ptde: Personalized training with distilled execution for multi-agent reinforcement learning.

Language:PythonStargazers:2Issues:0Issues:0

codes

The source code of CodeS (SIGMOD 2024).

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

dreamer-torch

Pytorch version of Dreamer, which follows the original TF v2 codes.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Dreamer_PyTorch

Unofficial Re-implementation of "Dream to Control: Learning Behaviors by Latent Imagination" (https://arxiv.org/abs/1912.01603 ) with PyTorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

modelscope-agent

ModelScope-Agent: An agent framework connecting models in ModelScope with the world

License:Apache-2.0Stargazers:0Issues:0Issues:0

MPC

Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning

Language:PythonStargazers:0Issues:0Issues:0

MPC_template-model_predictive_control_for_reinforcement_learning

Pytorch version of the MPC in model-based reinforcement learning (MBRL), currently only test in the CartPole-swing-up environment

Language:PythonStargazers:0Issues:0Issues:0

open-interpreter

OpenAI's Code Interpreter in your terminal, running locally

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0