ppo-pytorch

There are 1 repository under ppo-pytorch topic.

nikhilbarhate99 / PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
deep-learning deep-reinforcement-learning policy-gradient ppo ppo-pytorch proximal-policy-optimization pytorch pytorch-implmention pytorch-tutorial reinforcement-learning reinforcement-learning-algorithms
Language:Python 1479
Lizhi-sjtu / DRL-code-pytorch
Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.
ddpg-pytorch dqn-pytorch ppo-pytorch pytorch rainbow-dqn reinforcement-learning sac-pytorch td3-pytorch ppo-gru ppo-lstm
Language:Python 893
CherryPieSexy / imitation_learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
a2c advantage-actor-critic deep-learning deep-reinforcement-learning gail gail-ppo imitation-learning policy-gradient ppo ppo-algo ppo-pytorch proximal-policy-optimization pytorch recurrent-ppo reinforcement-learning
Language:Python 127
akjayant / PPO_Lagrangian_PyTorch
Implementation of PPO Lagrangian in PyTorch
lagrangian ppo ppo-pytorch pytorch-implementation reinforcement-learning safe-reinforcement-learning ppo-lagrangian
Language:Python 23
faildeny / Multi_Agent_PPO
Multi agent PPO implementation in Pytorch for Unity ML Agents environments.
reinforcement-learning multi-agent-reinforcement-learning ppo-pytorch unity-ml-agents reacher-environment
Language:Python 22
philtabor / ProtoRL
A Torch Based RL Framework for Rapid Prototyping of Research Papers
actor-critic ddpg ddpg-pytorch dqn dqn-pytorch dueling-ddqn dueling-dqn dueling-dqn-pytorch dueling-network-architecture prioritized-experience-replay sac sac-pytorch soft-actor-critic td3 td3-pytorch twin-delayed-policy-gradient ppo ppo-pytorch proximal-policy-optimization
Language:Python 20
jatinarora2702 / gail-pytorch
PyTorch implementation of GAIL and PPO reinforcement learning algorithms
reinforcement-learning imitation-learning policy-gradient gail pytorch ppo-pytorch cartpole-v0 openai-gym
Language:Python 16
LittleWebCat / DRL-Base-EMS
DRL-Base-EMS for HEVs
deep-reinforcement-learning energy-management-strategies ppo-pytorch pytorch sac-pytorch hybrid-electrical-vehicle
Language:HTML 16
rvdweerd / simmodel
Solving pursuit-evasion problems on graphs using Reinfocement Learning and GNNs
reinforcement-learning ppo-pytorch dqn-pytorch gnn-algorithm pursuit-evasion deep-reinforcement-learning lstm-neural-networks partial-observability pomdp pytorch-geometric reinforcement-learning-algorithms graph-neural-networks graph-representation-learning
Language:Python 12
davide97l / PPO-GAIL-cartpole
GAIL learning to imitate PPO playing CartPole.
ppo ppo-pytorch gail irl irl-algorithms cartpole-v0
Language:Jupyter Notebook 11
Tic-Tac-Toe-Gym
francofgp / Tic-Tac-Toe-Gym
This is the Tic-Tac-Toe game made with Python using the PyGame library and the Gym library to implement the AI with Reinforcement Learning
data-science reinforcement-learning gym machine-learning stable-baselines ai ppo-pytorch python
Language:Python 8
wegfawefgawefg / wegs-drl-baselines
Minimum viable reinforcement learning algorithms for your educational convenience.
reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-agent openai-gym ppo-pytorch ppo pytorch dqn dqn-pytorch actor-critic rainbow-dqn dueling-dqn machine-learning neural-networks deep-reinforcement-learning td3 noisy-dqn world-models world-models-rl
Language:Python 8
CherryPieSexy / rl_mario
Reinforcement learning (PPO) plays Mario.
reinforcement-learning ppo-pytorch ppo super-mario-bros
Language:Python 7
Git-123-Hub / reinforcement-learning-algorithm
implementation of reinforcement learning algorithm that is easy to read and understand
reinforcement-learning deep-reinforcement-learning pytorch dqn ddqn dueling-dqn ddqn-per prioritized-experience-replay reinforce reinforce-baseline ddpg td3 sac ppo ppo-pytorch
Language:Python 6
CutnFill_DeepRL
houssameehsain / CutnFill_DeepRL
Positioning a building mass on topography while minimizing the required cut and fill excavation volume using actor critic methods.
reinforcement-learning deep-reinforcement-learning pytorch python topography urban-planning urban-design a2c ppo-pytorch grasshopper
Language:Python 6
nkoorty / rl_parking
Repository with all source files relating to the 6CCE3EEP Final Year Project titled "Self Parking with Reinforcement Learning." The project was implemented using Python, and used PyGame, OpenAI Gym, and the Stable Baselines-3 libraries in order to implement a Proximal Policy Optimisation (PPO) algorithm.
ppo ppo-pytorch pygame reinforcement-learning stablebaselines3
Language:Python 6
SchweizerischeBundesbahnen / flatland-torchrl
An adaption of the Flatland environment for TorchRL.
flatland flatland-challenge ppo-pytorch pytorch reinforcement-learning torchrl
Language:Python 6
paulchen2713 / RIS-MISO-HWI-DRL
Worst-case MSE Minimization for RIS-assisted mmWave MU-MISO Systems with Hardware Impairments and CSI Imperfection
digital-beamforming reconfigurable-intelligent-surfaces reinforcement-learning wireless-communication gymnasium ppo-pytorch stable-baselines3
Language:Python 5
alirezakazemipour / Mario-PPO
ppo-pytorch super-mario-bros proximal-policy-optimization
Language:Python 4
faildeny / PPO_pytorch_implementation
Proximal Policy Optimization method in Pytorch
reinforcement-learning reinforcement-learning-excercises ppo ppo-pytorch openai-gym bipedalwalker
Language:Python 4
imoneoi / xrl-ppo
Automated & super fast PyTorch deep reinforcement learning platform for autonomous driving
deep-reinforcement-learning deep-learning reinforcement-learning autonomous-vehicles autonomous-driving automated-machine-learning pytorch ppo ppo-pytorch
Language:Python 4
rshnn / battleship
Agent trained to play battleship using reinforcement learning (PPO) and openAI gym
ppo-pytorch reinforcement-learning deep-reinforcement-learning battleship-game openai-gym-environments
Language:Jupyter Notebook 4
akashe / DeepReinforcementLearning
Deep RL implementations. DQN, SAC, DDPG, TD3, PPO and VPG implemented in pytorch. Tested Env: LunarLander-v2 and Pendulum-v0.
deep-rl-implementations algorithms pytorch-implementation lunarlander-v2 pendulum-v0 vpg sac ddpg td3 ppo ppo-pytorch
Language:Python 3
c2d08y / LearningBot
A deep reinforcement learning Bot for https://kana.byha.top:444/
bot deep-learning deep-neural-networks deep-reinforcement-learning gamebot neural-network nueral-networks ppo-agent ppo-algo ppo-pytorch ppo2 reinforcement-learning
Language:Python 3
leonjovanovic / drl-ppo-bipedal-walker
PyTorch application of reinforcement learning Advanced Policy Gradient algorithms in OpenAI BipedalWalker- PPO
ppo-pytorch ppo2 bipedalwalker ppo pytorch
Language:Python 3
Nikunj-Gupta / HAMMER
HAMMER: Multi-Level Coordination of Reinforcement Learning Agents via Learned Messaging (Paper: https://ala2021.vub.ac.be/papers/ALA2021_paper_35.pdf)
deep-reinforcement-learning multi-agent-reinforcement-learning ppo-pytorch reinforcement-learning
Language:Python 3
alex-nooj / champion_league
pokemon machine-learning reinforcement-learning ppo ppo-pytorch
Language:Python 2
anshdavid / pytorch-driving-torcs
self driving car using Torcs-1.3.7 simulator with server-patch
torcs torcs-rl torcs-client pytorch python3 cpp reinforcement-learning ppo-pytorch ddpg-pytorch
Language:Python 2
GuillermoVR92 / Deep-RL-Pong_with_PPO_Agent
Deep RL Agent using Proximal Policy Optimization for solving the Pong game.
reinforcement-learning pytorch deep-learning ppo-pytorch
Language:Jupyter Notebook 2
leonjovanovic / drl-ml-agents-3dball
PyTorch application of reinforcement learning DDPG and PPO algorithms in Unity 3D-Ball
ml-agents 3d-ball ddpg ddpg-pytorch ppo ppo-pytorch ppo2 drl
Language:Python 2
steph-koopmanschap / PyLife2
The Improved version of PyLife (now with AI)
ai artificial-intelligence ecosystem-simulation machine-learning neural-networks ppo-pytorch reinforcement-learning simulation
Language:Python 2
tomasspangelo / proximal-policy-optimization
An implementation from the state-of-the-art family of reinforcement learning algorithms Proximal Policy Optimization using normalized Generalized Advantage Estimation and optional batch mode training. The loss function incorporates an entropy bonus.
deep-learning entropy generalized-advantage-estimation machine-learning open-ai open-ai-gym ppo ppo-pytorch proximal-policy-optimization python pytorch reinforcement-learning neural-network optimization gae actor-critic rl
Language:Python 2
DataRohit / AI-Mario-Game
This is a Deep-Q Learning [Stable Baseline] based AI Mario Game where the Model Incrementally Learns and Improves to Play the Game.
deep-learning mario-game ppo-pytorch python tensorboard
Language:Jupyter Notebook 1
Icyfiremario / PPO-Jumpstart
Basic PPO based AI template
ai machine-learning ppo-pytorch
Language:Python 1
marcpaulo15 / RL-connect4
Deep Reinforcement Learning algorithms to play Connect4 using a combination of Supervised Learning and Reinforcement Learning
alphago connect4-ai-game connect4-game deepqlearning dqn-pytorch ppo-pytorch pygame pygame-game pygame-games python pytorch pytorch-rl reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms two-player-game zero-sum-game zero-sum-games
Language:Python 1
muno-video-conferencing / muno
Muno server for bandwidth estimation in video conferencing
bandwidth-estimation deep-reinforcement-learning ppo-pytorch video-conferencing
1

ppo-pytorch

nikhilbarhate99 / PPO-PyTorch

Lizhi-sjtu / DRL-code-pytorch

CherryPieSexy / imitation_learning

akjayant / PPO_Lagrangian_PyTorch

faildeny / Multi_Agent_PPO

philtabor / ProtoRL

jatinarora2702 / gail-pytorch

LittleWebCat / DRL-Base-EMS

rvdweerd / simmodel

davide97l / PPO-GAIL-cartpole

francofgp / Tic-Tac-Toe-Gym

wegfawefgawefg / wegs-drl-baselines

CherryPieSexy / rl_mario

Git-123-Hub / reinforcement-learning-algorithm

houssameehsain / CutnFill_DeepRL

nkoorty / rl_parking

SchweizerischeBundesbahnen / flatland-torchrl

paulchen2713 / RIS-MISO-HWI-DRL

alirezakazemipour / Mario-PPO

faildeny / PPO_pytorch_implementation

imoneoi / xrl-ppo

rshnn / battleship

akashe / DeepReinforcementLearning

c2d08y / LearningBot

leonjovanovic / drl-ppo-bipedal-walker

Nikunj-Gupta / HAMMER

alex-nooj / champion_league

anshdavid / pytorch-driving-torcs

GuillermoVR92 / Deep-RL-Pong_with_PPO_Agent

leonjovanovic / drl-ml-agents-3dball

steph-koopmanschap / PyLife2

tomasspangelo / proximal-policy-optimization

DataRohit / AI-Mario-Game

Icyfiremario / PPO-Jumpstart

marcpaulo15 / RL-connect4

muno-video-conferencing / muno