Gregory Palmer's repositories

nui_in_madrl

Negative Update Intervals in Multi-Agent Deep Reinforcement Learning

Language: Python | License: GPL-3.0 | Stargazers: 32 | Issues: 2 | Issues: 1
Language: Jupyter Notebook | Stargazers: 10 | Issues: 2 | Issues: 1

fms_marl

Scalable cooperative multi-agent reinforcement learning for order-controlled, on-schedule manufacturing in flexible manufacturing systems

Language: Java | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0

CycleGAN-tensorflow

TensorFlow implementation for learning image-to-image translation without input-output pairs. https://arxiv.org/pdf/1703.10593.pdf

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

deep-rl-tensorflow

TensorFlow implementation of Deep Reinforcement Learning papers

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

gjp1203.github.io

GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0
Language: C++ | Stargazers: 0 | Issues: 0 | Issues: 0

mujoco-py

MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.

Language: Cython | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Orion

A machine learning library for detecting anomalies in signals.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

plark_ai_public

Montvieux has developed “The hunting of the PLARK” Artificial Intelligence (AI) testbed to support a Hackathon activity at the Alan Turing Institute (ATI). The testbed is very flexible: it will support short-term exercises in the Hackathon and provide a basis for more extensive, long-term, cutting-edge research. The testbed can be used to research the limits of agent generalisation, co-operation, and deception in a defence environment.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

rl_tournament

Reinforcement Learning Tournament Director

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

tensorflow

An Open Source Machine Learning Framework for Everyone

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0