Donal Byrne's repositories

Landing-A-Rocket-With-Simple-Reinforcement-Learning

This repo gives an example of using a simple reinforcement learning method to solve the Lunar Lander environment. The agent combines the cross-entropy method (CEM) with neural networks, using the PyTorch library.

Language: Jupyter Notebook | Stargazers: 19 | Issues: 3 | Issues: 0

MonteCarlo

Implementation of first-visit Monte Carlo for prediction and control

Language: Jupyter Notebook | Stargazers: 12 | Issues: 2 | Issues: 0

TD3

Implementation of the TD3 algorithm written in PyTorch

Language: Jupyter Notebook | Stargazers: 10 | Issues: 2 | Issues: 0

core_rl

Repo of core reinforcement learning algorithms and explanations using PyTorch Lightning

CNN-On-The-Cloud-

Code used to build an image classifier for the Fashion MNIST dataset, built with the Keras library and trained on the FloydHub cloud platform.

Language: Jupyter Notebook | Stargazers: 4 | Issues: 2 | Issues: 1

DDPG_Reacher

Experiment implementing the DDPG algorithm to train a mechanical arm to reach for a moving target inside the Unity ML-Agents virtual environment

Language: Jupyter Notebook | Stargazers: 2 | Issues: 2 | Issues: 0

DQN_Tensorflow

A Jupyter notebook implementing the DQN model in VizDoom

Language: Jupyter Notebook | Stargazers: 2 | Issues: 2 | Issues: 0

Neural-Network-From-Scratch-Tumour-Diagnosis

This notebook walks through how to build a neural network using only NumPy. The network classifies tumours as malignant or benign, using the Breast Cancer Wisconsin dataset.

Language: Jupyter Notebook | Stargazers: 2 | Issues: 2 | Issues: 0

SAC

PyTorch implementation of the Soft Actor-Critic algorithm

Language: Jupyter Notebook | Stargazers: 2 | Issues: 1 | Issues: 0

awesome-prompt-engineering

Repo containing useful prompt engineering templates that I use for coding, research, and productivity

deep-reinforcement-learning

Repo for the Deep Reinforcement Learning Nanodegree program

Language: Jupyter Notebook | License: MIT | Stargazers: 1 | Issues: 2 | Issues: 0

FlappyBirdRL

A reinforcement learning environment based on the mobile game "Flappy Bird", built using the Unity ML-Agents framework

Language: C# | Stargazers: 1 | Issues: 3 | Issues: 0

MADDPG

Final project for the Udacity RL Nanodegree, implementing Multi-Agent Deep Deterministic Policy Gradients

Language: ASP | Stargazers: 1 | Issues: 2 | Issues: 0

acme

A library of reinforcement learning components and agents

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

AI_Hack_NIA

Nia is a nutrition app that uses object classification and detection to give users nutritional information about their meals and tell them whether the portion size is correct

Language: Python | Stargazers: 0 | Issues: 4 | Issues: 0

DDQN_Navigation

This is my submission for the Udacity navigation project in the Deep Reinforcement Learning Nanodegree

Language: ASP | Stargazers: 0 | Issues: 2 | Issues: 0

djbyrne.github.io

Personal blog

Language: HTML | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

dm-haiku

JAX-based neural network library

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Halite-III

Season 3 of @twosigma's artificial intelligence programming challenge

Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

jumanji

A diverse suite of scalable reinforcement learning environments in JAX

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Neural-Network-From-Scratch-Part-2-TensorFlow

This notebook takes the same dataset used in the previous notebook and builds a classification network with TensorFlow to diagnose cancer tumours.

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

optax

Optax is a gradient processing and optimization library for JAX.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

ptan

PyTorch Agent Net: reinforcement learning toolkit for PyTorch

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

pytorch-lightning

The lightweight PyTorch wrapper for ML researchers. Scale your models. Write less boilerplate

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

pytorch-lightning-bolts

PyTorch Lightning Bolts is a community contribution for AI/ML researchers.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

RL_Workbench

Library containing PyTorch implementations of some of the main RL algorithms. This repo is used for my own learning purposes.

Language: ASP | Stargazers: 0 | Issues: 2 | Issues: 0

templates

Document templates for open-source projects (README, CONTRIBUTING, GitHub templates)

Language: Groovy | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

Value-Iteration

Simple implementation of value iteration using the Frozen Lake environment

Language: Jupyter Notebook | Stargazers: 0 | Issues: 2 | Issues: 0

xland-minigrid

JAX-accelerated meta-reinforcement learning environments inspired by XLand and MiniGrid 🏎️

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0