Beast code in Giters

Donal Byrne's repositories

Landing-A-Rocket-With-Simple-Reinforcement-Learning

This repo gives an example of using a simple method of reinforcement learning to beat the Lunar Lander environment. The agent uses a combination of CEM and neural networks using the pytorch library.

Language:Jupyter Notebook19 30

MonteCarlo

Implementation of first visit Monte Carlo for prediction and control

Language:Jupyter Notebook12 20

TD3

Implementation of the TD3 algorithm written in Pytorch

Language:Jupyter Notebook10 20

core_rl

Repo of core reinforcement learning algorithms and explanations using pytorch lightning

Language:Python8 2 10

CNN-On-The-Cloud-

Code used to build an image classifier for the Fashion MNIST dataset. Built using the Keras library and trained on the FloydHub cloud platform

Language:Jupyter Notebook4 2 1

DDPG_Reacher

Experiment to implement the DDPG algorithm to train a mechanical arm to reach for a moving target inside the unity ML-Agents virtual environment

Language:Jupyter Notebook2 20

DQN_Tensorflow

A jupyter notebook implementing the DQN model in VizDoom

Language:Jupyter Notebook2 20

Neural-Network-From-Scratch-Tumour-Diagnosis

This notebook goes through how to build a neural network using only numpy. The network classifies tumours, identifying if they are malignant or benign. This notebook uses the Breast Cancer Wisconsin dataset.

Language:Jupyter Notebook2 20

SAC

Pytorch implementation of the Soft Actor Critic Algorithm

Language:Jupyter Notebook2 10

awesome-prompt-engineering

repo containing useful prompt engineering templates that I use for coding, research and productivity

1 20

deep-reinforcement-learning

Repo for the Deep Reinforcement Learning Nanodegree program

Language:Jupyter NotebookMIT1 20

FlappyBirdRL

A reinforcement learning environment based on the mobile game "Flappy Bird" built using the Unity ml-agents framework

Language:C#1 30

MADDPG

Final project for the Udacity RL nano degree implementing Multi Agent Deep Deterministic Policy Gradients

Language:ASP1 20

acme

A library of reinforcement learning components and agents

Apache-2.0000

adventures_in_cuda

Language:Jupyter Notebook000

AI_Hack_NIA

Nia is nutrition app using object classification and detection to gives users nutritional information about their meals and if the portion size is correct

Language:Python040

DDQN_Navigation

This is my submission for the Udacity navigation project in the Deep Reinforcement Learning Nano Degree

Language:ASP020

djbyrne.github.io

Personal blog

Language:HTMLMIT020

dm-haiku

JAX-based neural network library

Apache-2.0000

Halite-III

Season 3 of @twosigma's artificial intelligence programming challenge

Language:JavaScriptMIT020

jumanji

A diverse suite of scalable reinforcement learning environments in JAX

Language:PythonApache-2.0000

Neural-Network-From-Scratch-Part-2-TensorFlow

This notebook takes the the same dataset used in the previous notebook and it builds a classification network with tensor flow to diagnose cancer tumours.

Language:Python020