Edan Toledo (EdanToledo)

EdanToledo

Geek Repo

Company:@instadeepai

Github PK Tool:Github PK Tool

Edan Toledo's repositories

Stoix

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Language:PythonLicense:Apache-2.0Stargazers:177Issues:5Issues:23

JAX-MAML

Super simple implementation of MAML for RL in JAX

Language:PythonStargazers:4Issues:1Issues:0

RL-Algorithms

Jupyter Notebooks of minimal Reinforcement Learning Algorithms

Language:Jupyter NotebookStargazers:2Issues:1Issues:0

dreamerv3-1

Mastering Diverse Domains through World Models

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

jax-dreamer

Dreamer on JAX

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

ai-economist

Foundation is a flexible, modular, and composable framework to model socio-economic behaviors and dynamics with both agents and governments. This framework can be used in conjunction with reinforcement learning to learn optimal economic policies, as done by the AI Economist (https://www.einstein.ai/the-ai-economist).

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

Basic-NN

Basic Neural Network that uses either a threshold or sigmoid activation function. Nodes use the perceptron learning rule.

Language:C++Stargazers:0Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Connect-Four-Against-AlphaZero

Simple hack to existing connect 4 javascript app to allow for AlphaZero model to play online

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

CSC3022F-Huffman-Encoding

C++ implementation of a huffman tree and encoding - Compress and Decompress text files

Language:C++Stargazers:0Issues:1Issues:0

CSC3022F-K-Means-Clustering

K means clustering assignment for CSC3022F

Language:C++Stargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

Language-Modelling-Pytorch

N-gram Language Model using PyTorch

Language:PythonStargazers:0Issues:1Issues:0

dejax

Accelerated replay buffers in JAX

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DQN-and-Actor-Critic-PyTorch

Really simple implementation of DQN in pytorch for gym environments

Language:PythonStargazers:0Issues:1Issues:0

DuelingDDQN-and-AlphaZero

Implementation of DQN, DDQN and Dueling (D)DQN to play Pong. AlphaZero implementation to play Connect4

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

GridWorldRLModelFree

Model Free TD(λ) implementation for pathfinding in a grid world. Makes use of Q Learning

Language:PythonStargazers:0Issues:1Issues:0

gymnax

RL Environments in JAX 🌍

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

IQA

Extensions to Yuan et al. QAit task.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

marl-eval

A tool for aggregating and plotting MARL experiment data.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Mava

🦁 A library of multi-agent reinforcement learning systems and components

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

meltingpot

A suite of test scenarios for multi-agent reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ModelRenderingOpenGL

Simple OpenGL model rendering with phong shading

Language:C++Stargazers:0Issues:1Issues:0

PCA

Answering PCA Question Assignment 5

Language:C++Stargazers:0Issues:0Issues:0

popjym

POPGym Library in JAX

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

REINFORCE-PyTorch

Simple Implementation of REINFORCE and PPO

Language:PythonStargazers:0Issues:1Issues:0

VectorizedMultiAgentSimulator

VMAS is a vectorized framework designed for efficient Multi-Agent Reinforcement Learning benchmarking. It is comprised of a vectorized 2D physics engine written in PyTorch and a set of challenging multi-robot scenarios. Additional scenarios can be implemented through a simple and modular interface.

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0