semi-ergodic's repositories

Language:PythonStargazers:20Issues:1Issues:0

RLlib-with-Dict-State

A minimal example demonstrating how to use RLlib with states which are presented as dictionaries

Language:PythonStargazers:11Issues:0Issues:0

narrow-corridor-ai

A reinforcement learning project for crowd-dynamics in a very narrow corridor

Language:PythonStargazers:4Issues:2Issues:0

simplest-world-Actor-Critic

Reinforcement learning, Policy Gradient, Actor-Critic, AC, Agent-based Simulation, Simple-world

Language:PythonStargazers:4Issues:1Issues:0

epydemy

A city with its citizens and their social life are simulated where a contagious disease is spreading. The results are used to feed a neural network for predicting the probability of catching the disease for each individual

Language:PythonStargazers:3Issues:2Issues:0

recruiter-problem

The secretary problem is a problem that demonstrates a scenario involving optimal stopping theory.

cart-pole-deep-RL-actor-critic

Solving the inverted pendulum problem with deep-RL actor-critic (with shared network between the value-evaluation and the policy, epsilon-greedy policy). Some implementation issues concerning the stability are discussed.

epydemy-ai

Using the predictions of the agent-based simulation code epydemy, we train a deep neural-network to help identifying the individual susceptibile to catching the virus (the high-risk group). This deep neural-network enables us to find the quarantine-policy most effectively.

Language:PythonStargazers:2Issues:0Issues:0
Language:PythonStargazers:2Issues:0Issues:0

moo_as_soo

addressing multi-objective optimization as a single objective optimization with RL

Language:PythonLicense:NOASSERTIONStargazers:2Issues:2Issues:0

secretary-problem-env

An environment compatible with open-AI gym for the secretary problem

Language:PythonStargazers:2Issues:0Issues:0

simplest-world-REINFORCE

Reinforcement learning, Policy Gradient, REINFORCE, Agent-based Simulation, Simple-world

Language:PythonStargazers:2Issues:0Issues:0

aug-net

A small tutorial on how to calculate the Jacobian of the outputs wrt inputs

Language:PythonStargazers:1Issues:1Issues:0

cart-pole-deep-RL-DDQN

Solving the inverted pendulum problem with deep-RL double DQN. Some implementation issues and tests are discussed.

Language:PythonStargazers:1Issues:1Issues:0

communicative-MARL-v1

A multi-agent RL where the agents learn "what" to communicate with each other.

Language:PythonStargazers:1Issues:0Issues:0

Furnace-Env

a furnace environment compatible with Gymnasium

Language:PythonStargazers:1Issues:0Issues:0

multi-agent-trains-env

An environment (in openAI gym sense of the word) with multiple agents as a test bed for MA-RL algorithms

Language:PythonStargazers:1Issues:1Issues:0

munching-recipes

web scraping www.allrecipes.com and returning info in form of Python dictionaries

Language:PythonStargazers:1Issues:1Issues:0

acme

A library of reinforcement learning components and agents

License:Apache-2.0Stargazers:0Issues:0Issues:0

anisotropic-very-very-simple-MD

[playground codes] This is a prototype MD (a one filer!!) code with anisotropic particle interactions

Language:C++Stargazers:0Issues:1Issues:0

harmonic-oscillator-pinn

Code accompanying my blog post: So, what is a physics-informed neural network?

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

MagLorGasCpp

A code for magnetic Lorentz gas with obstacles which are made of polygon.

Language:C++Stargazers:0Issues:1Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

uneven_maze

a simple maze with an uneven surface

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0