Beast code in Giters

semi-ergodic's repositories

RLlib-with-Dict-State

A minimal example demonstrating how to use RLlib with states that are represented as dictionaries

Language: Python | 11 20

narrow-corridor-ai

A reinforcement learning project for crowd-dynamics in a very narrow corridor

Language: Python | 4 20

simplest-world-Actor-Critic

Reinforcement learning, Policy Gradient, Actor-Critic, AC, Agent-based Simulation, Simple-world

Language: Python | 4 10

A city, its citizens, and their social life are simulated while a contagious disease spreads. The results are fed to a neural network that predicts each individual's probability of catching the disease.

Language: Python | 3 20

recruiter-problem

The secretary problem demonstrates a scenario involving optimal stopping theory.

Language: Python | 3 1 1

cart-pole-deep-RL-actor-critic

Solving the inverted pendulum problem with deep-RL actor-critic (a network shared between the value evaluation and the policy, with an epsilon-greedy policy). Some implementation issues concerning stability are discussed.

Language: Python | 2 2 1

epydemy-ai

Using the predictions of the agent-based simulation code epydemy, we train a deep neural network to help identify the individuals susceptible to catching the virus (the high-risk group). This deep neural network enables us to find the quarantine policy most effectively.

Language: Python | 200

Logistic-RL

Language: Python | 2 10

moo_as_soo

Addressing multi-objective optimization as a single-objective optimization with RL

Language: Python | License: NOASSERTION | 2 20

secretary-problem-env

An environment compatible with OpenAI Gym for the secretary problem

Language: Python | 2 10

simplest-world-REINFORCE

Reinforcement learning, Policy Gradient, REINFORCE, Agent-based Simulation, Simple-world

Language: Python | 200

aug-net

A small tutorial on how to calculate the Jacobian of the outputs with respect to the inputs

Language: Python | 1 10

cart-pole-deep-RL-DDQN

Solving the inverted pendulum problem with deep-RL double DQN. Some implementation issues and tests are discussed.

Language: Python | 1 10

communicative-MARL-v1

A multi-agent RL where the agents learn "what" to communicate with each other.

Language: Python | 1 10

Furnace-Env

A furnace environment compatible with Gymnasium

Language: Python | 1 20

multi-agent-trains-env

An environment (in the OpenAI Gym sense of the word) with multiple agents, serving as a test bed for MA-RL algorithms

Language: Python | 1 10

munching-recipes

Web scraping www.allrecipes.com and returning the information in the form of Python dictionaries

Language: Python | 1 10

RL-Heat-Treatment

Language: Python | 1 10

acme

A library of reinforcement learning components and agents

License: Apache-2.0 | 000

anisotropic-very-very-simple-MD

[playground codes] This is a prototype MD code (a one-filer!!) with anisotropic particle interactions

Language: C++ | 010

harmonic-oscillator-pinn

Code accompanying my blog post: So, what is a physics-informed neural network?

Language: Jupyter Notebook | License: MIT | 000

MagLorGasCpp

A code for the magnetic Lorentz gas with polygonal obstacles.

Language: C++ | 010

nima-siboni.github.io

Language: HTML | 010

python-design-pattern-exercises

A collection of Python design pattern exercises

Language: Python | 000

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.

License: Apache-2.0 | 000

secretary-problem-RL-DDQN

Language: Python | 010

uneven_maze

A simple maze with an uneven surface

Language: Python | License: NOASSERTION | 010

unifying-the-format-of-citations-in-bibliography-file

Language: TeX | 010