antunsz / deep-reinforcement-learning

This project uses a Jupyter notebook to train a model that balances a pole on a cart (a matchstick on a box, so to speak) with Deep Q-Learning. It is an experiment in applying reinforcement learning to the classic "CartPole" problem from OpenAI's Gym.

Deep Reinforcement Learning 🤖🎮

Overview 📖

The goal is to develop an agent that keeps a pole upright for as long as possible by moving the cart on which it is mounted. This project is an introduction to the concepts of Deep Q-Learning, including experience replay and the use of a neural network to approximate Q-values.
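The pieces named above can be sketched in a few lines of plain Python. This is a minimal illustration, not the notebook's code: the names ReplayBuffer, td_target, and epsilon_greedy, the discount factor gamma, and the exploration rate epsilon are all assumptions made for this example. In a full agent the Q-values would come from the neural network; here they are plain lists so the logic can be read on its own.

```python
import random
from collections import deque, namedtuple

# One stored step of interaction with the environment (hypothetical layout).
Transition = namedtuple("Transition", "state action reward next_state done")


class ReplayBuffer:
    """Fixed-size store of past transitions, sampled uniformly at random."""

    def __init__(self, capacity):
        # A deque with maxlen drops the oldest transition once it is full.
        self.memory = deque(maxlen=capacity)

    def push(self, *args):
        self.memory.append(Transition(*args))

    def sample(self, batch_size):
        # Random sampling breaks the correlation between consecutive steps.
        return random.sample(list(self.memory), batch_size)

    def __len__(self):
        return len(self.memory)


def td_target(reward, next_q_values, done, gamma=0.99):
    """Q-learning target: r + gamma * max Q(s', a'), or just r at episode end."""
    if done:
        return reward
    return reward + gamma * max(next_q_values)


def epsilon_greedy(q_values, epsilon, rng=random):
    """Explore with probability epsilon, otherwise pick the best-valued action."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])
```

For CartPole, each state is a list of four numbers and there are two actions (push left, push right), so q_values would have length two. During training, the network's prediction for the chosen action is moved toward td_target for each sampled transition.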

Getting Started 🚀

To run this project, you will need:

  • Python 3.8 or above
  • An environment manager (e.g., conda or venv)
  • Jupyter Notebook or JupyterLab

Installation

  1. Clone the repository to your local machine.
  2. Create a virtual environment:
     python -m venv venv
  3. Activate the virtual environment:
     • On Windows: venv\Scripts\activate
     • On Unix or macOS: source venv/bin/activate
  4. Install the required packages:
     pip install -r requirements.txt
  5. Open the notebook cartpole.ipynb in Jupyter Notebook or JupyterLab and follow the instructions.
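After installation, a quick sanity check of the environment can save a failed notebook run. The sketch below is one way to do it, under assumptions: the helper names python_ok and missing_modules are made up for this example, and the module names gym and tensorflow are guesses at the import names, since the actual package list lives in requirements.txt.

```python
import sys
from importlib.util import find_spec

MIN_PYTHON = (3, 8)  # minimum version stated in the requirements list
# Assumed import names; adjust to match requirements.txt.
ASSUMED_MODULES = ("gym", "tensorflow")


def python_ok(version=None, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version meets the minimum."""
    version = sys.version_info if version is None else version
    return tuple(version[:2]) >= minimum


def missing_modules(names=ASSUMED_MODULES):
    """Return the top-level module names that cannot be found for import."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    print("Python version OK:", python_ok())
    print("Missing modules:", missing_modules() or "none")
```

Run it with the virtual environment activated so that it inspects the same interpreter the notebook will use.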

Technologies Used 🛠️

  • Python: The main programming language used.
  • OpenAI Gym: Provides the CartPole environment.
  • TensorFlow: Used for creating and training the neural network.

License 📄

This project is licensed under the GNU General Public License v3.0 - see the LICENSE file for details.
