markpp / GPU_monitoring

Monitoring computers/GPU usage

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

GPU Monitoring - AURORA (Automated Utility for Real Time Observation and Reporting of Activity)

vaplab Logo

This repository contains the development of a GPU monitoring tool called AURORA (Automated Utility for Real Time Observation and Reporting of Activity) for the VAPLAB (Visualization and Performance Laboratory) at Aalborg University.

AURORA is a powerful tool designed to monitor GPU utilization on remote machines. It provides real-time observation and reporting of GPU activity, allowing researchers and system administrators to effectively monitor and analyze GPU performance.

Table of Contents

Introduction

The GPU Monitoring repository contains the development of AURORA, a comprehensive tool for monitoring and analyzing the utilization of GPUs on remote machines. AURORA provides real-time monitoring of GPU activity and generates detailed reports for performance analysis.

Components

This repository is organized into the following subdirectories, each focusing on a specific aspect of AURORA:

  • API: This directory contains the API module responsible for handling communication between the monitoring tool and the remote machines.

  • Gathering DATA: Here, you'll find the module responsible for gathering GPU utilization data from the remote machines.

  • Display: This directory contains the module responsible for displaying real-time GPU activity and generating visualizations.

  • Data Storage: Here, you'll find the module responsible for storing and managing the collected GPU utilization data.

Please refer to the respective README files in each subdirectory for detailed information on their functionalities, usage, and configuration.

Getting Started with Docker-Compose

To get started with GPU monitoring-AURORA using Docker-Compose, follow these steps:

  • Make sure you have Docker and Docker-Compose installed on your machine. If not, please refer to the Docker documentation for instructions on how to install them.

  • Clone the GPU Monitoring repository to your local machine:

git clone https://github.com/Ombrustou/GPU_monitoring.git
  • Navigate to the root directory of the repository:
cd GPU_monitoring
  • Open the docker-compose.yml file and review the configuration. Ensure that the ports 80, 27017, and 3001 are not already in use on your machine. If they are, you may need to modify the docker-compose.yml file to use different ports or free up the conflicting ports on your machine, if your not familiar with that follow the Getting started with Docker.

  • Run the following command to start the Docker containers using Docker-Compose:

sudo docker-compose up

The Docker-Compose command will build and start the necessary containers for AURORA, including the API, data gathering, display, and data storage components.

Now Aurora should be running and the UI should be accessible in your navigator here

Getting Started with Docker

To get started with GPU monitoring-AURORA using Docker follow these steps:

  • Make sure you have Docker installed on your machine. If not, please refer to the Docker documentation for instructions on how to install it.

    Using pre-built Images

    • Create a folder to host the database
    mkdir data
    • Run the data_storage container
    docker run -d --name data-storage-container -v data:/data/db --network=host ghcr.io/ombrustou/gpu_monitoring/data-storage:latest
    • Run the API container
    docker run -d --name api-container --network=host ghcr.io/ombrustou/gpu_monitoring/api:latest
    • Run the Gathering_DATA container
    docker run -d --name data-gathering-container --network=host ghcr.io/ombrustou/gpu_monitoring/gathering-data:latest
    • Run the Display container
    docker run -d --name display-container --network=host ghcr.io/ombrustou/gpu_monitoring/display:latest

    Everything should be running fine. If you get any problem with docker refer to the Building yourself the Images part.

    Building yourself the Images

    • Clone the GPU Monitoring repository to your local machine (only if you want to build images by youself):
    git clone https://github.com/Ombrustou/GPU_monitoring.git
    • Navigate to the root directory of the repository:
    cd GPU_monitoring

Feel free to adjust the container names, ports, and shared folder path to suit your specific setup. Enjoy monitoring and analyzing GPU activity using AURORA !

Contributors

AURORA - GPU Monitoring is developed by the dedicated team of interns mentored by Mark Philip Philipsen at the VAPLAB of Aalborg University.

Interns:

Feel free to explore the different subdirectories and begin monitoring and analyzing GPU activity using AURORA.

About

Monitoring computers/GPU usage


Languages

Language:JavaScript 67.2%Language:HTML 19.3%Language:Python 8.3%Language:CSS 3.3%Language:Shell 1.3%Language:Dockerfile 0.6%