There are 1 repository under hpc-systems topic.
Location for the LSF Python wrapper for controlling all things LSF
TrinityX is the new generation of ClusterVision's open-source HPC, A/I and cloudbursting platform. It is designed from the ground up to provide all services required in a modern HPC and A/I system, and to allow full customization of the installation.
Pavilion is a Python 3 (3.5+) based framework for running and analyzing tests targeting HPC systems.
Unified and Modular Measurement Framework
This repository collects the materials from the course "Foundations of HPC", 2021, at the Data Science and Scientific Computing Department, University of Trieste
Distant Reader, a tool for using & understanding a corpus
Location for the LSF DRMAA API originally creaded by FedStage
Location for the LSF Perl Module to manipulate all things LSF
Introduction to large scale computing and data wrangling with hands-on tutorials
A web application for job specific performance monitoring in HPC centers
Cloyster HPC is a turnkey HPC cluster solution with an user-friendly installer
A header-only library that provides wait-free ring buffer utilities for C++ objects.
An all-in-one solution for training, testing and maintaining your AI based software
Machine-learned potentials construction for high-entropy alloys properties using GPUMD package - UVA School of Engineering and Applied Science
Tracknodes keeps a history of node state and comment changes. It allows system administrators of HPC systems to determine when nodes were down and discover trends such as recurring issues. Supports Torque, PBSpro and SLURM.
Build slurm scheduler at Rocky Linux 8 with pmix using github actions.
An MPI Fault Tolerance Benchmark Suite
mimoch: MIchele's shell MOdulefiles CHecker
Command line monitoring tools for HPC systems with Slurm workload manager
An agent run on each node in HPC system
Illinois Campus Cluster Resources for Statistical Computing using R
Trying to find a maximum clique in a Graph
⏱️ 📈 💎 Projects based in High Performance Computing Labs. This projects was built using C++ (C Plus Plus) and CUDA (Compute Unified Device Architecture). This repository it's based in some practical lab exercises and examples related with High Performance Computing, among many others!
Benchmarking/Stress-Testing/Functionality Application Suite for Linux Clusters with an Emphasis on Data Science and Bioinformatics
all things SLURM.
This project optimizes HPC job performance by dynamically managing resource allocation, parallelizing tasks, and balancing loads. It uses real-time profiling and monitoring to identify performance bottlenecks and adjust resource distribution, improving job efficiency and reducing idle time.
An introduction to HPC
Python script to automate fetching SSH keys to access the CSCS infrastructure