Nicholas Malaya's starred repositories
OpenFOAM_HMM
Refactoring OpenFOAM with OpenMP target offloading and use of HMM to offload work onto GPUs
rccl-tests
RCCL Performance Benchmark Tests
AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
Quicksilver
A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037
olcf-user-docs
Sources for the Oak Ridge Leadership Computing Facility User Documentation
sysconfidence
System Confidence - a system latency analysis benchmark
ECP-ST-CAR-PUBLIC
The Exascale Computing Project Software Technologies Capability Assessment Report - Public Version
benchmarks
A benchmark framework for Tensorflow