Trey White's repositories
fishfry
A benchmark that solves a 3D Poisson problem using MPI communication and GPU computation.
always
MPI all-to-all microbenchmarks with sub-communicators.
E3SM
Energy Exascale Earth System Model source code.
scream
Exascale global atmosphere model written in C++ as part of the E3SM project
sumtimes
Microbenchmark for MPI_Allreduce with MPI_SUM using GPU memory.
amrex-tutorials
Tutorials for the AMReX framework.
faces
Microbenchmarks for testing programming strategies for nearest-neighbor communication with GPU-aware MPI.
Quicksilver
A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037
olcf-user-docs
Sources for the Oak Ridge Leadership Computing Facility User Documentation
slate
SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) systems. It is developed as part of the U.S. Department of Energy Exascale Computing Project (ECP).
pairs
An MPI microbenchmark that measures the performance of pairwise parallel ping-pongs.
cholla
A GPU-based hydro code
schmccl-tests
[NR]CCL tests implemented in MPI
rccl-tests
RCCL Performance Benchmark Tests
Comb
Comb is a communication performance benchmarking tool.
codesign-kernels
Climate kernels shared with interested parties for co-design
hipfort
Fortran interfaces for ROCm libraries
cuda-samples
Samples for CUDA developers that demonstrate features in the CUDA Toolkit
lsms
LSMS is a code for scalable first principles calculations of materials using multiple scattering theory.
Tensile
Stretching GPU performance for GEMMs and tensor contractions.
timers
Timers for C++ with MPI and OpenMP.
clang
Mirror kept for legacy. Moved to https://github.com/llvm/llvm-project
small_dgemms
Mini-app investigating performance of many small dgemm operations.