Benjamin Brock's repositories
cuda-tutorial
Small CUDA tutorial for CS 267 Students
rdma-collectives
Some starter examples for a project implementing RDMA-based collectives
cusplibrary
CUSP : A C++ Templated Sparse Matrix Library
custom_allocator
Playing around with custom allocators that interoperate with STL data structures.
get_element
Implementation and examples for P2769 `std::ranges::get_element`
graphblast
High-Performance Linear Algebra-based Graph Primitives on GPUs
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
MIT000
oneMKL
oneAPI Math Kernel Library (oneMKL) Interfaces
Apache-2.0000