owensgroup's repositories
merge-spmm
Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018
MVGpuBTree
GPU B-Tree with support for versioning (snapshots).
UnifiedShaderSpecialization
Source code supporting the High Performance Graphics 2022 paper: Supporting Unified Shader Specialization by Co-opting C++ Features
ml_perf_model
ML performance model for GPU training of DLRM and more.
GPUQuotientFilters
Implementations of two types of quotient filters using GPUs
optix_splats
Testing different OIT methods for Gaussian Splatting
sparsify.me
A simple C++14 and CUDA-based header-only library with tools for sparse-machine learning.
GPUMaximumClique
A maximum clique solver for GPUs
RXMeshTemplate
A template showing how to use RXMesh
Osama-Exit-Seminar
Muhammad Osama's exit seminar slides and abstract.
pytorch_block_sparse
Fast Block Sparse Matrices for Pytorch
TrafficSignBench
A benchmark for deep learning frameworks on traffic sign classification/detection task on GPU and FPGA
application_classification
CUDA implementation of application classification via belief propagation