owensgroup

owensgroup's repositories

RXMesh

RXMesh: A GPU Mesh Data Structure - SIGGRAPH 2021

Language:C++BSD-2-Clause195 23 8

SlabHash

A warp-oriented dynamic hash table for GPUs

Language:CudaApache-2.069 10 12

merge-spmm

Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018

Language:C++Apache-2.067 29 10

BGHT

BGHT: High-performance static GPU hash tables.

Language:C++Apache-2.051 16 15

GpuBTree

Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019

Language:CudaBSD-2-Clause48 15 15

MVGpuBTree

GPU B-Tree with support for versioning (snapshots).

Language:C++Apache-2.036 18 6

gpustats

Statistics on GPUs

Language:HTMLBSD-3-Clause28 27 3

ATOS

Multi-GPU dynamic scheduler using PGAS style cross-GPU communication

Language:Cuda25 180

UnifiedShaderSpecialization

Source code supporting the High Performance Graphics 2022 paper: Supporting Unified Shader Specialization by Co-opting C++ Features

Language:C++NOASSERTION14 190

push-pull

Code for paper "Implementing Push-Pull Efficiently in GraphBLAS" accepted to ICPP 2018

Language:C++Apache-2.010 27 2

SlabAlloc

A dynamic GPU memory allocator, suitable for warp synchronized scenarios.

Language:CudaApache-2.09 6 4

merge-spmv

Fork of Duane Merrill Merge-based Parallel Sparse Matrix-Vector Multiplication Artifact

Language:CudaBSD-3-Clause6 280

csgm

CUDA implementation of seeded graph matching

Language:Cuda5 4 1

GPUQuotientFilters

Implementations of two types of quotient filters using GPUs

Language:CudaApache-2.04 160

ml_perf_model

ML performance model for GPU training of DLRM and more.

Language:Jupyter NotebookBSD-3-Clause4 17 11

harmonic_cuda

sparsify.me

A simple C++14 and CUDA-based header-only library with tools for sparse-machine learning.

Language:C++3 4 7

sssp

Single-Source Shortest Path (SSSP) implementation in modern C++ for 2022 IPDPS workshop on Graphs, Architectures, Programming, and Learning (GrAPL 2022) submission.

Language:C++Apache-2.03 10

hiptracer

Capture and / or instrument HIP API calls

Language:C2 170

optix_splats

Testing different OIT methods for Gaussian Splatting

Language:C++205

graphblas_proj

Language:Cuda1 40

Osama-Exit-Seminar

Muhammad Osama's exit seminar slides and abstract.

Language:TeX1 30

rtx_nerf

An implementation of NeRF acceleration using RTX cores to compute ray-grid intersections

Language:CGPL-3.01 17 7

TrafficSignBench

A benchmark for deep learning frameworks on traffic sign classification/detection task on GPU and FPGA

Language:Python1 270

GPUMaximumClique

A maximum clique solver for GPUs

Language:CudaApache-2.00180

application_classification

CUDA implementation of application classification via belief propagation

Language:Cuda06 2

dynamic_sparsity_pytorch

Language:Python010

gunrock

High-Performance Graph Primitives on GPUs

Language:CudaApache-2.00280

NRLib

Neural Rendering Library

Language:Cuda000

pytorch_block_sparse

Fast Block Sparse Matrices for Pytorch

Language:C++NOASSERTION020