owensgroup

owensgroup's repositories

RXMesh

RXMesh: A GPU Mesh Data Structure - SIGGRAPH 2021

Language: C++ | License: BSD-2-Clause | Stargazers: 195 | Issues: 23 | Issues: 8

SlabHash

A warp-oriented dynamic hash table for GPUs

Language: Cuda | License: Apache-2.0 | Stargazers: 69 | Issues: 10 | Issues: 12

merge-spmm

Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018

Language: C++ | License: Apache-2.0 | Stargazers: 67 | Issues: 29 | Issues: 10

BGHT

BGHT: High-performance static GPU hash tables.

Language: C++ | License: Apache-2.0 | Stargazers: 51 | Issues: 16 | Issues: 15

GpuBTree

Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019

Language: Cuda | License: BSD-2-Clause | Stargazers: 48 | Issues: 15 | Issues: 15

MVGpuBTree

GPU B-Tree with support for versioning (snapshots).

Language: C++ | License: Apache-2.0 | Stargazers: 36 | Issues: 18 | Issues: 6

gpustats

Statistics on GPUs

Language: HTML | License: BSD-3-Clause | Stargazers: 28 | Issues: 27 | Issues: 3

ATOS

Multi-GPU dynamic scheduler using PGAS-style cross-GPU communication

Language: Cuda | Stargazers: 25 | Issues: 18 | Issues: 0

UnifiedShaderSpecialization

Source code supporting the High Performance Graphics 2022 paper: Supporting Unified Shader Specialization by Co-opting C++ Features

Language: C++ | License: NOASSERTION | Stargazers: 14 | Issues: 19 | Issues: 0

push-pull

Code for paper "Implementing Push-Pull Efficiently in GraphBLAS" accepted to ICPP 2018

Language: C++ | License: Apache-2.0 | Stargazers: 10 | Issues: 27 | Issues: 2

SlabAlloc

A dynamic GPU memory allocator, suitable for warp-synchronized scenarios.

Language: Cuda | License: Apache-2.0 | Stargazers: 9 | Issues: 6 | Issues: 4

merge-spmv

Fork of Duane Merrill's Merge-based Parallel Sparse Matrix-Vector Multiplication artifact

Language: Cuda | License: BSD-3-Clause | Stargazers: 6 | Issues: 28 | Issues: 0

csgm

CUDA implementation of seeded graph matching

GPUQuotientFilters

Implementations of two types of quotient filters using GPUs

Language: Cuda | License: Apache-2.0 | Stargazers: 4 | Issues: 16 | Issues: 0

ml_perf_model

ML performance model for GPU training of DLRM and more.

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 4 | Issues: 17 | Issues: 11

sparsify.me

A simple C++14 and CUDA-based header-only library with tools for sparse machine learning.

sssp

Single-Source Shortest Path (SSSP) implementation in modern C++ for 2022 IPDPS workshop on Graphs, Architectures, Programming, and Learning (GrAPL 2022) submission.

Language: C++ | License: Apache-2.0 | Stargazers: 3 | Issues: 1 | Issues: 0

hiptracer

Capture and/or instrument HIP API calls

Language: C | Stargazers: 2 | Issues: 17 | Issues: 0

optix_splats

Testing different OIT methods for Gaussian Splatting

Language: C++ | Stargazers: 2 | Issues: 0 | Issues: 5

Osama-Exit-Seminar

Muhammad Osama's exit seminar slides and abstract.

Language: TeX | Stargazers: 1 | Issues: 3 | Issues: 0

rtx_nerf

An implementation of NeRF acceleration using RTX cores to compute ray-grid intersections

Language: C | License: GPL-3.0 | Stargazers: 1 | Issues: 17 | Issues: 7

TrafficSignBench

A benchmark for deep learning frameworks on traffic sign classification/detection tasks on GPU and FPGA

Language: Python | Stargazers: 1 | Issues: 27 | Issues: 0

GPUMaximumClique

A maximum clique solver for GPUs

Language: Cuda | License: Apache-2.0 | Stargazers: 0 | Issues: 18 | Issues: 0

application_classification

CUDA implementation of application classification via belief propagation

Language: Cuda | Stargazers: 0 | Issues: 6 | Issues: 2
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

gunrock

High-Performance Graph Primitives on GPUs

Language: Cuda | License: Apache-2.0 | Stargazers: 0 | Issues: 28 | Issues: 0

NRLib

Neural Rendering Library

Language: Cuda | Stargazers: 0 | Issues: 0 | Issues: 0

pytorch_block_sparse

Fast Block Sparse Matrices for PyTorch

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0