owensgroup

Organization data from GitHub: https://github.com/owensgroup

GitHub: @owensgroup

owensgroup's repositories

RXMesh

GPU-accelerated triangle mesh processing

Language: Cuda | License: BSD-2-Clause | Stargazers: 278 | Issues: 24 | Issues: 10

SlabHash

A warp-oriented dynamic hash table for GPUs

Language: Cuda | License: Apache-2.0 | Stargazers: 76 | Issues: 10 | Issues: 13

merge-spmm

Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018

Language: C++ | License: Apache-2.0 | Stargazers: 73 | Issues: 27 | Issues: 10

BGHT

BGHT: High-performance static GPU hash tables.

Language: C++ | License: Apache-2.0 | Stargazers: 72 | Issues: 15 | Issues: 15

GpuBTree

Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019

Language: Cuda | License: BSD-2-Clause | Stargazers: 57 | Issues: 14 | Issues: 15

MVGpuBTree

GPU B-Tree with support for versioning (snapshots).

Language: C++ | License: Apache-2.0 | Stargazers: 50 | Issues: 17 | Issues: 6

gpustats

Statistics on GPUs

Language: HTML | License: BSD-3-Clause | Stargazers: 31 | Issues: 26 | Issues: 3

ATOS

Multi-GPU dynamic scheduler using PGAS style cross-GPU communication

Language: Cuda | Stargazers: 29 | Issues: 16 | Issues: 0

UnifiedShaderSpecialization

Source code supporting the High Performance Graphics 2022 paper: Supporting Unified Shader Specialization by Co-opting C++ Features

Language: C++ | License: NOASSERTION | Stargazers: 14 | Issues: 18 | Issues: 0

SlabAlloc

A dynamic GPU memory allocator, suitable for warp-synchronized scenarios.

Language: Cuda | License: Apache-2.0 | Stargazers: 11 | Issues: 4 | Issues: 4

ml_perf_model

ML performance model for GPU training of DLRM and more.

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 10 | Issues: 17 | Issues: 11

push-pull

Code for paper "Implementing Push-Pull Efficiently in GraphBLAS" accepted to ICPP 2018

Language: C++ | License: Apache-2.0 | Stargazers: 10 | Issues: 27 | Issues: 2

csgm

CUDA implementation of seeded graph matching

GPUQuotientFilters

Implementations of two types of quotient filters using GPUs

Language: Cuda | License: Apache-2.0 | Stargazers: 5 | Issues: 17 | Issues: 0

optix_splats

Testing different OIT methods for Gaussian Splatting

sparsify.me

A simple C++14 and CUDA-based header-only library with tools for sparse machine learning.

sssp

Single-Source Shortest Path (SSSP) implementation in modern C++ for a submission to the 2022 IPDPS Workshop on Graphs, Architectures, Programming, and Learning (GrAPL 2022).

Language: C++ | License: Apache-2.0 | Stargazers: 4 | Issues: 1 | Issues: 0

GPUMaximumClique

A maximum clique solver for GPUs

Language: Cuda | License: Apache-2.0 | Stargazers: 3 | Issues: 18 | Issues: 0

RXMeshTemplate

A template showing how to use RXMesh

Language: CMake | Stargazers: 3 | Issues: 0 | Issues: 0

hiptracer

Capture and/or instrument HIP API calls

Language: C | Stargazers: 2 | Issues: 17 | Issues: 0

rtx_nerf

An implementation of NeRF acceleration using RTX cores to compute ray-grid intersections

Language: C | License: GPL-3.0 | Stargazers: 2 | Issues: 17 | Issues: 7

Osama-Exit-Seminar

Muhammad Osama's exit seminar slides and abstract.

Language: TeX | Stargazers: 1 | Issues: 3 | Issues: 0

pytorch_block_sparse

Fast Block Sparse Matrices for PyTorch

Language: C++ | License: NOASSERTION | Stargazers: 1 | Issues: 1 | Issues: 0

TrafficSignBench

A benchmark for deep learning frameworks on traffic sign classification/detection tasks on GPU and FPGA

Language: Python | Stargazers: 1 | Issues: 26 | Issues: 0

application_classification

CUDA implementation of application classification via belief propagation

Language: Cuda | Stargazers: 0 | Issues: 6 | Issues: 2
Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

gunrock

High-Performance Graph Primitives on GPUs

Language: Cuda | License: Apache-2.0 | Stargazers: 0 | Issues: 28 | Issues: 0

NRLib

Neural Rendering Library

Language: Cuda | Stargazers: 0 | Issues: 1 | Issues: 0