ZhaoZhixiang-HEU's repositories

axc_axt_gpu_spmv

This repo contains the code to perform the SpMV product with the CSR, K1, AXC, and AXT formats. Using CUDA instructions and the CUSPARSE library.

Language:CStargazers:0Issues:0Issues:0

axc_axt_inter_spmv

This repo contains the code to perform the SpMV for the CSR, AXC, K1, and AXT formats using intrinsic instructions and the MKL library.

Language:CStargazers:0Issues:0Issues:0

Benchmark_SpMV_using_CSR

CSR-based SpMV on Heterogeneous Processors (Intel Broadwell, AMD Kaveri and nVidia Tegra K1)

Language:C++License:MITStargazers:0Issues:0Issues:0

clSPARSE

a software library containing Sparse functions written in OpenCL

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

csr2_spmv_pb

A New Format for SIMD-accelerated SpMV

Language:C++License:MITStargazers:0Issues:0Issues:0

csr5spmv_pb

CSR5-based SpMV on CPUs, GPUs and Xeon Phi

Language:C++License:MITStargazers:0Issues:0Issues:0

cudaSpmv

CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format

Language:C++Stargazers:0Issues:0Issues:0

CUMP

The CUDA Multiple Precision Arithmetic Library

Language:CLicense:GPL-3.0Stargazers:0Issues:0Issues:0

cvr_spmv_pb

Parallelized and vectorized SpMV on Intel Xeon Phi (Knights Landing, AVX512, KNL)

Language:C++License:MITStargazers:0Issues:0Issues:0
Language:CudaLicense:MITStargazers:0Issues:0Issues:0

How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:CudaLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

mixed_multi_spmv_pb

Mixed and Multi-Precision SpMV for GPUs with Row-wise Precision Selection.

License:MITStargazers:0Issues:0Issues:0

s-blas

This package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Triangular-Solve (SpTRSV), Sparse-Matrix-Transposition (SpTrans) and Sparse-Matrix-Matrix-Multiplication (SpMM) for Single-node Multi-GPU (scale-up) platforms such as NVIDIA DGX-1 and DGX-2.

Language:C++License:MITStargazers:0Issues:0Issues:0
Language:CudaStargazers:0Issues:0Issues:0

SparseP

SparseP is the first open-source Sparse Matrix Vector Multiplication (SpMV) software package for real-world Processing-In-Memory (PIM) architectures. SparseP is developed to evaluate and characterize the first publicly-available real-world PIM architecture, the UPMEM PIM architecture. Described by C. Giannoula et al. [https://arxiv.org/abs/2201.05072]

Language:CLicense:MITStargazers:0Issues:0Issues:0

sparsex

The SparseX sparse kernel optimization library

Language:C++License:BSD-3-ClauseStargazers:0Issues:0Issues:0

spgpu

spGPU library for sparse linear algebra on GPUs

Language:CudaLicense:BSD-2-ClauseStargazers:0Issues:0Issues:0

SpMP

sparse matrix pre-processing library

License:NOASSERTIONStargazers:0Issues:0Issues:0

spmv

This is a tuned sparse matrix dense vector multiplication(SpMV) library

Language:CLicense:GPL-3.0Stargazers:0Issues:0Issues:0

ssget

Command line tool for working with matrices from the SuiteSparse Matrix Collection (sparse.tamu.edu)

Language:ShellLicense:NOASSERTIONStargazers:0Issues:0Issues:0

ssgetpy

A searchable Python interface to the SuiteSparse Matrix Collection

Language:PythonStargazers:0Issues:0Issues:0

TileSpGEMM

Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Yuyao Niu, Zhengyang Lu, Haonan Ji, Shuhui Song, Zhou Jin, and Weifeng Liu.

Language:CStargazers:0Issues:0Issues:0

tilespmv_pb

Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang Lu, Meichen Dong, Zhou Jin, Weifeng Liu, and Guangming Tan.

Language:CLicense:Apache-2.0Stargazers:0Issues:0Issues:0