ParCoreLab

Organization data from GitHub: https://github.com/ParCoreLab

Koç University - Parallel and Multicore Computing Laboratory

Location: Istanbul

Home Page: https://parcorelab.ku.edu.tr/

GitHub: @ParCoreLab

Twitter: @didemunat

ParCoreLab's repositories

Snoopie

Multi-GPU communication profiler and visualizer

Language: C | License: NOASSERTION | Stargazers: 32 | Issues: 4 | Issues: 5

ComScribe

ComScribe is a tool to identify communication among all GPU-GPU and CPU-GPU pairs in a single-node multi-GPU system.

Language: C++ | License: BSD-3-Clause | Stargazers: 26 | Issues: 0 | Issues: 1

CPU-Free-model

Source code for the CPU-Free model - a fully autonomous execution model for multi-GPU applications that completely excludes the involvement of the CPU beyond the initial kernel launch.

Language: Cuda | License: MIT | Stargazers: 21 | Issues: 3 | Issues: 4

ReuseTracker

A fast and accurate reuse distance analyzer for multi-threaded applications. It leverages existing hardware features in commodity CPUs.

mixed-and-multi-spmv

Mixed and Multi-Precision SpMV for GPUs with Row-wise Precision Selection.

Language: Cuda | License: MIT | Stargazers: 6 | Issues: 5 | Issues: 2

SpTRSV_Framework

The SpTRSV prediction framework is an automated prediction framework for the fastest sparse triangular solve (SpTRSV) algorithm for a given input sparse matrix on a CPU-GPU platform.

Language: C++ | License: NOASSERTION | Stargazers: 6 | Issues: 0 | Issues: 0

Split_SpTRSV

The split execution framework can automatically determine the suitability of an SpTRSV for split-execution, find the appropriate split point, and execute SpTRSV in a split fashion using two SpTRSV algorithms while automatically managing any required inter-platform communication. The model is implemented as a C++/CUDA library supporting multiple CPU-GPU algorithms.

Language: C++ | License: NOASSERTION | Stargazers: 4 | Issues: 0 | Issues: 0

BeyondMoore

BeyondMoore has an ambitious goal to develop a software framework that performs static and dynamic optimizations, issues accelerator-initiated data transfers, and reasons about parallel execution strategies that exploit both processor and memory heterogeneity.

aCG

GPU-accelerated linear solvers based on the conjugate gradient (CG) method, supporting NVIDIA and AMD GPUs with GPU-aware MPI, NCCL, RCCL, or NVSHMEM.

Language: C | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

gpu-fusion

GPU fusion code and algorithm

Language: Cuda | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0
Language: C++ | Stargazers: 1 | Issues: 1 | Issues: 0

.github

Homepage README.

Stargazers: 0 | Issues: 1 | Issues: 0

accuracy-verification-microbenchmarks

The microbenchmarks that are used to verify the accuracy of ComDetective.

Language: Makefile | Stargazers: 0 | Issues: 1 | Issues: 0
Language: C++ | Stargazers: 0 | Issues: 3 | Issues: 0

CPU-Free-Model-Compiler

DaCe - Data Centric Parallel Programming

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

hpctoolkit-externals

HPCToolkit performance tools: essential third party libraries for hpctoolkit

Language: Shell | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0
Language: TypeScript | Stargazers: 0 | Issues: 2 | Issues: 0
Language: C | Stargazers: 0 | Issues: 0 | Issues: 0

AMD_IBS_Toolkit

AMD Research Instruction Based Sampling Toolkit

Language: C | Stargazers: 0 | Issues: 1 | Issues: 0
Language: C | Stargazers: 0 | Issues: 2 | Issues: 0
Language: C | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0

hpctoolkit

HPCToolkit performance tools: measurement and analysis components

Language: C++ | Stargazers: 0 | Issues: 0 | Issues: 0

snoopie-ucx-tracking-ucx

Modified ucx library to track communications

Language: C | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

splash2

Splash 2 Benchmarks

Language: C | Stargazers: 0 | Issues: 0 | Issues: 0

Uniconn

Uniconn is a unified, portable high-level C++ communication library that supports both point-to-point and collective operations across GPU clusters. Uniconn enables seamless switching between backends and APIs (host or device) with minimal or no changes to application code.

Language: Cuda | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0