There are 9 repositories under heterogeneous-parallel-programming topic.
A curated list of awesome parallel computing resources
Heterogeneous programming in Julia
Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
Abstraction Library for Parallel Kernel Acceleration :llama:
Heterogeneous Research Platform (HERO) for exploration of heterogeneous computers consisting of programmable many-core accelerators and an application-class host CPU, including full-stack software and hardware.
Concurrent CPU-GPU Programming using Task Models
GARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
OpenFPM: A scalable open framework for particle and particle-mesh codes on parallel computers
The Darmstadt Automotive Parallel HeterogeNEous (DAPHNE) Benchmark-Suite
⛔ DEPRECATED ⛔ HERO Software Development Kit
Intel LevelZero JNI library for TornadoVM
Platform-aware programming in Julia
OpenCL based lightweight c++ computing library
A library for simplified distributed computing across any heterogeneous architectures (cpu + gpu), with tensor support, and polymorphic data layouts for optimal performance! Ripple enables you to scale quickly without sacrificing performance!
High-Performance Tucker Factorization on Heterogeneous Platforms (GTA) - TPDS 2019
A tool for converting specific Julia GPU code writen in CUDA.jl, into abstract multi-backend code with KernelAbstractions.jl.
Builder script for Clang/LLVM10 compiler for Nvidia Jetson Nano (could be extended to other Jetson boards) with OpenMP 4.5 offloading support.
The Accelerator Integration Tool (AIT) automatically integrates OmpSs@FPGA accelerators into FPGA designs using different vendor backends
This version of crispy-DOOM uses 8 HW accelerators on the FPGA
We present an algorithm to dynamically adjust the data assigned for each worker at every epoch during the training in a heterogeneous cluster. We empirically evaluate the performance of the dynamic partitioning by training deep neural networks on the CIFAR10 dataset.
Heterogeneous Parallel implementation to solve the Connected Components problem using OpenMP, CUDA and OpenCL.
Weasel language is a project i created to proof of concept that we can support heterogeneous internally inside a language.
The Heterogeneous Queuing Framework utilizing Fibers
Ultra fast simulated annealing with OpenCL & multiple accelerators, GPUs, CPUs.
Projects done on STM32MP157C-DK2 Kit. Communication between Multiple Cores in mutithreaded environment using C/C++.
To accompany the paper "An efficient new static scheduling heuristic for accelerated architectures".