Millad's repositories
CoMD-OpenACC
Implementation of CoMD with OpenACC 2.6 (PGI 18)
deepcopy-benchmark
A set of microbenchmarks for deep copy in directive-based programming models
Gecko-BabelStream
STREAM benchmark for Gecko - Using BabelStream repo.
gecko-rodinia
Gecko version of Rodinia benchmark
rapl-wrapper
RAPL wrapper for Intel Processors using MSRs
java-c-binding
Good resources for Java and C binding
python-c-extensions-tut
Python C Extensions Tutorial
acc-mini-bench
Some OpenACC mini-benchmarks
cuda-bandwidth
Measuring PCI-e bandwidth for devices supporting CUDA.
gecko-microbench
Microbenchmarks for Gecko
llama2.cpp
Inference Llama 2 in C++
micrograd
C++ version of Andrej Karpathy's tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
mypylib
Some useful Python libs
openuh
OpenUH Compiler - OMPT Support for OpenMP Runtime Library
openuh-libopenmp-ompt
OpenMP 3.0 Library with OMPT 1.0 Support for OpenUH Compiler
overrideCUDAFunctions
How to override CUDA functions with PRELOADing
pointerchain
The pointerchain Directive