openacc

There are 8 repositories under openacc topic.

llama2.c
trholding / llama2.c
Llama 2 Everywhere (L2E)
ape cosmopolitan multios llama2c accelerate blas blis c cblas clblast llama2 llm mkl openacc openblas openmp unikernel armpl linux-kernel os
Language:C 1380
openhackathons-org / gpubootcamp
This repository consists for gpu bootcamp material for HPC and AI
ai4hpc cuda data-science deep-learning deepstream gpu hpc machine-learning mpi openacc openmp rapidsai
Language:Jupyter Notebook 495
ParRes / Kernels
This is a set of simple programs that can be used to explore the features of a parallel platform.
parallel-programming c c-plus-plus mpi fortran2008 python3 julia pgas openmp shmem coarray-fortran threading kokkos opencl sycl hpc parallel openacc
Language:C 401
pyccel / pyccel
Python extension language using accelerators
hpc mpi openmp fortran dsl sympy openacc python python3 transpiler
Language:Python 330
alpaka-group / alpaka
Abstraction Library for Parallel Kernel Acceleration :llama:
cuda hpc gpu rocm hip openmp openacc heterogeneous-parallel-programming cpp header-only tbb cpp17
Language:C++ 321
BabelStream
UoB-HPC / BabelStream
STREAM, for lots of devices written in many programming models
memory-bandwidth benchmark parallel-processing gpgpu opencl cuda kokkos raja sycl openmp openacc hpc gpu
Language:C++ 309
eric2003 / OneFLOW
LargeScale Multiphysics Scientific Simulation Environment-OneFLOW CFD
cfd c-plus-plus fluid fluid-dynamics navier-stokes cgns multiphysics turbulence parallel mpi hdf5 simulation gpu openacc cuda
Language:C++ 240
ROCm / gpufort
GPUFORT: S2S translation tool for CUDA Fortran and Fortran+X in the spirit of hipify
gpgpu gpu cuda rocm hip fortran cuda-fortran openacc openmp interoperability
Language:Fortran 159
MFlowCode / MFC
Exascale multiphase flow simulation
multiphase-flow computational-fluid-dynamics gpu openacc hpc-applications exascale fortran amdgpu instinct nvidia-gpu
Language:Fortran 119
OpenACCUserGroup / openacc-users-group
openacc parallel-computing training-materials codes
Language:C 80
OpenACC / openacc-training-materials
Training materials provided by OpenACC.org.
cplusplus fortran gpu openacc
Language:C 73
jeng1220 / openacc_fortran_examples
Simple OpenACC Fortran Examples
openacc fortran cuda
Language:Fortran 49
jefflarkin / openacc-interoperability
Interoperability examples for OpenACC.
cuda fortran gpu openacc
Language:C 46
claw-project / claw-compiler
CLAW Compiler for Performance Portability
compiler omni-compiler claw-language source-to-source java hpc openacc openmp c accelerator fortran fortran-compiler directives xcodeml-translator transformations translator code-transformation transpiler language
Language:Java 40
predsci / POT3D
POT3D: High Performance Potential Field Solver
cuda-aware-mpi do-concurrent gpu multi-gpu openacc potential-fields stdpar
Language:Fortran 39
usnistgov / hiperc
High Performance Computing Strategies for Boundary Value Problems
phase-field gpgpu xeon-phi materials-science computational-science cuda openacc diffusion-equation scientific-computing diffusion gpu gpu-computing finite-difference shared-memory-parallel
Language:HTML 37
Hopobcn / FWI
RTM
fwi openacc cuda cmake bsc
Language:C 36
elphbolt
nakib / elphbolt
A solver for the coupled and decoupled electron and phonon Boltzmann transport equations.
boltzmann-transport drag-effect electron-phonon-coupling phonon-phonon-coupling coarray-fortran ab-initio-simulations thermal-conductivity charge-conductivity thermoelectricity gpu-acceleration openacc opencoarrays
Language:Fortran 36
jnbntz / gpu-edu-workshops
Code examples for CUDA and OpenACC
cuda gpu openacc
Language:Cuda 35
openhackathons-org / nways_accelerated_programming
N-Ways to GPU Programming Bootcamp
cuda gpu hpc openacc openmp nsight-systems standard-language-parallelism
Language:Jupyter Notebook 34
intel / intel-application-migration-tool-for-openacc-to-openmp
OpenACC* to OpenMP* API assisting migration tool
openacc openmp
Language:Python 29
OpenACC / openacc-best-practices-guide
The sources for the OpenACC Programming and Best Practices Guide.
cplusplus fortran gpu openacc
Language:TeX 28
MaxStrange / pyACC
OpenACC for Python
openacc parallelize python
Language:Python 20
ShadyBoukhary / GPU-research-FFT-OpenACC-CUDA
Case studies constitute a modern interdisciplinary and valuable teaching practice which plays a critical and fundamental role in the development of new skills and the formation of new knowledge. This research studies the behavior and performance of two interdisciplinary and widely adopted scientific kernels, a Fast Fourier Transform and Matrix Multiplication. Both routines are implemented in the two current most popular many-core programming models CUDA and OpenACC. A Fast Fourier Transform (FFT) samples a signal over a period of time and divides it into its frequency components, computing the Discrete Fourier Transform (DFT) of a sequence. Unlike the traditional approach to computing a DFT, FFT algorithms reduce the complexity of the problem from O(n2) to O(nLog2n). Matrix multiplication is a cornerstone routine in Mathematics, Artificial Intelligence and Machine Learning. This research also shows that the nature of the problem plays a crucial role in determining what many-core model will provide the highest benefit in performance.
fft cuda openacc gpu-programming gpu-acceleration gpu-computing pgi-compiler pgi nvcc parallel-computing acceleration fast-fourier-transform radix-2
Language:Cuda 10
capellil / IHPCSS_Programming_challenge_2019
The repository containing everything you need to compete in the IHPCSS 2019 programming challenge.
ihpcss challenge hpc mpi mpi-openmp mpi-openacc openmp openacc
Language:Fortran 9
szaghi / FUNDAL
Fortran UNified Device Acceleration Library
fortran gpu hpc openacc openmp parallel-computing
Language:Fortran 8
wazedxwxx / IBTFO
Immersed Boundary method fast Test Facility based OpenAcc
c-plus-plus cfd finite-difference gpu-acceleration immersed-boundary-method openacc
Language:C++ 8
dc-fukuoka / jacobi
jacobi - a benchmark by solving 2D laplace equation with jacobi iterative method. GPU or Xeon Phi can be used.
fortran openmp openacc mpi high-performance-computing xeon-phi gpu-computing benchmark jacobi-relaxation
Language:Fortran 7
mnicely / computeWorks_examples
Matrix multiplication example performed with OpenMP, OpenACC, BLAS, cuBLABS, and CUDA
openmp openacc blas cublas cuda docker nvidia-docker nvidia pgi-compiler nsight eclipse-plugin
Language:C++ 6
PawseySC / sc20-gpu-offloading
Materials for "Differences between OpenACC and OpenMP offloading models" tutorial.
openacc openmp gpu-programming gpu-acceleration laplace-equation tutorial
Language:C 6
stfc / PSycloneBench
Various benchmarks used to inform PSyclone optimisations
benchmark fortran gpu-acceleration kokkos mpi openacc opencl openmp optimization
Language:Fortran 6
eafit-apolo / 2DPartInt
Soil particles contact simulation
geomechanics civil-engineering hpc gpu openacc
Language:C 5
olcf-tutorials / openmp_offloading
OpenMP programming tips for GPU offloading
cuda cuda-fortran gpu openacc openmp
Language:C++ 5
OpenACC / openacc-interoperability-examples
Interoperability examples for OpenACC.
cplusplus cuda fortran gpu openacc
Language:C 5
tan2 / geoflac
Code for lithospheric scale geodynamics
openmp openacc f90
Language:Fortran 5
yasahi-hpc / P3-miniapps
Kinetic plasma simulation code parallelized with C++ parallel algorithm
gpu high-performance-computing kokkos mpi openacc openmp stdpar
Language:C++ 4

openacc

trholding / llama2.c

openhackathons-org / gpubootcamp

ParRes / Kernels

pyccel / pyccel

alpaka-group / alpaka

UoB-HPC / BabelStream

eric2003 / OneFLOW

ROCm / gpufort

MFlowCode / MFC

OpenACCUserGroup / openacc-users-group

OpenACC / openacc-training-materials

jeng1220 / openacc_fortran_examples

jefflarkin / openacc-interoperability

claw-project / claw-compiler

predsci / POT3D

usnistgov / hiperc

Hopobcn / FWI

nakib / elphbolt

jnbntz / gpu-edu-workshops

openhackathons-org / nways_accelerated_programming

intel / intel-application-migration-tool-for-openacc-to-openmp

OpenACC / openacc-best-practices-guide

MaxStrange / pyACC

ShadyBoukhary / GPU-research-FFT-OpenACC-CUDA

capellil / IHPCSS_Programming_challenge_2019

szaghi / FUNDAL

wazedxwxx / IBTFO

dc-fukuoka / jacobi

mnicely / computeWorks_examples

PawseySC / sc20-gpu-offloading

stfc / PSycloneBench

eafit-apolo / 2DPartInt

olcf-tutorials / openmp_offloading

OpenACC / openacc-interoperability-examples

tan2 / geoflac

yasahi-hpc / P3-miniapps