Gabriel Dos Santos's repositories
lattice-boltzmann-method
Optimization of a LBM using hybrid parallelization (MPI and OpenMP) and agressive intrinsics vectorization.
in512-systems-programming
Lab corrections for the 3rd year IN512 - Systems Programming course at the University of Versailles - Saint-Quentin-en-Yvelines (UVSQ)
CUDA-dgemm
Simple CUDA dgemm for row-major, non-transposed layouts.
master-thesis
Master's thesis on Rust and GPU programming at CEA/Paris-Saclay University
parallel-architectures
Labs for class Parallel Architectures in M1 HPCS at Paris-Saclay University.
interpol
Interposition library to trace and profile non-blocking MPI calls.
kokkos-comm
Unofficial MPI Wrapper for Kokkos
stencil-cgg
Highly optimized 3D-stencil from CGG.
sve-string-routines-benchmarks
Comparative performance benchmarks for implementations of Arm SVE optimized string routines.
advanced-numerical-methods
Labs for the Advanced Programming of Numerical Methods course of the HPCS master at University Paris-Saclay.
aoc-23-cpp
Advent of Code 2023 in C++
arm-deinterleaving-loads
Benchmarking Arm SIMD de-interleaving loads against scalar instructions.
cpc
Text calculator with support for units and conversion
CUDA-image-processing
Simple image processing filters for both CPU and NVIDIA GPUs
dssgabriel
My personal repo
eurocc_cfd
CFD code in Rust, C, and Fortran
helix
A post-modern modal text editor.
internship_gratifications
Outil de calcul du nombre d'heures de travail et de la gratification résultante
learning-sycl
Learning SYCL with Intel DPC++
llama.cpp
Port of Facebook's LLaMA model in C/C++
molecular-simulation
Introduction to Molecular Simulation course project at Paris-Saclay University's HPCS master
obhpc
Labs for class Basic Tools of HPC in M1 HPCS at Paris-Saclay University.
optimized-routines
Optimized implementations of various library functions for ARM architecture processors
TeenAstro
an easy to build and use telescope controller