Phuong Nguyen's repositories
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
lammps-conp
Constant potential method in LAMMPS
llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
ompi
Open MPI main development repository
PlanetaryModels
Build 3D planetary models on deformable meshes
SWE
The Shallow Water Equations teaching code.
syclacademy
SYCL Academy, a set of learning materials for SYCL heterogeneous programming
ttg
TTG: Template Task Graph C++ API