Nicholas Malaya (nicholasmalaya)

Company: AMD

Location: Austin, TX

Home Page: nicholasmalaya.github.io/

Twitter: @nicholasmalaya


Organizations
AMD-HPC
AMD-RIPS
libqueso
manufactured-solutions
mfem
secondfoundation

Nicholas Malaya's starred repositories

Language: C, License: NOASSERTION, Stargazers: 24, Issues: 0

OpenFOAM_HMM

Refactoring OpenFOAM with OpenMP target offloading and use of HMM to offload work onto GPUs

Language: C++, License: NOASSERTION, Stargazers: 17, Issues: 0

hipTT

HIP port of the fast GPU tensor transpose library cuTT

Language: C++, Stargazers: 4, Issues: 0
Language: C, Stargazers: 17, Issues: 0
Language: Python, License: MIT, Stargazers: 7, Issues: 0

rccl-tests

RCCL Performance Benchmark Tests

Language: Cuda, License: NOASSERTION, Stargazers: 38, Issues: 0

AITemplate

AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Language: Python, License: Apache-2.0, Stargazers: 4494, Issues: 0

composable_kernel

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

Language: C++, License: NOASSERTION, Stargazers: 264, Issues: 0

Quicksilver

A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037

Language: C++, License: NOASSERTION, Stargazers: 2, Issues: 0

CAMP

A synthetic micro-benchmark for assessing deep memory hierarchies

Language: C, License: Apache-2.0, Stargazers: 4, Issues: 0

olcf-user-docs

Sources for the Oak Ridge Leadership Computing Facility User Documentation

Language: Python, Stargazers: 56, Issues: 0
Language: C, License: Apache-2.0, Stargazers: 12, Issues: 0

rocHPL

High Performance Linpack for Next-Generation AMD HPC Accelerators

Language: C++, License: NOASSERTION, Stargazers: 40, Issues: 0

toast

Time Ordered Astrophysics Scalable Tools

Language: C++, License: NOASSERTION, Stargazers: 43, Issues: 0

hpc

Reference implementations of MLPerf™ HPC training benchmarks

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 37, Issues: 0

alphafold

Open source code for AlphaFold.

Language: Python, License: Apache-2.0, Stargazers: 12154, Issues: 0

rocHPCG

HPCG benchmark based on ROCm platform

Language: C++, License: BSD-3-Clause, Stargazers: 35, Issues: 0

hiop

HPC solver for nonlinear optimization problems

Language: C++, License: NOASSERTION, Stargazers: 208, Issues: 0

sysconfidence

System Confidence - a system latency analysis benchmark

Language: C, License: NOASSERTION, Stargazers: 8, Issues: 0
Language: C++, License: MIT, Stargazers: 8, Issues: 0
Language: Jupyter Notebook, Stargazers: 9, Issues: 0

ECP-ST-CAR-PUBLIC

The Exascale Computing Project Software Technologies Capability Assessment Report - Public Version

Language: TeX, License: BSD-2-Clause, Stargazers: 20, Issues: 0

fml

Fused Matrix Library

Language: C++, License: BSL-1.0, Stargazers: 24, Issues: 0

benchmarks

A benchmark framework for TensorFlow

Language: Python, License: Apache-2.0, Stargazers: 1140, Issues: 0

Caliper

Caliper is an instrumentation and performance profiling library

Language: C++, License: BSD-3-Clause, Stargazers: 340, Issues: 0

Random123

HIP port of Random123 library.

Language: C++, License: NOASSERTION, Stargazers: 2, Issues: 0

roctracer

ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs

Language: C++, License: NOASSERTION, Stargazers: 65, Issues: 0

gidiplus

C++ libraries for accessing nuclear data from the Generalized Nuclear Database Structure (GNDS)

Language: C++, License: MIT, Stargazers: 12, Issues: 0

staidy

Code for CFDNet and SURFNet

Language: Python, License: MIT, Stargazers: 4, Issues: 0

HIP-CPU

An implementation of HIP that works on CPUs, across OSes.

Language: C++, License: MIT, Stargazers: 107, Issues: 0