jrhemstad

followers

0

following

stars

@NVIDIA

Minneapolis, MN

Jake Hemstad's repositories

two_largest

Adventure in profiling and optimization.

Language:C++Apache-2.07 30

cuda_scalar_result

Answering "What is the faster way to return a single scalar from a kernel to host?"

Language:CMakeApache-2.06 30

example_cuda_benchmark

Template repository for CUDA enabled benchmarks using Google Benchmark

Language:CMakeApache-2.06 20

cuda_arch_odr

Language:Shell4 20

nvtx_wrappers

This repository is deprecated and the code has moved to the official NVIDIA NVTX github repository: https://github.com/NVIDIA/NVTX

Language:C++Apache-2.02 2 4

creduce-example

Examples on how to use C-Reduce to create minimal compiler bug reproducers

Language:ShellApache-2.01 20

link_test

Testing linkage of function local statics

Language:C++1 20

stdexec

`std::execution`, the proposed C++ framework for asynchronous and parallel programming.

Language:C++Apache-2.0100

.github

000

accelerated-computing-hub

NVIDIA curated collection of educational resources related to general purpose GPU programming.

NOASSERTION000

cccl

CUDA C++ Core Libraries

Language:C++NOASSERTION000

compiler-explorer

Run compilers interactively from your web browser and interact with the assembly

Language:AssemblyBSD-2-Clause010

cub

Cooperative primitives for CUDA C++.

Language:CudaBSD-3-Clause000

cuCollections

Language:C++Apache-2.0010

cuda-api-wrappers

Thin C++-flavored wrappers for the CUDA Runtime API

Language:C++BSD-3-Clause010

cuda-quantum

C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

Language:C++NOASSERTION000

cudf

Python GPU DataFrame Library

Language:CudaApache-2.0010

cutlass

CUDA Templates for Linear Algebra Subroutines

Language:C++NOASSERTION000

devcontainers

Language:Shell000

gil_preload

Add NVTX ranges to Python GIL

Language:C++020

github-markdown

000

infra

Infrastructure to set up the public Compiler Explorer instances and compilers

Language:PythonBSD-2-Clause010

jrhemstad

010

libcudacxx

The NVIDIA C++ Standard Library

Language:C++010

llm.c

LLM training in simple, raw C/CUDA

MIT000

nvbench

CUDA Kernel Benchmarking Library

Language:CudaApache-2.0010

NVTX

The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resources in your applications.

Language:CApache-2.0010

rmm

RAPIDS Memory Manager

Language:C++Apache-2.0030

test_workflow_failure

010

thrust

Thrust is a C++ parallel programming library which resembles the C++ Standard Library.

Language:C++NOASSERTION010