Jake Hemstad (jrhemstad)

Company: @NVIDIA

Location: Minneapolis, MN

Jake Hemstad's repositories

two_largest

Adventure in profiling and optimization.

Language: C++ | License: Apache-2.0 | Stargazers: 7 | Issues: 3 | Issues: 0

cuda_scalar_result

Answering "What is the faster way to return a single scalar from a kernel to host?"

Language: CMake | License: Apache-2.0 | Stargazers: 6 | Issues: 3 | Issues: 0

example_cuda_benchmark

Template repository for CUDA enabled benchmarks using Google Benchmark

Language: CMake | License: Apache-2.0 | Stargazers: 6 | Issues: 2 | Issues: 0
Language: Shell | Stargazers: 4 | Issues: 2 | Issues: 0

nvtx_wrappers

This repository is deprecated and the code has moved to the official NVIDIA NVTX github repository: https://github.com/NVIDIA/NVTX

Language: C++ | License: Apache-2.0 | Stargazers: 2 | Issues: 2 | Issues: 4

creduce-example

Examples on how to use C-Reduce to create minimal compiler bug reproducers

Language: Shell | License: Apache-2.0 | Stargazers: 1 | Issues: 2 | Issues: 0

link_test

Testing linkage of function local statics

Language: C++ | Stargazers: 1 | Issues: 2 | Issues: 0

stdexec

`std::execution`, the proposed C++ framework for asynchronous and parallel programming.

Language: C++ | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0

accelerated-computing-hub

NVIDIA curated collection of educational resources related to general purpose GPU programming.

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

cccl

CUDA C++ Core Libraries

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

compiler-explorer

Run compilers interactively from your web browser and interact with the assembly

Language: Assembly | License: BSD-2-Clause | Stargazers: 0 | Issues: 1 | Issues: 0

cub

Cooperative primitives for CUDA C++.

Language: Cuda | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0
Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

cuda-api-wrappers

Thin C++-flavored wrappers for the CUDA Runtime API

Language: C++ | License: BSD-3-Clause | Stargazers: 0 | Issues: 1 | Issues: 0

cuda-quantum

C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

cudf

Python GPU DataFrame Library

Language: Cuda | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

cutlass

CUDA Templates for Linear Algebra Subroutines

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Shell | Stargazers: 0 | Issues: 0 | Issues: 0

gil_preload

Add NVTX ranges to Python GIL

Language: C++ | Stargazers: 0 | Issues: 2 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0

infra

Infrastructure to set up the public Compiler Explorer instances and compilers

Language: Python | License: BSD-2-Clause | Stargazers: 0 | Issues: 1 | Issues: 0
Stargazers: 0 | Issues: 1 | Issues: 0

libcudacxx

The NVIDIA C++ Standard Library

Language: C++ | Stargazers: 0 | Issues: 1 | Issues: 0

llm.c

LLM training in simple, raw C/CUDA

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

nvbench

CUDA Kernel Benchmarking Library

Language: Cuda | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

NVTX

The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resources in your applications.

Language: C | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

rmm

RAPIDS Memory Manager

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 3 | Issues: 0
Stargazers: 0 | Issues: 1 | Issues: 0

thrust

Thrust is a C++ parallel programming library which resembles the C++ Standard Library.

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0