Akif Aydogmus's starred repositories

cppbestpractices

Collaborative Collection of C++ Best Practices. This online resource is part of Jason Turner's collection of C++ Best Practices resources. See README.md for more information.

cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

Language:CLicense:NOASSERTIONStargazers:5854Issues:117Issues:225

tiny-cuda-nn

Lightning fast C++/CUDA neural network framework

Language:C++License:NOASSERTIONStargazers:3576Issues:50Issues:376

libcudacxx

[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl

Language:C++License:NOASSERTIONStargazers:2295Issues:67Issues:94

cub

[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl

Language:CudaLicense:BSD-3-ClauseStargazers:1661Issues:89Issues:281

code-samples

Source code examples from the Parallel Forall Blog

Language:HTMLLicense:BSD-3-ClauseStargazers:1209Issues:115Issues:25

MatX

An efficient C++17 GPU numerical computing library with Python-like syntax

Language:C++License:BSD-3-ClauseStargazers:1163Issues:26Issues:166

dace

DaCe - Data Centric Parallel Programming

Language:PythonLicense:BSD-3-ClauseStargazers:480Issues:17Issues:338

nvbench

CUDA Kernel Benchmarking Library

Language:CudaLicense:Apache-2.0Stargazers:452Issues:18Issues:90

syclacademy

SYCL Academy, a set of learning materials for SYCL heterogeneous programming

Language:HTMLLicense:CC-BY-SA-4.0Stargazers:432Issues:27Issues:57

cudnn-frontend

cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it

Language:C++License:MITStargazers:379Issues:15Issues:42

gpuocelot

GPUOCelot: A dynamic compilation framework for PTX

Language:C++License:BSD-3-ClauseStargazers:276Issues:33Issues:106

Content

Links, slide decks and other material for conference & meetup talks, podcast appearances and publications.

CMake-Best-Practices

CMake Best Practices, by Packt Publishing

Language:CMakeLicense:MITStargazers:189Issues:11Issues:9

rocprofiler

ROC profiler library. Profiling with perf-counters and derived metrics.

Language:CLicense:NOASSERTIONStargazers:119Issues:26Issues:98

SYCL-Docs

SYCL Open Source Specification

Language:JavaScriptLicense:NOASSERTIONStargazers:108Issues:32Issues:223

roctracer

ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs

Language:C++License:NOASSERTIONStargazers:65Issues:22Issues:46

scattered

C++ Scattered Containers

Events

A simple proof-of-concept for C++11 event dispatching

Language:C++License:MITStargazers:53Issues:7Issues:4

BabelViscoFDTD

Software library for FDTD of viscoelastic equation using a staggered grid arrangement with support for GPU and CPU backends

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:49Issues:5Issues:5

cuDNN-sample

cuDNN sample codes provided by Nvidia

Language:C++License:MITStargazers:41Issues:2Issues:2

bookmarks

my bookmarks

License:Artistic-2.0Stargazers:36Issues:5Issues:0

SirMetal

A metal based game engine

Language:C++License:MITStargazers:32Issues:1Issues:1

vector_addition_tutorials

This repository stores all of the OLCF vector addition tutorials

Language:SwiftLicense:MITStargazers:11Issues:5Issues:0

cuDNN-API

Pure C++ high-level API for cuDNN.

Language:CudaLicense:MITStargazers:6Issues:3Issues:1

cunda

Nvidia CUDA + OpenCV for apps

Language:C++License:MITStargazers:1Issues:2Issues:0