There are 43 repositories under the gpgpu topic.
John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs
Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.
A .NET library to run C# code in parallel on the GPU through DX12, D2D1, and dynamically generated HLSL compute and pixel shaders, with the goal of making GPU computing easy to use for all .NET developers! 🚀
General-purpose GPU compute framework built on Vulkan to support 1000s of cross-vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing use cases. Backed by the Linux Foundation.
A GPU accelerated image and video processing framework built on Metal.
Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
A modular and open-ended toolkit for WebGPU, with advanced type inference and the ability to write shaders in TypeScript
A GPU-accelerated computing library for running physics simulations and other GPGPU computations in a web browser.
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, CUDA and OpenCL backends
Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver
Fast Clojure Matrix Library
BitCracker is the first open source password cracking tool for memory units encrypted with BitLocker
Thin, unified, C++-flavored wrappers for the CUDA APIs
OpenCL for Rust
A GPU Particle System for Unity ✨capable of simulating and rendering millions of particles at once 💥
Minimal Example of Using Vulkan for Compute Operations. Only ~400 LOC.
Vulkan, OptiX and CUDA Interoperation Modular Rendering Library and Framework for PC/Linux/Android
This is a list of useful libraries and resources for CUDA development.