Andrew Kerr's repositories
AITemplate
AITemplate is a Python framework which render neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Language:PythonApache-2.0000
cudnn-frontend
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
Language:C++MIT000
cutlass
CUDA Templates for Linear Algebra Subroutines
Language:C++BSD-3-Clause000
MatX
An efficient C++17 GPU numerical computing library with Python-like syntax
Language:C++NOASSERTION000
notcurses
blingful character graphics/TUI library. definitely not curses.
Language:CNOASSERTION000
omphalos
A tool for network enumeration and domination.
Language:CGPL-3.0000
SHARK-Runtime
Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark
Language:C++Apache-2.0000