Marek Kolodziej's repositories
open-gpu-doc
Documentation of NVIDIA chip/hardware interfaces
awesome-reMarkable
A curated list of projects related to the reMarkable tablet
bazel-examples
Examples of Bazel use
bdf
Avnet Board Definition Files
brevitas
Brevitas: quantization-aware training in PyTorch
data-parallel-CPP
Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben Ashbaugh, James Brodman, Michael Kinsner, John Pennycook, Xinmin Tian (Apress, 2020).
direwolf
Dire Wolf is a software "soundcard" AX.25 packet modem/TNC and APRS encoder/decoder. It can be used stand-alone to observe APRS traffic, as a tracker, digipeater, APRStt gateway, or Internet Gateway (IGate). For more information, see https://github.com/wb2osz/direwolf/blob/dev/doc/README.md
Get_Moving_With_Alveo
For publishing the source for UG1352 "Get Moving with Alveo"
gradient-checkpointing
Make huge neural nets fit in memory
iree
👻
MinkowskiEngine
Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
nandland
All code found on nandland is here.
nvidia_libs_test
Tests and benchmarks for cuDNN (and, in the future, other NVIDIA libraries)
raytracinginoneweekendincuda
The code for the ebook Ray Tracing in One Weekend by Peter Shirley translated to CUDA by Roger Allen. This work is in the public domain.
rpi-gpio-dma-demo
High-performance GPIO writes using the CPU and DMA on the Raspberry Pi
rules_cuda
Starlark implementation of Bazel rules for CUDA.
rules_cuda_examples
This repository holds the extended examples for rules_cuda.
spconv
Spatial Sparse Convolution Library
TensorRT
TensorRT is a C++ library for high-performance inference on NVIDIA GPUs and deep learning accelerators.
torch2trt
An easy-to-use PyTorch-to-TensorRT converter
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including 8-bit floating point (FP8) precision on Hopper GPUs, providing better performance with lower memory utilization in both training and inference.
Vitis_Accel_Examples