andrei-pokrovsky

PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for evaluation of training and inference platforms.

MIT000

perftest

Infiniband Verbs Performance Tests

NOASSERTION000

pmem-perf-sweep

000

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

NOASSERTION000

pytorch-extension

an example of a CUDA extension for PyTorch using CuPy which computes the Hadamard product of two tensors

GPL-3.0000

pytorch-lamb

Implementation of https://arxiv.org/abs/1904.00962

MIT000

radiation-benchmarks

Benchmarks used for radiation tests

000

security-smell-detector-python-gist

000

smem

Smem memory reporting tool for Python 3

GPL-2.0000

TensorRT

TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.

Apache-2.0000

tlsf

Two-Level Segregated Fit memory allocator implementation.

000

torch-blocksparse

Block-sparse primitives for PyTorch

MIT000

torch2trt

An easy to use PyTorch to TensorRT converter

MIT000

torchscript-to-tvm

000

triton

Development repository for the Triton language and compiler

NOASSERTION000

xingtian

xingtian is a componentized library for the development and verification of reinforcement learning algorithms

MIT000

XNNPACK

NOASSERTION000