Andrei Pokrovsky (andrei-pokrovsky)

andrei-pokrovsky

Geek Repo

Company:Uber

Github PK Tool:Github PK Tool

Andrei Pokrovsky's repositories

BERT-ONNX

BERT ONNX PRE/POST - OPTIMIZATION

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

ConcurrentDeque

Fast, generalized, implementation of the Chase-Lev lock-free work-stealing deque for C++17

License:MPL-2.0Stargazers:0Issues:0Issues:0

CudaSharedPtr

Shared Pointer for Cuda Device Pointers and Cuda Streams, Smart Wrapper to Allocate and Deallocate Cuda Device Buffer.

License:MITStargazers:0Issues:0Issues:0

DeepSpeedExamples

Example models using DeepSpeed

License:MITStargazers:0Issues:0Issues:0

doroce-linux

A command line utility to manage the configuration of a system's high performance network interfaces for RoCE deployments

License:MITStargazers:0Issues:0Issues:0

keyboard-layout-converter

A simple python script to convert a Windows .klc keyboard layout to a Linux .xkb file

License:GPL-3.0Stargazers:0Issues:0Issues:0

likwid

Performance monitoring and benchmarking suite

License:GPL-3.0Stargazers:0Issues:0Issues:0

mpi_test

Examples and tests for MPI+CUDA with CMake

Stargazers:0Issues:0Issues:0

MuZero

An Implementation of MuZero in PyTorch and Ray for reversi

Stargazers:0Issues:0Issues:0

nccl-tests

NCCL Tests

Language:CudaLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

necklace

Distributed deep learning framework based on pytorch/mxnet/numba and nccl.

License:MITStargazers:0Issues:0Issues:0

oneCCL

oneAPI Collective Communications Library (oneCCL)

License:NOASSERTIONStargazers:0Issues:0Issues:0

onnx-opcounter

Count number of parameters / MACs / FLOPS for ONNX models.

License:Apache-2.0Stargazers:0Issues:0Issues:0

param

PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for evaluation of training and inference platforms.

License:MITStargazers:0Issues:0Issues:0

perftest

Infiniband Verbs Performance Tests

License:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

License:NOASSERTIONStargazers:0Issues:0Issues:0

pytorch-extension

an example of a CUDA extension for PyTorch using CuPy which computes the Hadamard product of two tensors

License:GPL-3.0Stargazers:0Issues:0Issues:0

pytorch-lamb

Implementation of https://arxiv.org/abs/1904.00962

License:MITStargazers:0Issues:0Issues:0

radiation-benchmarks

Benchmarks used for radiation tests

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

smem

Smem memory reporting tool for Python 3

License:GPL-2.0Stargazers:0Issues:0Issues:0

TensorRT

TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.

License:Apache-2.0Stargazers:0Issues:0Issues:0

tlsf

Two-Level Segregated Fit memory allocator implementation.

Stargazers:0Issues:0Issues:0

torch-blocksparse

Block-sparse primitives for PyTorch

License:MITStargazers:0Issues:0Issues:0

torch2trt

An easy to use PyTorch to TensorRT converter

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

triton

Development repository for the Triton language and compiler

License:NOASSERTIONStargazers:0Issues:0Issues:0

xingtian

xingtian is a componentized library for the development and verification of reinforcement learning algorithms

License:MITStargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0