a.saenko's starred repositories

atomic_queue

C++ lockless queue.

Language: C++ | License: MIT | Stargazers: 1404 | Issues: 0

HVM

A massively parallel, optimal functional runtime in Rust

Language: Cuda | License: Apache-2.0 | Stargazers: 10247 | Issues: 0

nvbench

CUDA Kernel Benchmarking Library

Language: Cuda | License: Apache-2.0 | Stargazers: 438 | Issues: 0

tensornetwork.org

Source for The Tensor Network open-source review article

Language: TeX | License: Apache-2.0 | Stargazers: 142 | Issues: 0

v2rayNG

A V2Ray client for Android that supports Xray core and v2fly core

Language: Kotlin | License: GPL-3.0 | Stargazers: 32054 | Issues: 0

antigen

The plugin manager for zsh.

Language: Shell | License: MIT | Stargazers: 7943 | Issues: 0

polymer

Bridging polyhedral analysis tools to the MLIR framework

Language: C++ | License: MIT | Stargazers: 97 | Issues: 0

Polygeist

C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!

Language: C++ | License: NOASSERTION | Stargazers: 437 | Issues: 0

FXdiv

C99/C++ header-only library for division via fixed-point multiplication by inverse

Language: C++ | License: MIT | Stargazers: 44 | Issues: 0

taskflow

A General-purpose Task-parallel Programming System using Modern C++

Language: C++ | License: NOASSERTION | Stargazers: 9728 | Issues: 0

cutt

CUDA Tensor Transpose (cuTT) library

Language: C++ | Stargazers: 9 | Issues: 0

tpp-mlir

TPP experimentation on MLIR for linear algebra

Language: MLIR | License: NOASSERTION | Stargazers: 99 | Issues: 0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python | License: Apache-2.0 | Stargazers: 38274 | Issues: 0

mlir-tcp

Tensor Compute Primitives: Mid-level Intermediate Representation for Machine Learning Programs

Language: MLIR | License: NOASSERTION | Stargazers: 20 | Issues: 0

kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1483 | Issues: 0

SHARK

SHARK - High Performance Machine Learning Distribution

Language: Python | License: Apache-2.0 | Stargazers: 1399 | Issues: 0
Language: MLIR | Stargazers: 379 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 10 | Issues: 0

MODel_opt

Memory Optimizations for Deep Learning (ICML 2023)

Language: Python | License: MIT | Stargazers: 57 | Issues: 0

conan

Conan - The open-source C and C++ package manager

Language: Python | License: MIT | Stargazers: 7950 | Issues: 0

onnx-tensorflow

TensorFlow Backend for ONNX

Language: Python | License: NOASSERTION | Stargazers: 1255 | Issues: 0

tensorflow-onnx

Convert TensorFlow, Keras, TensorFlow.js and TFLite models to ONNX

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2256 | Issues: 0

CMSIS-NN

CMSIS-NN Library

Language: C | License: Apache-2.0 | Stargazers: 166 | Issues: 0

stablehlo

Backward compatible ML compute opset inspired by HLO/MHLO

Language: MLIR | License: Apache-2.0 | Stargazers: 356 | Issues: 0

c-blosc

A blocking, shuffling and loss-less compression library that can be faster than `memcpy()`.

Language: C | License: NOASSERTION | Stargazers: 972 | Issues: 0

onnx

Open standard for machine learning interoperability

Language: Python | License: Apache-2.0 | Stargazers: 17190 | Issues: 0

mindspore

MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.

Language: C++ | License: Apache-2.0 | Stargazers: 4132 | Issues: 0

conan-center-index

Recipes for the ConanCenter repository

Language: Python | License: MIT | Stargazers: 919 | Issues: 0

edotor.net

Your favourite Graphviz editor

Language: TypeScript | License: MIT | Stargazers: 175 | Issues: 0

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language: Python | License: Apache-2.0 | Stargazers: 1590 | Issues: 0