a.saenko's starred repositories

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:38661Issues:384Issues:1652

v2rayNG

A V2Ray client for Android, support Xray core and v2fly core

Language:KotlinLicense:GPL-3.0Stargazers:34731Issues:535Issues:2796

GoodbyeDPI

GoodbyeDPI — Deep Packet Inspection circumvention utility (for Windows)

Language:CLicense:Apache-2.0Stargazers:23789Issues:431Issues:604

HVM

A massively parallel, optimal functional runtime in Rust

Language:CudaLicense:Apache-2.0Stargazers:10441Issues:99Issues:189

taskflow

A General-purpose Task-parallel Programming System using Modern C++

Language:C++License:NOASSERTIONStargazers:10116Issues:255Issues:454

conan

Conan - The open-source C and C++ package manager

Language:PythonLicense:MITStargazers:8160Issues:134Issues:10554

antigen

The plugin manager for zsh.

Language:ShellLicense:MITStargazers:8001Issues:103Issues:373

tensorflow-onnx

Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2300Issues:58Issues:1041

kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1524Issues:29Issues:174

LenovoLegionLinux

Driver and tools for controlling Lenovo Legion laptops in Linux including fan control and power mode.

Language:CLicense:GPL-2.0Stargazers:1507Issues:18Issues:189

atomic_queue

C++ lockless queue.

Language:C++License:MITStargazers:1480Issues:44Issues:46

SHARK

SHARK - High Performance Machine Learning Distribution

Language:PythonLicense:Apache-2.0Stargazers:1412Issues:41Issues:562

onnx-tensorflow

Tensorflow Backend for ONNX

Language:PythonLicense:NOASSERTIONStargazers:1273Issues:50Issues:546

buddy-mlir

An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).

Language:C++License:Apache-2.0Stargazers:492Issues:13Issues:52

nvbench

CUDA Kernel Benchmarking Library

Language:CudaLicense:Apache-2.0Stargazers:485Issues:17Issues:96

Polygeist

C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!

Language:C++License:NOASSERTIONStargazers:469Issues:21Issues:136

cuda_hgemm

Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.

Language:CudaLicense:MITStargazers:270Issues:4Issues:12

CMSIS-NN

CMSIS-NN Library

Language:CLicense:Apache-2.0Stargazers:194Issues:11Issues:38

tensornetwork.org

Source for The Tensor Network open-source review article

Language:TeXLicense:Apache-2.0Stargazers:146Issues:10Issues:13

mlir-extensions

Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.

Language:MLIRLicense:NOASSERTIONStargazers:117Issues:18Issues:89

tpp-mlir

TPP experimentation on MLIR for linear algebra

Language:MLIRLicense:NOASSERTIONStargazers:110Issues:8Issues:222

polymer

Bridging polyhedral analysis tools to the MLIR framework

Language:C++License:MITStargazers:99Issues:9Issues:58

MODel_opt

Memory Optimizations for Deep Learning (ICML 2023)

Language:PythonLicense:MITStargazers:58Issues:36Issues:2

FXdiv

C99/C++ header-only library for division via fixed-point multiplication by inverse

Language:C++License:MITStargazers:46Issues:9Issues:1

sionnx

Auto-gen Tests Tool for ONNX Compliance

Language:LLVMLicense:NOASSERTIONStargazers:38Issues:5Issues:1

mlir-tcp

Tensor Compute Primitives: Mid-level Intermediate Representation for Machine Learning Programs

Language:MLIRLicense:NOASSERTIONStargazers:35Issues:10Issues:2

iree-comparative-benchmark

Compiler-agnostic benchmark suites for comparing projects

Language:PythonLicense:Apache-2.0Stargazers:10Issues:4Issues:40

cutt

CUDA Tensor Transpose (cuTT) library

Language:C++Stargazers:9Issues:2Issues:0