Beast code in Giters

a.saenko's starred repositories

ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python | Apache-2.0 | 38661 384 1652

v2rayNG

A V2Ray client for Android, support Xray core and v2fly core

Language: Kotlin | GPL-3.0 | 34731 535 2796

GoodbyeDPI

GoodbyeDPI — Deep Packet Inspection circumvention utility (for Windows)

Language: C | Apache-2.0 | 23789 431 604

HVM

A massively parallel, optimal functional runtime in Rust

Language: Cuda | Apache-2.0 | 10441 99 189

taskflow

A General-purpose Task-parallel Programming System using Modern C++

Language: C++ | NOASSERTION | 10116 255 454

conan

Conan - The open-source C and C++ package manager

Language: Python | MIT | 8160 134 10554

antigen

The plugin manager for zsh.

Language: Shell | MIT | 8001 103 373

tensorflow-onnx

Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX

Language: Jupyter Notebook | Apache-2.0 | 2300 58 1041

kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Language: Jupyter Notebook | Apache-2.0 | 1524 29 174

LenovoLegionLinux

Driver and tools for controlling Lenovo Legion laptops in Linux including fan control and power mode.

Language: C | GPL-2.0 | 1507 18 189

atomic_queue

C++ lockless queue.

Language: C++ | MIT | 1480 44 46

SHARK

SHARK - High Performance Machine Learning Distribution

Language: Python | Apache-2.0 | 1412 41 562

onnx-tensorflow

Tensorflow Backend for ONNX

Language: Python | NOASSERTION | 1273 50 546

buddy-mlir

An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).

Language: C++ | Apache-2.0 | 492 13 52

nvbench

CUDA Kernel Benchmarking Library

Language: Cuda | Apache-2.0 | 485 17 96

Polygeist

C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!

Language: C++ | NOASSERTION | 469 21 136

mlir-hlo

Language: MLIR | 396 25 48

cuda_hgemm

Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.

Language: Cuda | MIT | 270 4 12

CMSIS-NN

CMSIS-NN Library

Language: C | Apache-2.0 | 194 11 38

tensornetwork.org

Source for The Tensor Network open-source review article

Language: TeX | Apache-2.0 | 146 10 13

mlir-extensions

Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.

Language: MLIR | NOASSERTION | 117 18 89

tpp-mlir

TPP experimentation on MLIR for linear algebra

Language: MLIR | NOASSERTION | 110 8 222

polymer

Bridging polyhedral analysis tools to the MLIR framework

Language: C++ | MIT | 99 9 58

cute-gemm

Language: C++ | 70 2 4

MODel_opt

Memory Optimizations for Deep Learning (ICML 2023)

Language: Python | MIT | 58 36 2

FXdiv

C99/C++ header-only library for division via fixed-point multiplication by inverse

Language: C++ | MIT | 46 9 1

sionnx

Auto-gen Tests Tool for ONNX Compliance

Language: LLVM | NOASSERTION | 38 5 1

mlir-tcp

Tensor Compute Primitives: Mid-level Intermediate Representation for Machine Learning Programs

Language: MLIR | NOASSERTION | 35 10 2

iree-comparative-benchmark

Compiler-agnostic benchmark suites for comparing projects

Language: Python | Apache-2.0 | 10 4 40

cutt

CUDA Tensor Transpose (cuTT) library

Language: C++ | 9 20