Nadav Rotem's repositories
memset_benchmark
This repository contains high-performance implementations of memset and memcpy in assembly.
compressor
An educational implementation of a modern compressor in Rust
bistra
Bistra is a domain-specific language designed to generate high-performance kernels (such as GEMMs, convolutions, etc). The program is designed to allow powerful compiler optimizations and code generation that are not possible in C. The tool can auto-tune GEMM kernels to around 90% of peak performance (on X86/AVX2) within seconds.
triton
Development repository for the Triton language and compiler
Language:C++MIT000