WIP
A tiny flash attention implementation in Python, Rust, CUDA, and C, for learning purposes.
- Python version
- Triton version
- [C version]
- [Rust version]
A flash attention tutorial written in Python, Triton, CUDA, and CUTLASS.
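
For readers new to the idea, here is a minimal sketch of the core trick behind flash attention: visiting keys and values in blocks while maintaining a running softmax, so the full score matrix is never stored. This is not code from this repository. It is a pure-Python illustration using only the standard library, and the names `naive_attention`, `tiled_attention`, and `block_size` are hypothetical. It does not model GPU memory tiling, masking, or multiple heads.

```python
import math


def naive_attention(q, k, v):
    """Reference: softmax(q k^T / sqrt(d)) v, computing every full score row."""
    scale = 1.0 / math.sqrt(len(q[0]))
    out = []
    for qi in q:
        scores = [scale * sum(a * b for a, b in zip(qi, kj)) for kj in k]
        m = max(scores)
        weights = [math.exp(s - m) for s in scores]
        total = sum(weights)
        out.append([
            sum(w * vj[c] for w, vj in zip(weights, v)) / total
            for c in range(len(v[0]))
        ])
    return out


def tiled_attention(q, k, v, block_size=2):
    """Same result, but K/V are visited block by block with a running softmax."""
    scale = 1.0 / math.sqrt(len(q[0]))
    dv = len(v[0])
    out = []
    for qi in q:
        m = float("-inf")  # running max of the scores seen so far
        l = 0.0            # running sum of exp(score - m)
        acc = [0.0] * dv   # running unnormalized output row
        for start in range(0, len(k), block_size):
            k_blk = k[start:start + block_size]
            v_blk = v[start:start + block_size]
            scores = [scale * sum(a * b for a, b in zip(qi, kj)) for kj in k_blk]
            m_new = max(m, max(scores))
            # Rescale earlier partial results so they are relative to the new max.
            alpha = math.exp(m - m_new)
            p = [math.exp(s - m_new) for s in scores]
            l = alpha * l + sum(p)
            acc = [
                alpha * a + sum(pj * vj[c] for pj, vj in zip(p, v_blk))
                for c, a in enumerate(acc)
            ]
            m = m_new
        # Normalize once at the end, after all blocks have been seen.
        out.append([a / l for a in acc])
    return out
```

Calling `tiled_attention(q, k, v, block_size=b)` on small lists of float rows should agree with `naive_attention(q, k, v)` up to floating-point rounding for any positive `b`. A smaller block size only changes how many keys are held at once, not the result.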