- See also: Chunked Scan
- Tweets https://twitter.com/darkproger/status/1741453063980830878
- CUDA mode group has an aggregator of more scan implementations: https://github.com/cuda-mode/resource-stream
- block scan algorithms in cub
Parallel Associative Scan for Language Models
Parallel Associative Scan for Language Models
ISC License