tspeterkim / flash-attention-minimal

Flash Attention in ~100 lines of CUDA (forward pass only)

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

tspeterkim/flash-attention-minimal Stargazers