mgoin / flash-attention

Fast and memory-efficient exact attention

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This repository is not active

About

Fast and memory-efficient exact attention

License:BSD 3-Clause "New" or "Revised" License


Languages

Language:Python 47.0%Language:C++ 34.8%Language:Cuda 17.7%Language:Dockerfile 0.3%Language:C 0.1%Language:Shell 0.0%Language:Makefile 0.0%