NVIDIA / FasterTransformer

Transformer related optimization, including BERT, GPT

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Does FasterTransformer use FlashAttention?

niyunsheng opened this issue · comments

Thank you in advance for any response.