Does FasterTransformer use FlashAttention?
niyunsheng opened this issue · comments
Yunsheng Ni commented
Thank you in advance for any response.
Transformer related optimization, including BERT, GPT
niyunsheng opened this issue · comments
Thank you in advance for any response.