OpenNMT / CTranslate2

Fast inference engine for Transformer models

Home Page:https://opennmt.net/CTranslate2

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Congrats on flash attention, now how do I run it???

BBC-Esq opened this issue · comments

Saw that the release finally uploaded to pypi and am excited to test FA...how do I do it? I saw instructions somewhere about setting a boolean parameter??

Nevermind, found it.

Nevermind, found it.

Can you help me find it?