EleutherAI / oslo

OSLO: Open Source for Large-scale Optimization

Home Page: https://oslo.eleuther.ai

fused_scale_mask_softmax on GPT2 model

loopinf opened this issue

Describe a TODO feature

  • The current GPT2 implementation does not use the scale part of fused_scale_mask_softmax
  • Change it so that the fused kernel, including its scale, is used only on the path that does not go through reorder_and_upcast
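
For reference, the operation that a fused scale-mask-softmax computes can be sketched in plain Python as below. This is only an unfused illustration, not OSLO's kernel or its API: the function name scale_mask_softmax, its arguments, and the convention that True in the mask means "masked out" are all assumptions made for this example. It shows that passing the scale into the softmax gives the same result as scaling the attention scores beforehand, which is why the separate scaling step can be dropped once the scale part is used.

```python
import math


def scale_mask_softmax(scores, mask, scale=1.0):
    """Unfused reference: row-wise softmax(scale * scores) with masking.

    Hypothetical helper for illustration only.
    scores: list of rows of floats.
    mask:   same shape, True means the position is masked out.
    Assumes every row keeps at least one unmasked position.
    """
    out = []
    for row, mask_row in zip(scores, mask):
        # Apply the scale, then push masked positions to -inf.
        scaled = [-math.inf if m else s * scale for s, m in zip(row, mask_row)]
        peak = max(scaled)  # subtract the row max for numerical stability
        exps = [math.exp(v - peak) for v in scaled]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out


# Toy attention scores for 3 query positions and 3 key positions.
scores = [[1.0, 2.0, 3.0],
          [0.5, 0.1, 0.2],
          [2.0, 0.0, 1.0]]
# Causal mask: position i may not attend to positions j > i.
mask = [[j > i for j in range(3)] for i in range(3)]

head_dim = 4                      # assumed head size for the example
scale = 1.0 / math.sqrt(head_dim)

# Scale applied inside the softmax (what the "scale part" would do).
with_scale = scale_mask_softmax(scores, mask, scale=scale)

# Scale applied to the scores first, softmax called with scale=1.0.
prescaled = [[s * scale for s in row] for row in scores]
without_scale = scale_mask_softmax(prescaled, mask)
```

Both with_scale and without_scale hold the same probabilities, so the choice between them is about where the multiplication happens, not about the result.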
