EleutherAI / oslo

OSLO: Open Source for Large-scale Optimization

Home Page:https://oslo.eleuther.ai

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Add description how to use fused_scale_softmax

loopinf opened this issue · comments

Describe a TODO feature

  • It is hard to know how to use fused scale mask softmax
    • what is scale value and how it is used in attention layer.
    • missing test case for scale value result for not scale = 1.0

Assignees

How about holding a meeting for this?
Please see discord.