bigscience-workshop / Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

RuntimeError: Error building extension 'scaled_upper_triang_masked_softmax_cuda'

zll0000 opened this issue · comments

RuntimeError: Error building extension 'scaled_upper_triang_masked_softmax_cuda'