microsoft / Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

error in generate_text.sh

lonelydancer opened this issue · comments

when i run
sh examples_deepspeed/generate_text.sh
CHECKPOINT_PATH=dataset/checkpoints/gpt2_345m

File "/usr/local/lib/python3.8/dist-packages/torch/nn/modules/module.py", line 1423, in _call_impl
return forward_call(*input, **kwargs)
TypeError: forward() got an unexpected keyword argument 'layer_past'

+1 to the issue, it seems the wrong model class is being referenced in the script.