Error message in checkpoint_{loader, saver}_megatron.py is usually a red herring
Mittagskogel opened this issue · comments
Megatron-DeepSpeed/tools/checkpoint_loader_megatron.py
Lines 28 to 39 in a7b7cb7
The problem here is usually not related to finding the megatron directory, but rather outdated megatron or missing subdependencies. Please remove the error message or at least print the underlying error and suggest specifying the megatron directory, instead of assuming that this is the root of the issue.
+1. encountered same problem. current except handler just print error but not telling which module failed importing.