RuntimeError: Error(s) in loading state_dict for FastTextSeq2SeqModel

Question

RuntimeError: Error(s) in loading state_dict for FastTextSeq2SeqModel

arinzea opened this issue 3 years ago · comments

I am running the example from the docs https://deepparse.org/examples/retrain_with_new_prediction_tags.html.
however the retrain function
address_parser.retrain(training_container, 0.8, epochs=5, batch_size=8, num_workers=2, callbacks=[lr_scheduler], prediction_tags=tag_dictionary, logging_path=logging_path)

is coming up with error:

RuntimeError: Error(s) in loading state_dict for FastTextSeq2SeqModel: size mismatch for decoder.linear.weight: copying a param with shape torch.Size([9, 1024]) from checkpoint, the shape in current model is torch.Size([3, 1024]). size mismatch for decoder.linear.bias: copying a param with shape torch.Size([9]) from checkpoint, the shape in current model is torch.Size([3]).

Marouane Yassine · Answer 1 · Mon Oct 04 2021 21:30:58 GMT+0800 (China Standard Time)

Hello Arinze,

This is strange. I've just run the example and I was able to retrain the model without issues.

Are you running the last version of deepparse?

If not, could you please upgrade and try again?

Arinze Akutekwe · Answer 2 · Fri Oct 08 2021 05:56:52 GMT+0800 (China Standard Time)

Thanks Marouane, it now works with the fixed version 0.4.4.
I have a use case for retraining sample English address data (from AddressBase) and parsing some of the fields.