VisionEncoderDecoderModel convert

Question

VisionEncoderDecoderModel convert

sjtu-cz opened this issue 5 months ago · comments

How to convert the trained donut model into the model structure of VisionEncoderDecoderModel?

Felix · Answer 1 · Wed Feb 21 2024 18:20:20 GMT+0800 (China Standard Time)

Smells like an xy-problem, what exactly are you trying to do? Importing a donut model with the huggingface VisionEncoderDecoder implementation should be straight forward. Just make sure you use the right DonutTokenizer with it. The docs should cover what you are looking for:
https://huggingface.co/docs/transformers/model_doc/donut

sjtu-cz · Answer 2 · Wed Feb 21 2024 18:20:43 GMT+0800 (China Standard Time)

您好，你的邮件我已经收到~