yashkant / sam-textvqa

Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.

Home Page:https://yashkant.github.io/projects/sam-textvqa

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

question about training the model?

vhzy opened this issue · comments

Hi,thanks for open source your code.
I run the code on my server with 62G memory.After running for a while, the training was interrupted.
I found a similar phenomenon in the previous issue:
#2 (comment)
I wonder how much memory is needed to train this model?
Also,should I convert the dataset into npy files?