options used for the command train of calamari OCR

Question

options used for the command train of calamari OCR

Tailor2019 opened this issue 3 years ago · comments

Tailor2019 commented 3 years ago

Hello!
@ChWick @andbue
Please there is many options for this command(https://calamari-ocr.readthedocs.io/en/latest/doc.command-line-usage.html#calamari-train) . Are all these options usable for the training from scratch and the training from a pretrained model?
Which of these options preferable to use with the command train in the 2 cases(from scratch, from a pretrained model) in order to get best results of recognition?
Thanks in advance!

Andreas Büttner · Answer 1 · Thu Oct 14 2021 17:06:51 GMT+0800 (China Standard Time)

The parameters are already optimized for best results. You might be able to achieve a little lower CER by using larger networks, but only at the cost of longer training. For some insights in successful training procedures, have a look at this or that paper.

Besides from the parameters mentioned here, most parameters work for both training from scratch and warm starting. Setting parameters for network architecture does not work when starting with a pretrained model, obviously.

In everyday use I tend to set --n_augmentations=5 and train a set of 5 models using calamari-cross-fold-train.