zhihou7 / deit_share

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

#BatchFormerV2

The hyperparameters

--add_global 2 --insert_idx 8 --bt_lr 0.5 # for larger model, maybe, it requires to avoid overfitting. e.g. a smaller lr for bt

--add_global 2 --start_idx 0 # CIFAR

About

License:Apache License 2.0


Languages

Language:Python 99.9%Language:Shell 0.1%