Fail to obtain the reported MACs for performer-based models.

Question

blackfeather-wang opened this issue 3 years ago

Hi,

Thank you for this repo! It is really helpful. However, we were unable to reproduce the reported MACs for the performer-based models (T2T-ViT-7/10/12). More importantly, the reported numbers appear to be inconsistent with each other.

Both the paper and the code indicate that T2T-ViT-7/10/12 share the same architecture for the T2T module and the transformer layers, and differ from each other only in the number of transformer layers. From the reported MACs (1.2G, 1.8G, and 2.2G for T2T-ViT-7, 10, and 12, respectively), the MACs for a single transformer layer are (1.8 - 1.2) / 3 = (2.2 - 1.8) / 2 = 0.2G. As a consequence, for T2T-ViT-7 we get 1.2 - 0.2*7 = -0.2 < 0, which would mean that the T2T module has negative MACs! Could you please tell us whether we miscalculated something?
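The consistency check in this comment can be reproduced in a few lines. This is a minimal sketch that uses only the reported figures quoted above; the variable and function names are illustrative and do not come from the repo.

```python
# Reported MACs (in G), keyed by number of transformer layers, as quoted above.
reported = {7: 1.2, 10: 1.8, 12: 2.2}

def per_layer(depth_a, depth_b):
    """MACs per transformer layer implied by two models that differ only in depth."""
    return (reported[depth_b] - reported[depth_a]) / (depth_b - depth_a)

layer_7_10 = per_layer(7, 10)
layer_10_12 = per_layer(10, 12)

# What is left for the T2T module after subtracting 7 layers from T2T-ViT-7.
t2t_residual = reported[7] - 7 * layer_7_10

print(f"per layer (7 -> 10):  {layer_7_10:.2f}G")
print(f"per layer (10 -> 12): {layer_10_12:.2f}G")
print(f"implied T2T module:   {t2t_residual:.2f}G")
```

A negative `t2t_residual` signals that the reported totals cannot all come from a fixed T2T cost plus a fixed per-layer cost.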

YuanLi · Answer 1 · Thu Apr 29 2021 15:18:47 GMT+0800 (China Standard Time)

Hi, thanks for pointing this out.

For the three lite variants of T2T-ViT, each Transformer layer costs 0.125G MACs.
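A per-layer figure of this size can be sanity-checked analytically. The sketch below counts only the matrix multiplications of one standard softmax-attention Transformer layer. The configuration it uses (197 tokens, embedding dimension 256, MLP ratio 2) is an assumption about the lite variants and is not stated in this thread, and the counting convention (ignoring softmax, LayerNorm, biases, and activations) is only one of several in use.

```python
def encoder_layer_macs(num_tokens, dim, mlp_ratio):
    """Multiply-accumulates for one standard (softmax-attention) Transformer layer.

    Only matrix multiplications are counted; softmax, LayerNorm, biases and
    activation functions are ignored.
    """
    hidden = int(dim * mlp_ratio)
    qkv = num_tokens * dim * 3 * dim          # query/key/value projections
    scores = num_tokens * num_tokens * dim    # Q @ K^T, summed over all heads
    weighted = num_tokens * num_tokens * dim  # attention @ V, summed over all heads
    proj = num_tokens * dim * dim             # attention output projection
    mlp = 2 * num_tokens * dim * hidden       # the two linear layers of the MLP
    return qkv + scores + weighted + proj + mlp

# ASSUMED configuration, not taken from this thread:
# 196 patch tokens + 1 class token, embedding dimension 256, MLP ratio 2.
macs = encoder_layer_macs(num_tokens=197, dim=256, mlp_ratio=2)
print(f"{macs / 1e9:.3f}G MACs per layer")
```

Under these assumptions the count comes to roughly 0.123G, which is close to the 0.125G quoted above. The attention terms do not depend on the number of heads, because splitting `dim` across heads leaves the total multiply count unchanged.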

The MACs of T2T-ViT-7 are 0.125*7 + 0.7, where 0.7 is for the T2T module. So the MACs of T2T-ViT-7, T2T-ViT-10, and T2T-ViT-12 are 1.57G, 1.9G, and 2.2G. We will update the results in the repo.
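The formula in this reply is a fixed T2T cost plus a per-layer cost times the depth. A minimal sketch of that arithmetic follows, using only the two constants given in the reply; the function name and its parameters are illustrative.

```python
PER_LAYER_G = 0.125  # MACs (G) per transformer layer, from the reply above
T2T_MODULE_G = 0.7   # MACs (G) attributed to the T2T module in the reply above

def total_macs_g(depth, per_layer=PER_LAYER_G, t2t=T2T_MODULE_G):
    """Total MACs (G) for a model with `depth` transformer layers."""
    return depth * per_layer + t2t

for depth in (7, 10, 12):
    print(f"T2T-ViT-{depth}: {total_macs_g(depth):.3f}G")
```

The unrounded totals are about 1.575G, 1.95G, and 2.2G, so the figures quoted in the reply agree with the formula up to rounding. Passing a different `t2t` value shows how the totals shift if the T2T module estimate changes.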

Rainforest Wang · Answer 2 · Thu Apr 29 2021 15:59:51 GMT+0800 (China Standard Time)

Thank you for your reply.

How were the MACs of the T2T module obtained? We tried to calculate them following the code, but got a much smaller value (~0.25G).

Yufei Xu · Answer 3 · Mon May 10 2021 21:16:35 GMT+0800 (China Standard Time)

Hi,

I also got about 0.25G MACs for the T2T module. I calculated the MACs following the code, together with the repo you suggested in another issue for counting the MACs of the exp, sum, and divide operations. Could you give some hints? Thanks in advance.

YuanLi · Answer 4 · Sun May 23 2021 16:34:36 GMT+0800 (China Standard Time)

Hi,

We double-checked the MACs of the T2T module, and they should be ~0.25G. We have updated the repo and will update the paper soon.

Geonhwa Jeong · Answer 5 · Sat Jul 02 2022 02:20:05 GMT+0800 (China Standard Time)

Hi @yuanli2333, thanks for the great work. Could you share the actual script that you used to calculate the MACs for T2T and for the original ViT as well? It would be very helpful. I have found that different papers report different MAC numbers for the original ViT, too.