uta-smile / TCL

Code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022


About input text token max_length

lyakaap opened this issue:

Great work!

I noticed that max_length is set to 25 here, which is much smaller than CLIP's 77.
How did you arrive at this setting?
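For context, the setting in question is the `max_length` argument passed to the text tokenizer. Below is a minimal sketch of what truncating to 25 tokens looks like, assuming the HuggingFace `BertTokenizer` that ALBEF-style codebases such as TCL build on (the caption string is a made-up example):

```python
from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

caption = "a man riding a wave on top of a surfboard"

# Pad or truncate every caption to a fixed 25 tokens
# (the count includes the [CLS] and [SEP] special tokens).
text_input = tokenizer(
    caption,
    padding="max_length",
    truncation=True,
    max_length=25,
    return_tensors="pt",
)
print(text_input["input_ids"].shape)  # torch.Size([1, 25])
```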

An author commented:

Hi, thanks for your interest in our paper.
This setting is determined by (1) the average sentence length in the pre-training datasets and (2) computational efficiency.
Feel free to let me know if you have any other questions or suggestions. Thanks.
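To make these two criteria concrete: captions in typical web-caption pre-training corpora (e.g. COCO, Visual Genome, SBU, CC3M) are short, so a 25-token budget truncates very few of them; and since self-attention cost grows quadratically with sequence length, 25 tokens versus 77 is roughly a (77/25)² ≈ 9.5× reduction in the text encoder's attention compute. A hedged sketch for checking the average tokenized length on a caption list (the `captions` list here is a hypothetical stand-in for the real dataset):

```python
from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

# Hypothetical stand-in for the pre-training captions; substitute the real list.
captions = [
    "a man riding a wave on top of a surfboard",
    "two dogs playing in the snow",
    "a kitchen with a stove and a refrigerator",
]

# Token counts include the [CLS] and [SEP] special tokens.
lengths = [len(tokenizer(c)["input_ids"]) for c in captions]
print("average length:", sum(lengths) / len(lengths))
print("share truncated at 25:", sum(l > 25 for l in lengths) / len(lengths))
```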

Thank you very much for your answer!