jpWang / LiLT

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

Use LiLT / an alternative model with more than 512 tokens

coding-kt opened this issue

Hi,

LiLT processes a maximum of 512 tokens.

Is there a good option for a comparable, commercially usable model that can process more tokens?

It is of course possible to split longer inputs into 512-token chunks, but this comes with some disadvantages and difficulties.

Hi,
LiLT uses a token length of 512 during the pre-training phase. The most direct way to process a long document is to split it into chunks of length 512. Alternatively, you could consider linearly resizing (interpolating) the position embeddings to the desired length and then fine-tuning the model.
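For illustration, here is a minimal sketch of the position-embedding resizing idea. It is not part of this repo: it assumes the HuggingFace port of LiLT (`LiltModel`), where the text stream exposes a learned position table at `model.embeddings.position_embeddings`; the attribute paths and the checkpoint id are assumptions and may differ from the official implementation here.

```python
# Minimal sketch (not from the LiLT codebase) of linearly resizing a learned
# position-embedding table so the model can accept sequences longer than 512.
# Assumes the HuggingFace port (LiltModel); attribute names and the checkpoint
# id below are illustrative and may differ in this repo.
import torch.nn as nn
import torch.nn.functional as F
from transformers import LiltModel


def resize_position_embeddings(old_emb: nn.Embedding, new_num_positions: int) -> nn.Embedding:
    """Linearly interpolate a (num_positions, hidden) embedding table to a new length."""
    old_weight = old_emb.weight.data                      # (old_len, hidden)
    new_weight = F.interpolate(
        old_weight.t().unsqueeze(0),                      # (1, hidden, old_len)
        size=new_num_positions,
        mode="linear",
        align_corners=False,
    ).squeeze(0).t()                                      # (new_len, hidden)
    new_emb = nn.Embedding(new_num_positions, old_weight.size(1),
                           padding_idx=old_emb.padding_idx)
    new_emb.weight.data.copy_(new_weight)
    return new_emb


model = LiltModel.from_pretrained("SCUT-DLVCLab/lilt-roberta-en-base")
new_len = 1024 + 2                                        # RoBERTa-style tables reserve 2 offset slots
model.embeddings.position_embeddings = resize_position_embeddings(
    model.embeddings.position_embeddings, new_len
)
model.config.max_position_embeddings = new_len
# Note: the layout stream keeps its own sequential position table (named
# box_position_embeddings in the HF port, if I recall correctly); it would need
# the same resizing, and the model must then be fine-tuned, since the
# interpolated positions were never seen during pre-training.
```

After resizing, the longer positions are only a linear blend of the pre-trained ones, so fine-tuning on documents of the new length is required before the model can be expected to perform well.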