drgulevich / gtexfix

Fix for Google Translate to process LaTeX documents.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Tokens may overlap with the content

Ivanx32 opened this issue · comments

Hi! May be in the following situation it should be better to change the token name automatically?

./to.py arxiv/new.tex
LaTeX file: arxiv/new.tex
Token conflicts detected: ['[0,1]']
Tokens may overlap with the content. Change tokens or remove the source of conflict.

Thanks for the suggestion. I agree that the treatment of token conflicts should be automatic, but I did not want to go the way of making the tokens more sophisticated because this confuses Google Translate and results in more token misses. At least, the first thing I should do is NOT to search for token conflicts inside the preamble and formulas (as it is for now) which are to be hidden by tokens anyway.

In the attached file (extension was changed from '.tex' to ".txt") conflicts occur in formulas.
Perhaps, script should hide the formulas at first and then search for conflicts.

example.txt