APPEND won't happen in some cases

Question

APPEND won't happen in some cases

linhkid opened this issue 3 years ago · comments

Nguyen Khanh Linh commented 3 years ago

Let say, for example, my corrupt sentence is

"A B C"

I would like to replace "A" with "D E"

But the result is: "D B C", it should be "D E B C"

Checked my .m2 training data, it has them all there but when I tried to predict, the other token "E" is always gone.

I checked during the inference (the variable "sugg_token" and there is no tokens or actions for the word "E")

What could be the reason? I can fix it myself but it might take a long time though. Appreciate any helps!

Alex Skurzhanskyi · Answer 1 · Wed Jun 09 2021 01:44:51 GMT+0800 (China Standard Time)

Yes, that's true. This is because of the limitation of our architecture. During 1 iteration, we can predict only 1 action per token. That's why we remove other tags during preprocessing. I would suggest splitting this example into two.

Nguyen Khanh Linh · Answer 2 · Wed Jun 09 2021 10:28:34 GMT+0800 (China Standard Time)

Ok thanks, or maybe I can just add an underline between them, then remove in postprocess

Alex Skurzhanskyi · Answer 3 · Wed Jun 09 2021 18:36:04 GMT+0800 (China Standard Time)

I'm not sure if this is a good solution. Such a tag will be very rare and won't have enough examples (if I understand the nature of your task).