LostRuins / koboldcpp

A simple one-file way to run various GGML and GGUF models with KoboldAI's UI

Home Page: https://github.com/lostruins/koboldcpp


Reprocessing Issue with Llama 3

Nabokov86 opened this issue · comments

When using Llama 3, I've noticed that unnecessary reprocessing occurs on previously generated text.
To reproduce the issue, generate a short piece of text a couple of times and note that prompt processing sometimes runs again over text that was already generated.

Latest concedo_experimental.

It seems like the reprocessing occurs after a new line is generated.
Screenshot from 2024-04-23

Did you by any chance enable "Trim Sentences" or "Author Note"?

No, I'm using default settings without trimming. So you can't reproduce it?
saved_story.json

Yes, I can reproduce it. Looking closer, the tokenizer is behaving weirdly. I think there is an issue with token merges.
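To illustrate the kind of merge problem described above, here is a toy greedy-merge tokenizer (an assumption for illustration only, not llama.cpp's actual BPE implementation, and the merge rule is hypothetical): when a merge rule joins a newline with the character that follows it, appending new text can retokenize the tail of the already-cached context, so the token streams no longer share an exact prefix.

```python
# Toy greedy-merge tokenizer (NOT llama.cpp's real tokenizer).
# A hypothetical merge rule joining "\n" with the next character shows
# how appending text can change tokens that were already cached.
def tokenize(text, merges):
    tokens = list(text)
    changed = True
    while changed:
        changed = False
        for i in range(len(tokens) - 1):
            if (tokens[i], tokens[i + 1]) in merges:
                # Merge the adjacent pair into a single token.
                tokens[i:i + 2] = [tokens[i] + tokens[i + 1]]
                changed = True
                break
    return tokens

merges = {("\n", "A")}  # hypothetical merge rule

old = tokenize("hi\n", merges)   # ['h', 'i', '\n']
new = tokenize("hi\nA", merges)  # ['h', 'i', '\nA']

# Even though "hi\n" is an exact text prefix of "hi\nA", the token
# streams diverge at the newline, so a cached-prefix match fails there.
print(old, new)
```

If the cache comparison is done token-by-token, this divergence forces reprocessing from the newline onward even though the underlying text did not change.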

Relevant: ggerganov#6809

You should experience a small amount of reprocessing all the way back to the previous newline. This is a bug.
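The amount of reprocessing comes down to how many leading tokens the cached context shares with the new prompt; only tokens past that shared prefix need to be evaluated again. A minimal sketch of that idea (an illustration, not koboldcpp's actual implementation):

```python
def common_prefix_len(cached, new):
    """Count the leading tokens shared by the cached context and the new prompt."""
    n = 0
    for a, b in zip(cached, new):
        if a != b:
            break
        n += 1
    return n

# Illustrative token IDs: the first three tokens match, then they diverge.
cached = [1, 2, 3, 4, 5]
new = [1, 2, 3, 9, 9, 9]

keep = common_prefix_len(cached, new)
# Only tokens past the shared prefix are reprocessed.
to_process = new[keep:]
print(keep, to_process)  # 3 [9, 9, 9]
```

When the tokenizer merges tokens across a newline, the shared prefix ends at that newline, which matches the small burst of reprocessing described above.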

Hi, this should be fixed in the latest version. Remember to get freshly reconverted GGUFs.

@LostRuins Thanks! Yes, it looks like it’s working now. Thank you for continuing to maintain this project, you’re awesome!