ValueError: The input provided to the model are wrong. The number of image tokens is 0 while the number of image given to the model is 1. This prevents correct indexing and breaks batch generation.

Question


zhangchunjie1999 opened this issue 22 days ago

zhangchunjie1999 commented 22 days ago

This problem occurred when I was running a demo of the 34b model.
How can I fix this?

zhangchunjie1999 · Answer 1 · Sat May 11 2024 11:17:42 GMT+0800 (China Standard Time)


Kaiyue Sun · Answer 2 · Sun May 12 2024 19:45:50 GMT+0800 (China Standard Time)

same problem

zhangchunjie1999 · Answer 3 · Mon May 13 2024 10:27:53 GMT+0800 (China Standard Time)

I found a solution.
In tasks/eval/model_util.py,
remove
# try:
#     processor = PllavaProcessor.from_pretrained(repo_id)
# except Exception as e:
#     processor = PllavaProcessor.from_pretrained('llava-hf/llava-1.5-7b-hf')
and add:
processor = PllavaProcessor.from_pretrained(repo_id)
then:
Install the required packages
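
A likely reason this helps, stated as an assumption rather than something confirmed in this thread: the try/except silently falls back to the 'llava-hf/llava-1.5-7b-hf' processor whenever loading from repo_id fails, so the 34b model can end up paired with a processor that was not built for it. The sketch below shows the general pattern of letting the load failure surface instead. The helper name load_processor and the stub loaders are hypothetical and are not part of tasks/eval/model_util.py.

```python
def load_processor(load_fn, repo_id):
    """Load a processor for repo_id and fail loudly if it cannot be loaded.

    load_fn is any callable that takes a repo id and returns a processor.
    It stands in for a real loader here (hypothetical, for illustration only).
    """
    try:
        return load_fn(repo_id)
    except Exception as exc:
        # No silent fallback to a different checkpoint: report which repo failed.
        raise RuntimeError(f"could not load processor from {repo_id!r}") from exc


# Stub loaders used only to demonstrate the behaviour.
def working_loader(repo_id):
    return {"loaded_from": repo_id}


def failing_loader(repo_id):
    raise OSError("processor files not found")


processor = load_processor(working_loader, "some/repo")
```

With this pattern, a missing package or missing processor file shows up as an explicit error at load time rather than as an image-token mismatch later during generation.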

PavanMV · Answer 4 · Wed May 15 2024 17:55:06 GMT+0800 (China Standard Time)

It's an image_token_index problem. It looks like the new LlamaTokenizer uses image_token_index = 64000, whereas the pllava config has image_token_index = 64002. Change the config value to 64000.
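
To see why a mismatched index gives "the number of image tokens is 0", here is a minimal sketch in plain Python. It assumes the model counts the positions in input_ids that equal the configured image_token_index and compares that count with the number of images. The function names and the prompt ids are made up for illustration and are not taken from the PLLaVA code.

```python
def count_image_tokens(input_ids, image_token_index):
    """Count how many positions in input_ids hold the image placeholder id."""
    return sum(1 for token_id in input_ids if token_id == image_token_index)


def check_image_tokens(input_ids, image_token_index, num_images):
    """Raise ValueError if the placeholder count does not match the image count."""
    found = count_image_tokens(input_ids, image_token_index)
    if found != num_images:
        raise ValueError(
            f"found {found} image tokens but {num_images} image(s) were given"
        )
    return found


# Made-up prompt ids in which the tokenizer encoded the image placeholder as 64000.
prompt_ids = [1, 64000, 345, 678]

# Config says 64002: the placeholder is never matched, so the count is 0.
mismatched = count_image_tokens(prompt_ids, 64002)

# Config says 64000: the placeholder is matched once, one per image.
matched = check_image_tokens(prompt_ids, 64000, num_images=1)
```

In this sketch the tokenizer and the config only agree once both use the same id, which is the effect of changing the config value as described above.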