[Bug] Tiny Llama sample does not work properly
invent00 opened this issue · comments
Describe the bug
The tiny llama sample code is not working on my MTL (Meteor Lake) system.
The MatMul sample code works properly.
To Reproduce
Steps to reproduce the behavior:
- Install intel-npu-acceleration-library in a venv:
pip install intel-npu-acceleration-library
Expected behavior
The tiny-llama sample executes on the NPU and shows a result.
Error Screenshots
environment:
- OS: Windows 11 23H2 22631.3447
- Python version: 3.9.13 and 3.11.8
- CPU: Ultra 5 125U
- NPU driver: 32.0.100.2267
Hi @invent00 , what transformers version do you have?
Can you try this PR: #11? I can replicate your issue with a newer version of the transformers library.
This should be fixed with #11. Closing this issue. Please let me know if you still have issues.
Thank you for your support.
I will try the new version.
transformers version:
transformers==4.39.3
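Since the maintainer asked for the transformers version, here is a small stdlib-only sketch for checking whether the locally installed transformers is newer than the version reported in this issue. The `parse_version` and `is_newer_than_reported` helpers are illustrative, not part of intel-npu-acceleration-library or transformers:

```python
from importlib.metadata import version, PackageNotFoundError

REPORTED = "4.39.3"  # transformers version reported in this issue

def parse_version(v: str):
    """Turn a version string like '4.39.3' into (4, 39, 3) for comparison."""
    return tuple(int(part) for part in v.split(".")[:3])

def is_newer_than_reported(installed: str, reported: str = REPORTED) -> bool:
    """True if the installed version is strictly newer than the reported one."""
    return parse_version(installed) > parse_version(reported)

try:
    installed = version("transformers")
    print(f"transformers {installed}; newer than {REPORTED}: "
          f"{is_newer_than_reported(installed)}")
except PackageNotFoundError:
    print("transformers is not installed")
```

If the check prints `True`, you are on a newer transformers release than the one the sample was last verified against, which matches the incompatibility the maintainer describes.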