Evaluate a quantized model
Psancs05 opened this issue · comments
I am trying to run an evaluation on a model that is quantized. I have to instantiate it using 'accelerate' to use the GPU, because otherwise it cannot fit in memory. The problem is that when I call compute() on the metric I want, I get this error:
ValueError: The model has been loaded with `accelerate` and therefore cannot be moved to a specific device. Please discard the `device` argument when creating your pipeline object.
Is there any way to use the compute() method with a model that is already on the GPU?
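Not a maintainer, but the error message itself suggests the fix: when a model is loaded through accelerate (e.g. `from_pretrained(..., device_map="auto")`), transformers records the placement in the model's `hf_device_map` attribute, and the `pipeline(...)` call must then be made *without* a `device` argument — accelerate has already put the weights on the GPU, so the pipeline should not try to move them. A minimal sketch of that logic, assuming this is the cause; the `pipeline_kwargs` helper below is hypothetical, not part of transformers:

```python
def pipeline_kwargs(model, device=None):
    """Build kwargs for transformers.pipeline(...), dropping `device`
    when the model was dispatched by accelerate (has hf_device_map)."""
    kwargs = {"model": model}
    if device is not None and getattr(model, "hf_device_map", None) is None:
        # Model was loaded normally, so passing a device is safe.
        kwargs["device"] = device
    return kwargs


class PlainModel:
    """Stand-in for a model loaded without accelerate."""
    pass


class DispatchedModel:
    """Stand-in for a model loaded with device_map='auto';
    transformers sets hf_device_map on such models."""
    hf_device_map = {"": 0}


print(pipeline_kwargs(PlainModel(), device=0))       # keeps device
print(pipeline_kwargs(DispatchedModel(), device=0))  # drops device
```

In practice that means creating the pipeline as `pipeline("text-generation", model=model, tokenizer=tokenizer)` with no `device=` at all, then running the pipeline to get predictions and passing those to the metric's compute() — the GPU is still used, since accelerate placed the weights there at load time.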
I'm also having the same problem... @Psancs05 were you able to find a workaround for this?
Yeah, me too. What's the solution for this?