WizardCoder llama assert failure

Question

jacohend opened this issue a year ago

Trying to run a variety of ggml models from TheBloke leads to this error:
GGML_ASSERT: llama-cpp/ggml.c:6270: ggml_nelements(a) == ne0*ne1*ne2

Is anyone else experiencing this, and what might the cause be?

Jacob Henderson · Answer 1 · Mon Aug 28 2023 13:01:13 GMT+0800 (China Standard Time)

Lukas Kreussel · Answer 2 · Mon Aug 28 2023 16:16:33 GMT+0800 (China Standard Time)

This is probably another issue with the ggml version currently in use; a re-sync with the current main branch of llama.cpp is likely needed.

Jacob Henderson · Answer 3 · Mon Aug 28 2023 22:56:25 GMT+0800 (China Standard Time)

I actually did that and found a failure on the same assert line. The linked comment said rolling the version back worked best.

I'm wondering whether this assert assumes constant layer sizes, in which case modifications like the ones TheBloke makes might be causing the failure?
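
For anyone reading along, here is a rough sketch of what the asserted condition checks. As far as I can tell, an assert of this form guards ggml's 3D reshape: the tensor being reshaped must contain exactly `ne0*ne1*ne2` elements. The Python below only mirrors that arithmetic; it is not ggml code, and the function names and all the numbers (embedding size, head counts, token count) are made up for illustration.

```python
from math import prod


def nelements(shape):
    """Total number of elements in a tensor with the given shape."""
    return prod(shape)


def reshape_3d_ok(shape, ne0, ne1, ne2):
    """Mirror the asserted condition: a reshape must keep the element count."""
    return nelements(shape) == ne0 * ne1 * ne2


# Hypothetical dimensions, chosen only to illustrate the check.
n_embd, n_tokens = 4096, 8
head_dim = 128

# The loader's idea of the head count matches the tensor: 128 * 32 * 8 == 4096 * 8.
print(reshape_3d_ok((n_embd, n_tokens), head_dim, 32, n_tokens))

# The loader assumes a different head count than the tensor actually holds:
# 128 * 40 * 8 != 4096 * 8, so the equivalent GGML_ASSERT would fire.
print(reshape_3d_ok((n_embd, n_tokens), head_dim, 40, n_tokens))
```

So the assert does not require every layer to be the same size. It requires that the dimensions the code derives for the reshape agree with the tensor that was actually loaded. This sketch does not show which dimension is off for the models above, only the kind of mismatch that would trigger the failure.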