[Feature request] Add support/demo implementation for Qwen-VL GGUF model

Question

[Feature request] Add support/demo implementation for Qwen-VL GGUF model

CoruNethron opened this issue 9 months ago · comments

Hello. May be this can be interesting for future roadmap:
https://github.com/QwenLM/Qwen-VL

It is multimodal and multilangual 7B model, able to analyze image, including text recognition and compare two (at least) images. Also able to detect bounding box of an object within image.

Seems, like it also beats some very good 13B models in pure textual context.

Would be nice to see it running quantized in GGUF.

CoruNethron · Answer 1 · Tue Oct 17 2023 10:58:53 GMT+0800 (China Standard Time)

Closing this, as Qwen inference has being added few days ago:
https://github.com/QwenLM/qwen.cpp

CoruNethron · Answer 2 · Tue Oct 17 2023 11:34:18 GMT+0800 (China Standard Time)

Sorry for increasing entropy here, just realized, that recently implemented inference is for another model:
https://github.com/QwenLM/Qwen
vs
https://github.com/QwenLM/Qwen-VL
So, I reopen this feature request.