ggerganov / ggml

Tensor library for machine learning

[Feature request] Add support/demo implementation for Qwen-VL GGUF model

CoruNethron opened this issue

Hello. Maybe this would be of interest for the future roadmap:
https://github.com/QwenLM/Qwen-VL

It is a multimodal, multilingual 7B model that can analyze images, including text recognition, and compare two (or more) images. It can also detect the bounding box of an object within an image.

It also seems to beat some very good 13B models in purely textual contexts.

[Image: photo_2023-08-30_04-40-45]

It would be nice to see it running quantized in GGUF.

Closing this, as Qwen inference was added a few days ago:
https://github.com/QwenLM/qwen.cpp

Sorry for increasing entropy here. I just realized that the recently implemented inference is for a different model:
https://github.com/QwenLM/Qwen
vs
https://github.com/QwenLM/Qwen-VL
So I am reopening this feature request.