[Feature] Check the compatibility of a Hugging Face model before fully downloading it?

Question


SuperUserNameMan opened this issue 24 days ago · comments

Feature Request

Hello again,

It would be cool if the Chat app could check the compatibility of a Hugging Face model before downloading it fully.

Maybe this could be done by checking the GGUF header (if the file has one) in the partially downloaded file as soon as those bytes are available?
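As a rough illustration of the idea, here is a minimal sketch in Python of what such an early header check could look like. It assumes the GGUF version 2 or later layout (4-byte magic `GGUF`, a little-endian uint32 version, then uint64 tensor and metadata key/value counts). The function name `read_gguf_header` is hypothetical and not part of the Chat app.

```python
import struct

GGUF_MAGIC = b"GGUF"
HEADER_SIZE = 24  # 4 (magic) + 4 (version) + 8 (tensor count) + 8 (metadata KV count)


def read_gguf_header(data: bytes):
    """Parse the fixed-size GGUF header from the first bytes of a file.

    Returns (version, tensor_count, metadata_kv_count), or None when the
    bytes are too short, do not start with the GGUF magic, or use the
    older version 1 layout (which this sketch does not handle).
    """
    if len(data) < HEADER_SIZE or data[:4] != GGUF_MAGIC:
        return None
    # "<" means little-endian with no padding: uint32, uint64, uint64.
    version, tensor_count, kv_count = struct.unpack_from("<IQQ", data, 4)
    if version < 2:
        return None
    return version, tensor_count, kv_count


# Usage: only the first 24 bytes of the partial download are needed.
# with open("model.gguf.part", "rb") as f:   # hypothetical file name
#     print(read_gguf_header(f.read(HEADER_SIZE)))
```

This only confirms that the file is a GGUF container. Details such as the architecture or quantization type live in the metadata section that follows the header, which would need further parsing.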

Prism · Answer 1 · Tue May 28 2024 22:28:04 GMT+0800 (China Standard Time)

Well, just checking that the file is GGUF and uses Q4 quantization doesn't solve the VRAM problem. For example, some models like dolphin 13b won't work, while falcon 13b works fine. Why? It depends on the model size and the GPU type, and there is no easy solution for that. You can only try it and see whether it fails.
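To make the size part of this concrete, here is a back-of-the-envelope sketch in Python. It only estimates the memory taken by the weights themselves (parameter count times bits per weight), so it is a lower bound: context/KV cache, runtime buffers, and GPU-specific limits are ignored, which is why it cannot replace actually trying the model. The function name and the 4.5 bits-per-weight figure are illustrative assumptions, not values taken from any specific model file.

```python
def estimate_weight_bytes(n_params: float, bits_per_weight: float) -> float:
    """Lower-bound estimate of the memory needed for the weights alone."""
    return n_params * bits_per_weight / 8


# Example: a 13B-parameter model at an assumed ~4.5 bits per weight.
weights = estimate_weight_bytes(13e9, 4.5)
print(f"{weights / 1e9:.2f} GB for weights only")  # 7.31 GB for weights only
```

If this number alone already exceeds the available VRAM, the model will not fit fully on the GPU. If it is below, the model may still fail once the context and runtime overhead are added.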