xorbitsai / inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

Home Page: https://inference.readthedocs.io


BUG: When running inference with the models Qwen-VL-Chat-Int4 and Yi-VL-6B, the Model Engine cannot be found

okwinds opened this issue · comments

First, register the model as shown in the following screenshot.

(screenshot)

Second, launch Qwen-VL-Chat-Int4 from the list of custom models, as shown in the following screenshot.

(screenshot)

Qwen-VL-Chat-Int4

(screenshots)

xinference, version 0.12.1

Additional explanation
Yi-VL-6B has the same issue.

(screenshots)

@okwinds Please provide the full screenshot. Did you select vision for the VL models in the Model Abilities section of the UI?

Yep, I registered it again.

(screenshots)

Do not choose Generate in Abilities.


I tried again; it still doesn't work.

"model_ability": [
    "vision",
    "chat"
],

This error cannot be reproduced in version 0.13.1, please try upgrading to the latest version. @okwinds

I have updated Xinference; it is now version 0.13.2.

Following the same steps, the same problem still exists.

JSON:

{
    "version": 1,
    "context_length": 20000,
    "model_name": "Yi-VL-6B",
    "model_lang": [
        "en",
        "zh"
    ],
    "model_ability": [
        "chat",
        "vision"
    ],
    "model_description": "/home/llm/yi/Yi-VL-6B",
    "model_family": "yi-vl-chat",
    "model_specs": [
        {
            "model_format": "pytorch",
            "model_size_in_billions": 6,
            "quantizations": [
                "none"
            ],
            "model_id": null,
            "model_hub": "huggingface",
            "model_uri": "/home/llm/yi/Yi-VL-6B",
            "model_revision": null
        }
    ],
    "prompt_style": {
        "style_name": "CHATML",
        "system_prompt": "",
        "roles": [
            "<|im_start|>user",
            "<|im_start|>assistant"
        ],
        "intra_message_sep": "<|im_end|>",
        "inter_message_sep": "",
        "stop": [
            "<|endoftext|>",
            "<|im_start|>",
            "<|im_end|>",
            "<|im_sep|>"
        ],
        "stop_token_ids": [
            2,
            6,
            7,
            8
        ]
    },
    "is_builtin": false
}
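Before re-registering, a config like the one above can be sanity-checked offline. This is only a minimal sketch based on this thread's hints (the required-key list and the vision+chat rule are assumptions drawn from the discussion, not Xinference's official schema):

```python
import json

# Keys that the working configs in this thread all contain. This is an
# assumption based on the thread, not the official Xinference schema.
REQUIRED_KEYS = {"version", "model_name", "model_ability",
                 "model_family", "model_specs"}

def check_vl_config(raw: str) -> list:
    """Return a list of problems found in a custom VL-model JSON string."""
    cfg = json.loads(raw)
    problems = [f"missing key: {k}" for k in sorted(REQUIRED_KEYS - cfg.keys())]
    abilities = set(cfg.get("model_ability", []))
    # Per the maintainers' hint above: do not select "generate" for VL models.
    if "generate" in abilities:
        problems.append("'generate' should not be selected for VL models")
    if not {"vision", "chat"} <= abilities:
        problems.append("VL chat models need both 'vision' and 'chat'")
    return problems
```

For the Yi-VL-6B config above, `check_vl_config` would return an empty list, since it has both abilities and none of the missing-key problems.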

@amumu96 @qinxuye

Watching this issue. I hit the same problem on both 0.11.3 and 0.13.1.

The problem still exists in version 0.13.2. If other models were deployed previously and a config cache exists, configuration works normally; without that cache, no engine can be selected and the dropdown is empty.
(screenshot)

same question!

0.12.0 keeps proving its worth; I solved this by downgrading:
pip install xinference==0.12.0

> 0.12.0 keeps proving its worth; I solved this by downgrading: pip install xinference==0.12.0

Does pip install xinference==0.12.0 install xinference[all] by default? And how do I install xinference[transformers]?
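As far as I know, a plain `pip install xinference==0.12.0` installs only the core package, not the `[all]` extra; backends are requested explicitly, e.g. `pip install "xinference[transformers]==0.12.0"` (the extra names here are taken from Xinference's installation docs, so double-check them for your version). A small sketch to confirm afterwards that the packages an extra is expected to provide are actually importable:

```python
import importlib.util

def backend_available(module_name: str) -> bool:
    """True if the module can be found, without actually importing it."""
    return importlib.util.find_spec(module_name) is not None

# Assumption: the "transformers" extra pulls in at least these packages.
for mod in ("transformers", "torch"):
    print(mod, "available:", backend_available(mod))
```

Running this after installation shows at a glance whether the transformers backend's dependencies made it into the environment.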

This should already be fixed on the main branch; please try it after this week's release. If you are using the Docker image, you can try pulling the nightly-main tag.