convert Qwen question

Question

OneStepAndTwoSteps opened this issue a year ago · comments

KAYNE CHEN commented a year ago

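Hello, I followed the Qwen conversion script you provided to convert the Qwen HF model to ONNX, and I ended up with four groups of files with the .onnx suffix plus some external weight files. How should I load the model to run inference?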

Luchang Li · Answer 1 · Mon Aug 28 2023 08:51:09 GMT+0800 (China Standard Time)

You can refer to https://github.com/tpoisonooo/llama.onnx/tree/main

Luchang Li · Answer 2 · Mon Aug 28 2023 08:55:09 GMT+0800 (China Standard Time)

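For these models you can also follow export_llama_single.py and export a single model instead. Inference is actually quite simple. Put briefly, the logic is prompt -> input_ids -> embedding -> decoder_layers -> output norm -> lm_head, which gives you lm_logits. From those you predict the next token's input_id with a method such as top-k or top-p sampling, then feed it back into the embedding stage and loop until you hit an end token or some other stop condition is met.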
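The loop described above can be sketched roughly as follows. This is only an illustration, not code from the repository: `forward` is a hypothetical callable that takes the token ids so far and returns the logits for the last position (in practice it would wrap the exported embedding, decoder layers, output norm and lm_head parts, for example through ONNX Runtime sessions, whose input and output names depend on the export). The names `sample_next_token` and `generate`, the default values, and the lack of a KV cache are all assumptions made to keep the sketch short.

```python
import math
import random
from typing import Callable, List, Optional, Sequence


def sample_next_token(logits: Sequence[float], top_k: int = 50, top_p: float = 0.9,
                      temperature: float = 1.0,
                      rng: Optional[random.Random] = None) -> int:
    """Pick the next token id from last-position logits: top-k filter, then top-p."""
    rng = rng or random  # fall back to the module-level generator
    # Keep the top_k highest-scoring token ids, best first.
    ranked = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)
    ranked = ranked[:max(1, top_k)]
    # Softmax over the kept logits (subtract the max for numerical stability).
    # temperature is assumed to be > 0.
    peak = logits[ranked[0]]
    weights = [math.exp((logits[i] - peak) / temperature) for i in ranked]
    total = sum(weights)
    probs = [w / total for w in weights]
    # Top-p: keep the smallest prefix whose cumulative probability reaches top_p.
    kept, cumulative = 0, 0.0
    for p in probs:
        kept += 1
        cumulative += p
        if cumulative >= top_p:
            break
    return rng.choices(ranked[:kept], weights=probs[:kept], k=1)[0]


def generate(prompt_ids: Sequence[int],
             forward: Callable[[List[int]], Sequence[float]],
             eos_token_id: int,
             max_new_tokens: int = 32,
             **sampling) -> List[int]:
    """Autoregressive loop: run the model, sample a token, append it, repeat."""
    ids = list(prompt_ids)
    for _ in range(max_new_tokens):
        logits = forward(ids)  # hypothetical: embedding -> layers -> norm -> lm_head
        next_id = sample_next_token(logits, **sampling)
        ids.append(next_id)
        if next_id == eos_token_id:  # stop once the end token is produced
            break
    return ids
```

With `top_k=1` the sampler reduces to greedy decoding, which is a convenient way to check the exported model against the original HF model before turning sampling on. A real implementation would normally also pass the cached keys and values between steps instead of re-running the whole sequence each time.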

MingkangW · Answer 3 · Fri Dec 29 2023 11:33:51 GMT+0800 (China Standard Time)

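Did you manage to get inference working? I've been trying this recently as well, but I'm new to AI and don't know how to go about it.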

Luchang Li · Answer 4 · Fri Dec 29 2023 11:35:22 GMT+0800 (China Standard Time)

Yes, it works. Just keep experimenting.
