WizardCoder hallucinations or bug in inference params?
Extremys opened this issue
WizardCoder frequently leaks the roles from the prompt template and continues the conversation on its own, like this:
Do you have an idea of what could cause this undesirable behavior, and how to avoid it?
I am using the FastChat repo for inference, reusing the Alpaca prompt template. Do you see any bug?
https://github.com/lm-sys/FastChat/blob/main/fastchat/conversation.py#L378C26-L378C26
Thanks.
Your template has a problem. The stop token of WizardCoder, <|endoftext|>, is different from Alpaca's </s>.
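The effect of the mismatch can be illustrated with a minimal sketch (plain Python, no FastChat; the function name and sample output are hypothetical): if the serving code cuts the generation at the wrong stop string, the model's raw continuation, including leaked role headers, reaches the user.

```python
def truncate_at_stop(generated: str, stop_str: str) -> str:
    """Cut the model output at the first occurrence of the stop string."""
    idx = generated.find(stop_str)
    return generated if idx == -1 else generated[:idx]

# Made-up raw model output: an answer, the stop token, then a
# hallucinated next turn with a leaked role header.
raw = "print('hi')<|endoftext|>### Instruction:\nNext task..."

# Alpaca's stop string never matches, so the leaked roles survive:
truncate_at_stop(raw, "</s>")            # -> the full string, roles exposed
# WizardCoder's stop string cuts cleanly before the leaked header:
truncate_at_stop(raw, "<|endoftext|>")   # -> "print('hi')"
```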
Thanks for your feedback, Chi. I think there is another cause, because switching from </s> to <|endoftext|> does not resolve the tricky behavior :) Any idea?
My suggestion is to use the raw template rather than the template from FastChat.
Raw template:
Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.
### Instruction:
{instruction}
### Response:
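Filling that raw template by hand is straightforward; here is a minimal sketch (plain Python, the constant and function names are my own):

```python
# The raw WizardCoder/Alpaca-style template quoted above.
WIZARDCODER_TEMPLATE = (
    "Below is an instruction that describes a task, paired with an input "
    "that provides further context. Write a response that appropriately "
    "completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:"
)

def build_prompt(instruction: str) -> str:
    """Insert the user instruction into the raw template."""
    return WIZARDCODER_TEMPLATE.format(instruction=instruction)

prompt = build_prompt("Write a Python function that reverses a string.")
# The prompt ends with the Response header, so the model generates the
# answer immediately after it, and generation should be stopped at
# <|endoftext|> rather than </s>.
```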
Hello Chi, I found the root cause, thanks for your feedback! It was coming from the worker implementation :)