rustformers / llm

[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models

Home Page: https://docs.rs/llm/latest/llm/

Support for Mistral-7b

ranaya-formant opened this issue

Apologies for the late reply on this. I've tested two Mistral 7B-derived models (https://huggingface.co/TheBloke/Mistral-7B-Claude-Chat-GGUF and https://huggingface.co/TheBloke/zephyr-7B-beta-GGUF) with #412, and both seem to work. I'll keep this issue open until that PR lands.

Is there an update on this?

OK, I sort of figured it out. For others looking into this:

  • Use the gguf branch.
  • Run it like this: cargo run --release -- infer -m ~/llms/mistral-7b-instruct-v0.1.Q4_K_S.gguf -p "Write a long story" -n 5000 -r mistralai/Mistral-7B-v0.1
    The important bit is the -r mistralai/Mistral-7B-v0.1 at the end, which tells it to fetch the tokenizer from Hugging Face rather than relying on the one embedded in the GGUF file (a library-level sketch of the same setup follows this list).
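
For anyone driving this from Rust rather than the CLI, here is a minimal sketch of the same setup against the API documented for the released llm crate; the gguf branch may differ in details, the model path is a placeholder, and the assumption is that the CLI's -r flag corresponds to llm::TokenizerSource::HuggingFaceRemote:

```rust
use std::io::Write;
use llm::Model;

fn main() {
    // Load the Mistral GGUF file with the llama architecture (Mistral is
    // llama-compatible) and fetch the tokenizer from Hugging Face instead of
    // relying on the embedded one, mirroring the CLI's `-r` flag.
    let model = llm::load::<llm::models::Llama>(
        std::path::Path::new("/path/to/mistral-7b-instruct-v0.1.Q4_K_S.gguf"),
        llm::TokenizerSource::HuggingFaceRemote("mistralai/Mistral-7B-v0.1".to_string()),
        Default::default(), // llm::ModelParameters
        llm::load_progress_callback_stdout,
    )
    .unwrap_or_else(|err| panic!("failed to load model: {err}"));

    // Run a single inference request and stream generated tokens to stdout.
    let mut session = model.start_session(Default::default());
    let res = session.infer::<std::convert::Infallible>(
        &model,
        &mut rand::thread_rng(),
        &llm::InferenceRequest {
            prompt: "Write a long story".into(),
            parameters: &llm::InferenceParameters::default(),
            play_back_previous_tokens: false,
            maximum_token_count: Some(5000),
        },
        &mut Default::default(), // llm::OutputRequest
        |r| match r {
            llm::InferenceResponse::InferredToken(t) => {
                print!("{t}");
                std::io::stdout().flush().unwrap();
                Ok(llm::InferenceFeedback::Continue)
            }
            _ => Ok(llm::InferenceFeedback::Continue),
        },
    );

    if let Err(err) = res {
        eprintln!("inference failed: {err}");
    }
}
```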

Yup - the gguf branch's embedded tokenizer support doesn't quite work, so I've disabled it for now. I'm rebuilding the library on the develop branch to target the latest llama.cpp, but that's going to take some time.
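
In library terms, the workaround boils down to preferring an explicitly requested Hugging Face tokenizer over the GGUF file's embedded one. The helper below is purely illustrative and not part of llm:

```rust
// Hypothetical helper (not part of the llm crate): choose the tokenizer
// source, preferring an explicitly requested Hugging Face repository over the
// GGUF file's embedded tokenizer, which is currently disabled on the gguf branch.
fn tokenizer_source(remote_repo: Option<String>) -> llm::TokenizerSource {
    match remote_repo {
        Some(repo) => llm::TokenizerSource::HuggingFaceRemote(repo),
        None => llm::TokenizerSource::Embedded,
    }
}
```

With the gguf branch as it stands, only the Some(...) path works, which is why the -r argument is required in the command above.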