hao-ai-lab / LookaheadDecoding

Lookahead Decoding Development Roadmap

Viol2000 opened this issue · comments

Software Quality

  • Refactor Code #9
  • Simple way to add new model

Implementation

  • Support FlashAttention
  • Support Sampling
  • Support Batch>1
  • Lookahead window KV-Cache (May hurt accuracy)
  • Verification branch trie
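The "verification branch trie" item above suggests organizing candidate continuations so that branches sharing a prefix are verified only once. A minimal, hypothetical sketch of that data structure (the class and method names here are illustrative, not from the repository):

```python
class TrieNode:
    """One node per distinct token in a branch position."""
    def __init__(self):
        self.children = {}  # token id -> TrieNode


class CandidateTrie:
    """Stores candidate token n-grams; shared prefixes collapse into one path,
    so overlapping candidates don't cost duplicate verification work."""

    def __init__(self):
        self.root = TrieNode()

    def insert(self, ngram):
        node = self.root
        for tok in ngram:
            node = node.children.setdefault(tok, TrieNode())

    def node_count(self):
        """Number of distinct token positions a verifier would need to score."""
        def count(node):
            return sum(1 + count(child) for child in node.children.values())
        return count(self.root)


trie = CandidateTrie()
trie.insert([5, 7, 9])
trie.insert([5, 7, 2])  # shares the prefix [5, 7] with the first candidate
print(trie.node_count())  # 4 nodes instead of 6 flat tokens
```

With two 3-token candidates sharing a 2-token prefix, the trie holds 4 nodes rather than 6, which is the saving this roadmap item appears to target.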

New Models

Does this project support the vicuna model?

Vicuna is already supported because it is based on LlamaForCausalLM.
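A quick way to see why Vicuna works out of the box: a Hugging Face model's `config.json` declares its architecture class, and Vicuna's declares `LlamaForCausalLM`. A minimal sketch of such a check (the `SUPPORTED_ARCHITECTURES` set and `is_supported` helper are hypothetical, not part of this repository):

```python
# Hypothetical compatibility check: a model is usable if its config
# lists an architecture class the project already handles.
SUPPORTED_ARCHITECTURES = {"LlamaForCausalLM"}


def is_supported(config: dict) -> bool:
    """Return True if any declared architecture is a supported class."""
    return any(a in SUPPORTED_ARCHITECTURES for a in config.get("architectures", []))


# Mirrors the "architectures" field in lmsys/vicuna-7b-v1.3's config.json.
vicuna_config = {"architectures": ["LlamaForCausalLM"]}
print(is_supported(vicuna_config))  # True
```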

Thank you for your reply! Do you mean that I can use the following commands to observe the acceleration effect of the Vicuna model?
USE_LADE=1 python applications/chatbot.py --model_path meta-llama/vicuna-7b-v.13 --debug #no chat, with lookahead
USE_LADE=0 python applications/chatbot.py --model_path meta-llama/vicuna-7b-v.13 --debug #no chat, without lookahead

It should be lmsys/vicuna-7b-v1.3 and yes.
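Putting the correction together with the commands above, the comparison would look like this (paths and flags taken from the thread itself):

```shell
USE_LADE=1 python applications/chatbot.py --model_path lmsys/vicuna-7b-v1.3 --debug  # with lookahead
USE_LADE=0 python applications/chatbot.py --model_path lmsys/vicuna-7b-v1.3 --debug  # without lookahead
```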

Got it! Thank you again for your reply!

Hi, I'm really interested in this decoding work. Is there any progress on integrating the Qwen model? Thanks.