lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

How can I use multiple NPUs?

QuentinWang1 opened this issue

For example, multiple GPUs can be used like this:
python3 -m fastchat.serve.cli --model-path lmsys/vicuna-7b-v1.5 --num-gpus 2

Maybe a command like the following needs to be supported?
python3 -m fastchat.serve.cli --model-path lmsys/vicuna-7b-v1.5 --device npu --num-npus 2