chatglm多gpu用deepspeed和
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool
llplay opened this issue a year ago · comments
No description provided.
inference.py和webui文件夹底下就是
有多gpu的版本吗,使用多gpu推断的时候老是报RuntimeError: expected scalar type Half but found Float的错误
@liangwq