TP=2, Loss of accuracy
coderchem opened this issue
coderchem commented
Hello, I ran the 7B LLaMA model with TP=2 (multi-GPU tensor parallelism) and found that accuracy dropped by about 5% compared with the single-GPU result. As far as I know, TP=2 should not change accuracy. Why does this happen?
hurun commented
Hi, can you post the steps to reproduce?
byshiue_NV commented
FasterTransformer does not support LLaMA officially, and FasterTransformer development has transitioned to TensorRT-LLM. TensorRT-LLM supports LLaMA; please give it a try.
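As a side note on the original question: tensor parallelism is not expected to be bit-identical to single-GPU execution, because splitting a matmul across GPUs changes the floating-point reduction order, and float addition is not associative. That drift is normally tiny (a 5% accuracy drop more likely indicates a real bug, e.g. in weight sharding), but the effect itself can be shown with a minimal NumPy sketch that simulates TP=2 by splitting the inner dimension of a matmul into two partial products and summing them, the way an all-reduce would:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal(10000).astype(np.float32)          # activation vector
W = rng.standard_normal((10000, 4)).astype(np.float32)     # weight matrix

# Single-GPU reference: one matmul over the full inner dimension.
full = x @ W

# Simulated TP=2: each "GPU" holds half of the inner dimension,
# computes a partial product, and the results are summed (all-reduce).
part = x[:5000] @ W[:5000] + x[5000:] @ W[5000:]

# The two results agree to within float32 rounding, but the changed
# summation order means they need not match bit for bit.
print(np.max(np.abs(full - part)))
```

The discrepancy printed here is on the order of float32 rounding error; anything on the scale of a 5% metric change cannot be explained by reduction order alone.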