all-in-one based version 2.1.0b1 issue for llama3-8b-instruct with 128-2048
Fred-cell opened this issue · comments
Last iterator is verfy slow for llama3-8b-instruct & fp8.
Cannot reproduce the issue in the env provided by user, will sync details offline.
After sync with user offline, cannot reproduce in users' env, will keep monitoring similar behavior, close issue for now.