A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool
runzeer opened this issue a year ago · comments
I noticed that the performance for the math reasoning is lower than the official blog. Is it due to the zero-shot setting compared the official 5-shot setting?