karpathy / llama2.c

Inference Llama 2 in one file of pure C

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Code/script to reproduce val loss using the shared models

Alexey234432 opened this issue · comments

Hi,

does anyone know if there is a script/code to reproduce val loss using provided "*.bin" models? I've tried myself and can't get the numbers shared.

Thank you.

Same issue here.

in my case loss values are slightly higher - is it the same for you? ie 1.072 for 15M model is my case is 1.0833 and 0.760 for 110M model jumps to 0.8725 @DavidHerel

Thank you

Yeah, I think it was something similar to you.

I did not play with lr, warmup and dropout, so maybe more extensive hyperparams search will get us the results?