How much compute to evaluate?

Question

How much compute to evaluate?

petergreis opened this issue 6 months ago · comments

I have just attempted to run the eval code on an A6000. While it starts, it looks like it just hangs. Hence my question - what did you run the eval code on? Same cluster as mentioned in the paper for training?

hf-lin · Answer 1 · Thu Apr 11 2024 21:46:45 GMT+0800 (China Standard Time)

We run eval code with an RTX4090 GPU. Actually, 14GB memory is enough for a 7B model to inference.

Peter Greis · Answer 2 · Fri Apr 12 2024 16:18:28 GMT+0800 (China Standard Time)

And to add our fine tuned models to the set is it simply modify: eval/configs/models/chat_musician/hf_chat_musician.py ?

Something like

model_path_mapping = {
    "ChatMusician": "m-a-p/ChatMusician",
    "ChatMusician-Base": "m-a-p/ChatMusician-Base",
    "Mozart-SImple": "/full/path/to/model",
    "Mozart-Transposed": "/full/path/to/model_transposed"
}

??