lxuechen / private-transformers

A codebase that makes differentially private training of transformers easy.

Home Page: https://arxiv.org/abs/2110.05679

Setting another seed doesn't change the result

JunyiZhu-AI opened this issue · comments

Hi Xuechen,

I have another issue, this time with the training seed. I would like to vary the random seed so that I can collect statistical results. I have tried many different approaches, but even after commenting out the set_seed() function, the eval accuracy is identical down to the last digit. How can I vary the random seed? I'm running experiments in examples/classification.

Thanks!

set_seed should be the only place where the seed (and randomness) is controlled. Have you tried using larger and more diverse values for the seed argument?

yes, here is what I have tried:

 # Set seed
 import numpy as np

 # Draw a fresh seed on every run so repeated runs should differ.
 seed = np.random.randint(0, 1000000)
 set_seed(seed)
...
 set_seed(seed)

I have run it several times, and the eval accuracy is always the same.

That sounds really strange. Could you remove the set_seed calls altogether?

It's really hard to pinpoint the problem without additional context. Does this still happen if you use a different output_dir for each seed?

One additional thing to check is whether the model checkpoint stored in output_dir is updated after you re-run your script. This may affect evaluation, since the checkpoint is restored before evaluation.
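
A quick way to verify this is to watch the checkpoint's modification time between runs. A minimal sketch, assuming the usual transformers layout (the output_dir path and the pytorch_model.bin filename are assumptions; adjust to your setup):

 import os
 import time

 output_dir = "output"  # assumed: your actual --output_dir
 ckpt = os.path.join(output_dir, "pytorch_model.bin")  # assumed checkpoint name
 if os.path.exists(ckpt):
     # If this timestamp doesn't change between runs, a stale checkpoint is
     # being restored before evaluation, which would mask any seed change.
     print("checkpoint last modified:", time.ctime(os.path.getmtime(ckpt)))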

Indeed, I tried commenting out set_seed and running rm -rf $output_dir, then re-running the algorithm. The eval accuracy is still the same. But when I declare a seed argument and pass it a large random number, the result changes. This was done after commenting out the set_seed function, so I guess the seed is being used somewhere else. Here is how I declare the argument:

from dataclasses import dataclass, field

from transformers import TrainingArguments


@dataclass
class DynamicTrainingArguments(TrainingArguments):
    # Overrides the inherited seed so it can be set from the command line.
    seed: int = field(
        default=0,
        metadata={"help": "Seed."}
    )
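
For context, here is a minimal sketch of how such a field reaches set_seed when parsed from the command line (the entry point and flags are illustrative, not the repo's actual code):

 from transformers import HfArgumentParser, set_seed

 parser = HfArgumentParser(DynamicTrainingArguments)
 (training_args,) = parser.parse_args_into_dataclasses()
 # Whatever --seed is passed on the command line ends up here.
 set_seed(training_args.seed)

Running the script with, say, --seed 424242 --output_dir output would then seed everything from 424242.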

I'm closing this issue since the problem is solved. Thanks for the response!

I think I've pinpointed the issue. If you're using the latest transformers library, then Trainer also sets the seed in its __init__ function; see this line.
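
Given that, one way to get genuinely different runs is to route a fresh seed through TrainingArguments rather than calling set_seed yourself. A minimal sketch, assuming a recent transformers version where Trainer.__init__ re-seeds from args.seed (the output_dir naming is illustrative):

 import numpy as np
 from transformers import TrainingArguments

 # Draw a fresh seed per run and pass it through TrainingArguments,
 # so the re-seeding inside Trainer.__init__ picks it up.
 fresh_seed = int(np.random.randint(0, 1_000_000))
 training_args = TrainingArguments(
     output_dir=f"output/seed_{fresh_seed}",  # separate dir per run (illustrative)
     seed=fresh_seed,
 )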

I think that must be the cause of this issue. Thanks for the information!