decide the n_layers

Question

leehelenah opened this issue 4 years ago · comments

Hello,

Thanks for the nice implementation.
I noticed that you set n_layers=1 in conf/train.json.
In most experiments I have seen, people set n_layers to 6 or even higher.
Could that be why the Transformer model doesn't outperform RCNN in your results? Thank you.

lipengyu · Answer 1 · Tue Jul 28 2020 10:47:00 GMT+0800 (China Standard Time)

A Transformer has more parameters than RCNN, so it needs more data to fit. That is also why Transformer-based pretrained language models need a huge corpus. So if you have a large dataset, the result may be slightly different.
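
To get a rough sense of how n_layers scales the parameter count, here is a minimal sketch that counts the weights of a standard Transformer encoder layer (multi-head self-attention, a two-layer feed-forward block, and two layer norms, all with biases). The function names and the sizes d_model=512 and d_ff=2048 are assumptions chosen for illustration; they are not taken from conf/train.json, and the implementation in this repository may differ.

```python
def encoder_layer_params(d_model: int, d_ff: int) -> int:
    """Count parameters in one standard Transformer encoder layer.

    Assumes the textbook layout, not necessarily this repository's code.
    """
    # Self-attention: Q, K, V and output projections, each d_model x d_model plus a bias.
    attention = 4 * (d_model * d_model + d_model)
    # Feed-forward: d_model -> d_ff -> d_model, each linear map with a bias.
    feed_forward = (d_model * d_ff + d_ff) + (d_ff * d_model + d_model)
    # Two layer norms, each with a scale and a shift vector of size d_model.
    layer_norms = 2 * (2 * d_model)
    return attention + feed_forward + layer_norms


def encoder_params(n_layers: int, d_model: int, d_ff: int) -> int:
    """Total encoder-stack parameters, excluding embeddings and the classifier head."""
    return n_layers * encoder_layer_params(d_model, d_ff)


if __name__ == "__main__":
    # Hypothetical sizes, for illustration only.
    for n_layers in (1, 6):
        total = encoder_params(n_layers, d_model=512, d_ff=2048)
        print(f"n_layers={n_layers}: {total:,} encoder parameters")
```

Under these assumptions the encoder stack grows linearly with n_layers, so going from 1 to 6 layers multiplies the encoder parameters by six, which is consistent with the point above that a deeper model needs more data to fit.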