CLS of which layers to use in Condenser? last layer CLS? sum of last four layers CLS?

Question

CLS of which layers to use in Condenser? last layer CLS? sum of last four layers CLS?

mahdiabdollahpour opened this issue 2 years ago · comments

Mohammad Mahdi Abdollahpour commented 2 years ago

Hi
Thanks for the nice repo. After pretraining, Condenser has the same architecture as BERT (condenser heads are removed). Which CLS layers worked best for neural IR? last layer CLS? the sum of the last four layers CLS? ....

Luyu Gao · Answer 1 · Fri Apr 01 2022 14:11:49 GMT+0800 (China Standard Time)

We fine-tune the last backbone layer's CLS which is the one passed to the head during pre-training.

Luyu Gao · Answer 2 · Fri Apr 15 2022 11:02:46 GMT+0800 (China Standard Time)

Closing for now. Feel free to re-open if you have new questions.