What hidden states should I use?

Question

zhangzhenyu13 opened this issue a year ago

The encoder outputs N×d hidden states for each text input.

SimCSE uses the first one (the [CLS] token) as the output.
sentence-transformers uses mean pooling (only over states where the attention mask is 1).

Is your default the same as mean pooling?

yuxin.wang · Answer 1 · Wed Jun 28 2023 18:12:10 GMT+0800 (China Standard Time)

Yes, it uses mean pooling (only over states where the attention mask is 1).
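
For readers comparing the two strategies mentioned in the question, here is a minimal sketch of first-token pooling versus masked mean pooling. It is an illustration only, not this project's actual code: the function names `cls_pool` and `mean_pool` and the toy data are hypothetical, and plain Python lists are used so it runs without dependencies (in practice the same arithmetic would be done on tensors).

```python
from typing import List

Vector = List[float]


def cls_pool(hidden_states: List[Vector]) -> Vector:
    """Return the first token's hidden state (the SimCSE-style choice)."""
    return list(hidden_states[0])


def mean_pool(hidden_states: List[Vector], attention_mask: List[int]) -> Vector:
    """Average the hidden states at positions where attention_mask == 1."""
    if len(hidden_states) != len(attention_mask):
        raise ValueError("hidden_states and attention_mask must have the same length")
    # Keep only real tokens; padding positions (mask == 0) are ignored.
    kept = [h for h, m in zip(hidden_states, attention_mask) if m == 1]
    if not kept:
        raise ValueError("attention_mask selects no tokens")
    dim = len(kept[0])
    return [sum(h[i] for h in kept) / len(kept) for i in range(dim)]


# Toy input (hypothetical values): N = 4 tokens, d = 2, last token is padding.
states = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [100.0, 100.0]]
mask = [1, 1, 1, 0]

sentence_cls = cls_pool(states)          # first token only
sentence_mean = mean_pool(states, mask)  # average of the three unmasked tokens
```

The padding row is deliberately large here: an unmasked average over all four rows would be pulled far off, which is why the mask matters for batched, padded inputs.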