princeton-nlp / SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Cropping data augmentation (Table 1)

birajpandey opened this issue

The caption of Table 1 in Gao et al. 2022 states that cropping keeps 100-k% of the length. When the crop boundary falls inside a word, did you keep the chopped word or round to the nearest space?

Hi,

We always keep/delete the whole word (separated by spaces).
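
For readers who want to reproduce this, here is a minimal sketch of word-level cropping. It is not the code used for the paper. The function name `crop_words`, the choice of a contiguous span with a random start, and the rounding of the kept word count are all assumptions made for illustration; the only detail taken from this thread is that whole space-separated words are kept or deleted.

```python
import random


def crop_words(sentence, k, rng=None):
    """Keep roughly (100 - k)% of the space-separated words of `sentence`.

    Hypothetical helper: words are never split, so the kept length is
    measured in whole words rather than characters.
    """
    if not 0 <= k < 100:
        raise ValueError("k must be in [0, 100)")
    rng = rng or random.Random()
    words = sentence.split()
    if not words:
        return sentence
    # Assumption: round the number of kept words and keep at least one.
    keep = max(1, round(len(words) * (100 - k) / 100))
    # Assumption: the kept words form one contiguous span with a random start.
    start = rng.randint(0, len(words) - keep)
    return " ".join(words[start:start + keep])
```

With a 10-word sentence and `k=30`, this keeps 7 whole words. Which 7 depends on the random start, so pass a seeded `random.Random` if you need repeatable crops.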