princeton-nlp / SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

How do you get the supervised NLI dataset?

leoozy opened this issue

Did you sample from SNLI+MNLI, or did you use the whole dataset directly?

We directly combine SNLI + MNLI and use all of the data. Some examples may be filtered out, though, because they don't have a corresponding hard negative (a hypothesis with the contradiction label).
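
For readers who want to see what that filtering could look like, here is a minimal sketch, not the repository's actual preprocessing script. It assumes each NLI row is a `(premise, hypothesis, label)` tuple, and that a premise is kept only if it has both an entailment hypothesis (the positive) and a contradiction hypothesis (the hard negative). The function name `build_triplets`, the toy rows, and the choice to pair the first entailment with the first contradiction are all illustrative assumptions.

```python
from collections import defaultdict


def build_triplets(rows):
    """Turn (premise, hypothesis, label) rows into (anchor, positive, hard_negative) triplets.

    Hypothetical helper: premises without both an entailment and a
    contradiction hypothesis are dropped.
    """
    by_premise = defaultdict(lambda: {"entailment": [], "contradiction": []})
    for premise, hypothesis, label in rows:
        # Neutral hypotheses are ignored in this sketch.
        if label in ("entailment", "contradiction"):
            by_premise[premise][label].append(hypothesis)

    triplets = []
    for premise, hyps in by_premise.items():
        # Filter step: keep only premises that have a hard negative
        # (and a positive to pair it with).
        if hyps["entailment"] and hyps["contradiction"]:
            # Assumption: pair the first entailment with the first contradiction.
            triplets.append((premise, hyps["entailment"][0], hyps["contradiction"][0]))
    return triplets


# Toy stand-ins for SNLI and MNLI rows (made-up sentences, not real data).
snli_rows = [
    ("A man plays guitar.", "A person makes music.", "entailment"),
    ("A man plays guitar.", "A man is asleep.", "contradiction"),
    ("A man plays guitar.", "A man is on stage.", "neutral"),
]
mnli_rows = [
    ("The store opens at nine.", "The store has opening hours.", "entailment"),
    ("The store opens at nine.", "The store sells shoes.", "neutral"),
]

# Combine both datasets and use everything that survives the filter.
triplets = build_triplets(snli_rows + mnli_rows)
print(triplets)
```

In this toy run the second premise is dropped because it has no contradiction hypothesis, which mirrors the filtering described in the answer above.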
