facebookresearch/SemDeDup Issues
implementation clarification
How can I create embeddings?
where is "submit_semdedup_job.py"?
Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically similar, but not exactly identical).
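To make "semantically similar, but not exactly identical" concrete, here is a minimal sketch of the general idea, not the repository's implementation. It assumes each data item has already been mapped to a nonzero embedding vector, and it treats two items as semantic duplicates when their cosine similarity reaches a threshold. The function names, the toy vectors, and the 0.95 threshold are all hypothetical choices for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two nonzero vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def drop_semantic_duplicates(embeddings, threshold=0.95):
    """Return indices of items to keep (hypothetical helper).

    Greedy pass: an item is kept only if its similarity to every
    previously kept item is below `threshold`. This compares against
    all kept items, so it is quadratic in the worst case and is only
    meant for small illustrative inputs.
    """
    kept = []
    for i, emb in enumerate(embeddings):
        if all(cosine(emb, embeddings[j]) < threshold for j in kept):
            kept.append(i)
    return kept


# Toy embeddings (made up): items 0 and 1 point in nearly the same
# direction, item 2 is orthogonal to item 0.
toy_embeddings = [
    [1.0, 0.0],
    [0.99, 0.05],
    [0.0, 1.0],
]

kept_indices = drop_semantic_duplicates(toy_embeddings)
print(kept_indices)
```

In this toy case item 1 is nearly parallel to item 0 without being identical to it, so it is dropped, while item 2 is kept. The first item seen in each near-duplicate group is the one retained, which is an arbitrary choice made here for simplicity.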