hecor / TextDeduplication

(fuzzy)duplication detection for texts from not too large corpora.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

TextDeduplication

(fuzzy)duplication detection for texts from not too large corpora.

About

(fuzzy)duplication detection for texts from not too large corpora.


Languages

Language:C 82.8%Language:C++ 17.2%