There are 0 repositories under the shingling topic.
Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.
Implementations of algorithms for big data using Python, NumPy, and pandas.
A Java program to check for plagiarism between multiple documents using shingling, MinHashing, and locality-sensitive hashing.
A functional rewrite of the `schindel` library
Duplicate Detection on Hoaxy Dataset
Implementing Locality Sensitive Hashing for DNA Sequences.
Finding Similar Items: Textually Similar Documents
Finding Similar Items: Textually Similar Documents
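
For readers new to the topic, here is a minimal sketch of the shingling step that several of the projects above build on. It is not taken from any listed repository. The function names `shingles` and `jaccard`, the word-level shingle size `k=3`, and the sample sentences are all assumptions chosen for illustration.

```python
def shingles(text, k=3):
    """Return the set of k-word shingles (as tuples) in text.

    Hypothetical helper: word-level shingles, lowercased, split on whitespace.
    """
    words = text.lower().split()
    if not words:
        return set()
    if len(words) < k:
        # Text shorter than k words: treat the whole text as one shingle.
        return {tuple(words)}
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity |a & b| / |a | b| of two sets."""
    if not a and not b:
        return 1.0  # convention assumed here: two empty sets are identical
    return len(a & b) / len(a | b)


# Illustrative inputs (made up for this sketch).
doc_a = "the quick brown fox jumps over the lazy dog"
doc_b = "the quick brown fox leaps over the lazy dog"

similarity = jaccard(shingles(doc_a), shingles(doc_b))
print(f"Jaccard similarity: {similarity:.2f}")
```

In practice, comparing full shingle sets for every pair of documents gets expensive, which is why projects in this topic typically follow shingling with MinHash signatures and locality-sensitive hashing to narrow down the candidate pairs.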