Does not seem to work on Chinese characters?
MrRace opened this issue
JaonLiu commented
from simhash import Simhash

text1 = '**'
text2 = '**人'
# Split each string into single-character features.
words1 = list(text1)
words2 = list(text2)
print(Simhash(words1).distance(Simhash(words2)))
The result is 14. Does it not work on Chinese?
1e0ng commented
The simhash algorithm is more effective for articles.
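To illustrate the point about articles, here is a minimal sketch of the general simhash idea. It is not this library's implementation: the MD5-based hashing, the `char_ngrams` helper, and the sample strings are all assumptions made for illustration. Each feature casts one vote per bit, so when a text has only a handful of features, one extra character can swing many bits, while in a long text the same change tends to be outvoted.

```python
import hashlib


def simhash(features, bits=64):
    """Toy simhash: per-bit majority vote over the hashes of all features."""
    votes = [0] * bits
    for feature in features:
        digest = hashlib.md5(feature.encode("utf-8")).digest()
        h = int.from_bytes(digest, "big") & ((1 << bits) - 1)
        for i in range(bits):
            votes[i] += 1 if (h >> i) & 1 else -1
    # A bit is set in the fingerprint when more features voted 1 than 0.
    return sum(1 << i for i in range(bits) if votes[i] > 0)


def distance(a, b):
    """Hamming distance between two integer fingerprints."""
    return bin(a ^ b).count("1")


def char_ngrams(text, n=2):
    """Overlapping character n-grams (hypothetical helper for this sketch)."""
    return [text[i:i + n] for i in range(max(len(text) - n + 1, 1))]


# Hypothetical sample texts: a short pair and a long pair,
# each differing by one appended character.
short_a = "今天天气"
short_b = "今天天气好"
long_a = "今天天气很好，我们一起去公园散步，然后在湖边喝茶聊天。" * 5
long_b = long_a + "好"

print(distance(simhash(char_ngrams(short_a)), simhash(char_ngrams(short_b))))
print(distance(simhash(char_ngrams(long_a)), simhash(char_ngrams(long_b))))
```

The exact numbers depend on the hash function, but the long pair will usually end up much closer than the short pair. Chinese characters are hashed like any other feature here, so under these assumptions a large distance on very short strings would reflect the input length rather than the script.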