rapidfuzz / RapidFuzz

Rapid fuzzy string matching in Python using various string metrics

Home Page: https://rapidfuzz.github.io/RapidFuzz/

RFE: API to extract equal best scores

i30817 opened this issue

I normalized my dataset so much that several names started - correctly - giving the same score.

This is ... sort of OK, but the alternative for me, while using extractOne, is that a download happens or not more or less at random, depending on which 'best score' wins. The name is the key to a set of 3 or fewer downloads, and depending on which name 'wins' there is sometimes an extra download and sometimes not.

Obviously I should be using extract with a limit, sorting the list, and then checking it, but I wonder if this could be made easier by adding an alternative 'extractBest' as a companion to extractOne that would return a ([scorer processed str], score, [original str]).

Never mind this; putting it in a list would only make it worse to use, I guess.
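
For reference, the workaround described above (call extract, then keep every result tied for the best score) could be sketched roughly as follows. This is only an illustration: `keep_ties` and `extract_best` are hypothetical names, not part of RapidFuzz, and the sketch assumes a similarity scorer where a higher score is better. It returns the same (choice, score, index or key) tuples that `process.extract` returns, not the shape proposed in the issue.

```python
from typing import Any, List, Sequence, Tuple

# (choice, score, index or key), the tuple shape process.extract returns
Match = Tuple[Any, float, Any]


def keep_ties(results: Sequence[Match]) -> List[Match]:
    """Return every result whose score equals the best score in `results`.

    Assumes higher scores are better (a similarity scorer, not a distance).
    """
    if not results:
        return []
    best = max(score for _, score, _ in results)
    return [match for match in results if match[1] == best]


def extract_best(query, choices, **kwargs) -> List[Match]:
    """Hypothetical companion to extractOne: all matches tied for first place.

    Extra keyword arguments (scorer, processor, score_cutoff) are passed
    through to process.extract unchanged.
    """
    # Imported here so keep_ties stays usable without RapidFuzz installed.
    from rapidfuzz import process

    # limit=None disables the result limit, so no tied match is cut off.
    return keep_ties(process.extract(query, choices, limit=None, **kwargs))
```

With this, the caller can check `len(extract_best(name, names))` and handle the tie explicitly instead of depending on whichever entry extractOne happens to return.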