seatgeek / thefuzz

Fuzzy String Matching in Python

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

image

TheFuzz

Fuzzy string matching like a boss. It uses Levenshtein Distance to calculate the differences between sequences in a simple-to-use package.

Requirements

For testing

  • pycodestyle
  • hypothesis
  • pytest

Installation

Using pip via PyPI

Using pip via GitHub

Adding to your requirements.txt file (run pip install -r requirements.txt afterwards)

Manually via GIT

Usage

Simple Ratio

Partial Ratio

Token Sort Ratio

Token Set Ratio

Partial Token Sort Ratio

Process

You can also pass additional parameters to extractOne method to make it use a specific scorer. A typical use case is to match file paths:

About

Fuzzy String Matching in Python

License:MIT License


Languages

Language:Python 90.8%Language:Shell 9.2%