Dedupe.io's repositories
dedupe-examples
:id: Examples for using the dedupe library
dedupe-geocoder
:round_pushpin: Demonstration of how dedupe might be used as a geocoder
doublemetaphone
:sound: Python wrapper for a C++ Double Metaphone
fuzzycategory
:triangular_ruler: Fuzzy Categorical Distances
dedupe-variable-address
Address Variable Type for dedupe
dedupe-variable-person
Dedupe variable for person names. Just people; no companies.
dedupe-variable-name
Name variable type for dedupe
dedupeio-web-api-docs
The Dedupe.io web API allows matching and training against projects using a standard RESTful framework.
Levenshtein_search
Python search module for fast approximate string matching
soft-tfidf
Misspelling-tolerant tf-idf similarity metric
categorical-distance
:triangular_ruler: Compare categorical variables
dedupe-variable-datetime
DateTime variable for dedupe
dedupe-variable-fuzzycategory
Dedupe Variable for Fuzzy Categories
dedupe-vowpal
Vowpal Wabbit Active Labeler for Dedupe
learned-string-alignments
Learning String Alignments for Entity Aliases
datetime-distance
📐 Compare dates and times
dedupe-variable-number
Try to cast strings to numbers, then compare
parseratorvariable
Base class for dedupe variables for parsed fields
simplecosine
:triangular_ruler: Simple cosine distance
dedupe-variable-ilcs
Dedupe variable for Illinois Compiled Statute (ILCS) codes
fastcluster
Fast hierarchical clustering routines for R and Python.
dedupe-variable-embedding
Use embeddings for semantic comparisons
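
Several of the repositories above provide distance measures for comparing field values. As a rough illustration of what a simple cosine distance between two strings can look like, here is a minimal sketch that uses only the Python standard library. The function name `cosine_distance` and the whitespace tokenization are assumptions made for this example; this is not the API of simplecosine or of any other package listed here.

```python
import math
from collections import Counter


def cosine_distance(a: str, b: str) -> float:
    """Return 1 - cosine similarity of the word-count vectors of a and b.

    Hypothetical helper for illustration only; not taken from any
    repository in this list.
    """
    # Lowercase and split on whitespace to build word-count vectors.
    counts_a = Counter(a.lower().split())
    counts_b = Counter(b.lower().split())

    # Treat an empty string as maximally distant from everything.
    if not counts_a or not counts_b:
        return 1.0

    # Dot product over the words the two strings share.
    shared = counts_a.keys() & counts_b.keys()
    dot = sum(counts_a[w] * counts_b[w] for w in shared)

    # Euclidean norms of the two count vectors.
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))

    return 1.0 - dot / (norm_a * norm_b)


if __name__ == "__main__":
    print(cosine_distance("main street cafe", "cafe on main street"))
    print(cosine_distance("main street cafe", "oak avenue diner"))
```

Strings that share most of their words score close to 0, and strings with no words in common score 1. A production comparator would likely weight rare words more heavily, which is the idea behind the tf-idf based metrics.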