There are 25 repositories under record-linkage topic.
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
A powerful and modular toolkit for record linkage and duplicate detection in Python
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
:id: Examples for using the dedupe library
A list of free data matching and record linkage software.
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
Spark RDD with Lucene's query and entity linkage capabilities
Link Discovery Framework for Metric Spaces.
Resources for tackling record linkage / deduplication / data matching problems
Record Linkage ToolKit (Find and link entities)
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
Python package for deduplication/entity resolution using active learning
A browser user interface for manual labeling of record pairs.
Merge Dirty Data with Clean Reference Tables
Record matching and entity resolution at scale in Spark
Phonetic Spelling Algorithms in R
A maximum-strength name parser for record linkage.
Privacy Preserving Record Linkage Service
Fork of the Freely Extensible Biomedical Record Linkage program
An End-to-End Evaluation Framework for Entity Resolution Systems
List of entity resolution software and resources.
Similarity and distance measures for clustering and record linkage applications in R
Examples of spark-lucenerdd