There are 23 repositories under record-linkage topic.
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
A powerful and modular toolkit for record linkage and duplicate detection in Python
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
:id: Examples for using the dedupe library
A list of free data matching and record linkage software.
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
Spark RDD with Lucene's query and entity linkage capabilities
Link Discovery Framework for Metric Spaces.
Resources for tackling record linkage / deduplication / data matching problems
Record Linkage ToolKit (Find and link entities)
Python package for deduplication/entity resolution using active learning
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
A browser user interface for manual labeling of record pairs.
Merge Dirty Data with Clean Reference Tables
Phonetic Spelling Algorithms in R
Record matching and entity resolution at scale in Spark
Privacy Preserving Record Linkage Service
A maximum-strength name parser for record linkage.
Fork of the Freely Extensible Biomedical Record Linkage program
An End-to-End Evaluation Framework for Entity Resolution Systems
Similarity and distance measures for clustering and record linkage applications in R
Examples of spark-lucenerdd