There are 33 repositories under the entity-resolution topic.
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
A powerful and modular toolkit for record linkage and duplicate detection in Python
1 line for thousands of state-of-the-art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems.
Insightful Tutorials and Papers about Knowledge Graphs
Examples for using the dedupe library
A list of free data matching and record linkage software.
Recent trends of Entity Linking, Disambiguation, and Representation.
This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microsoft Research Asia (MSRA).
An open-source, highly scalable toolkit in Java for Entity Resolution.
ReFinED is an efficient and accurate entity linking (EL) system.
Entity resolution for Elasticsearch.
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
OpenRefine reconciliation services for VIAF, ORCID, and Open Library + framework for creating more.
Resources for tackling record linkage / deduplication / data matching problems
Record Linkage ToolKit (Find and link entities)
Python package for deduplication/entity resolution using active learning
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.
A browser user interface for manual labeling of record pairs.
Learning String Alignments for Entity Aliases
Welcome to Snowman App – a Data Matching Benchmark Platform.
Merge Dirty Data with Clean Reference Tables
This repository contains code and data download scripts for the paper "Intermediate Training of BERT for Product Matching" by Ralph Peeters, Christian Bizer and Goran Glavaš.
Low-effort linking and easy de-duplication. Databricks ARC provides a simple, automated, lakehouse-integrated entity resolution solution for intra- and inter-data linking.