There are 49 repositories under entity-resolution topic.
:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
A powerful and modular toolkit for record linkage and duplicate detection in Python
1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
:id: Examples for using the dedupe library
A list of free data matching and record linkage software.
Recent trends of Entity Linking, Disambiguation, and Representation.
This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microsoft Research Asia (MSRA).
ReFinED is an efficient and accurate entity linking (EL) system.
An open source, high scalability toolkit in Java for Entity Resolution.
Construct knowledge graphs from unstructured data sources, use graph algorithms for enhanced GraphRAG with a DSPy-based chat bot locally, and curate semantics for optimizing AI app outcomes within a specific domain.
Entity resolution for Elasticsearch.
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
Resources for tackling record linkage / deduplication / data matching problems
OpenRefine reconciliation services for VIAF, ORCID, and Open Library + framework for creating more.
Record Linkage ToolKit (Find and link entities)
List of entity resolution software and resources.
An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.
Python package for deduplication/entity resolution using active learning
This repository contains code and extensive prompt examples to reproduce and extend the experiments in our papers "Using ChatGPT for Entity Matching" and "Entity Matching using Large Language Models".
Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.
Low effort linking and easy de-duplication. Databricks ARC provides a simple, automated, lakehouse integrated entity resolution solution for intra and inter data linking.
ReCiter: an enterprise open source author disambiguation system for academic institutions
A browser user interface for manual labeling of record pairs.
A maximum-strength name parser for record linkage.
Welcome to Snowman App – a Data Matching Benchmark Platform.
This repository contains code and data download scripts for the paper "Intermediate Training of BERT for Product Matching" by Ralph Peeters, Christian Bizer and Goran Glavaš.