There are 7 repositories under entity-matching topic.
An open source, high scalability toolkit in Java for Entity Resolution.
Entity resolution for Elasticsearch.
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.
Entity Matching Model solves the problem of matching company names between two possibly very large datasets.
Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.
MetaSRA: normalized sample-specific metadata for the Sequence Read Archive
Code for the paper "CollaborEM: A Self-supervised Entity Matching Framework Using Multi-features Collaboration". TKDE 2021.
Code for the paper "PromptEM: Prompt-tuning for Low-resource Generalized Entity Matching". VLDB 2023.
Spark Search - high performance advanced search features based on Apache Lucene
Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"
Alignment, a collaborative, system aided, user driven ontology/vocabulary matching and validation platform.
Code for the paper "Match, Compare, or Select? An Investigation of Large Language Models for Entity Matching" (COLING 2025)
JOBSKAPE: A Framework for Generating Synthetic Job Postings to Enhance Skill Matching
WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing, schema matching, identity resolution, data fusion, and result evaluation.
A collection of awesome resources regarding Record Linkage.
A Winner-Take-All Hashing-Based Unsupervised Model for Entity Resolution Problems. [B. Sc. Thesis]
Neoplasm Entity Recognition: matching disease names to ontology classes
Code and data for the paper "Bridging the Gap between Reality and Ideality of Entity Matching: A Revisiting and Benchmark Re-Construction" (IJCAI 2022)
Master's Degree Final Project using Python & NLP
An end-to-end entity matching system
Libem sample datasets.
:coffee: Multi-source ORM for Javascript Client+Server
Fair Entity Matching: A Fairness Suite for Auditing Entity Matching Approaches
Libem notebooks.
An exploration of generalizable approaches to unsupervised entity matching for use in linking tabular public energy data sources.
Entity Matching specific Explanation tool. Landmark generates reliable and coherent explanations through a perturbation analysis.