There are 7 repositories under entity-matching topic.
An open source, high scalability toolkit in Java for Entity Resolution.
Entity resolution for Elasticsearch.
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.
Entity Matching Model solves the problem of matching company names between two possibly very large datasets.
MetaSRA: normalized sample-specific metadata for the Sequence Read Archive
Code for the paper "CollaborEM: A Self-supervised Entity Matching Framework Using Multi-features Collaboration". TKDE 2021.
Spark Search - high performance advanced search features based on Apache Lucene
WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing, schema matching, identity resolution, data fusion, and result evaluation.
A collection of awesome resources regarding Record Linkage.
A Winner-Take-All Hashing-Based Unsupervised Model for Entity Resolution Problems. [B. Sc. Thesis]
JOBSKAPE: A Framework for Generating Synthetic Job Postings to Enhance Skill Matching
Neoplasm Entity Recognition: matching disease names to ontology classes
Master's Degree Final Project using Python & NLP
An end-to-end entity matching system
Libem sample datasets.
:coffee: Multi-source ORM for Javascript Client+Server
Fair Entity Matching: A Fairness Suite for Auditing Entity Matching Approaches
Libem notebooks.
An exploration of generalizable approaches to unsupervised entity matching for use in linking tabular public energy data sources.
Entity Matching specific Explanation tool. Landmark generates reliable and coherent explanations through a perturbation analysis.