There are 7 repositories under the entity-matching topic.
An open-source, highly scalable toolkit in Java for Entity Resolution.
Entity resolution for Elasticsearch.
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.
Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.
MetaSRA: normalized sample-specific metadata for the Sequence Read Archive
Entity Matching Model solves the problem of matching company names between two possibly very large datasets.
Code for the paper "CollaborEM: A Self-supervised Entity Matching Framework Using Multi-features Collaboration". TKDE 2021.
Spark Search - high performance advanced search features based on Apache Lucene
Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"
Code for the paper "PromptEM: Prompt-tuning for Low-resource Generalized Entity Matching". VLDB 2023.
An open-source compound AI toolchain for fast and accurate entity matching, powered by LLMs.
Alignment, a collaborative, system-aided, user-driven ontology/vocabulary matching and validation platform.
WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing, schema matching, identity resolution, data fusion, and result evaluation.
A collection of awesome resources regarding Record Linkage.
A Winner-Take-All Hashing-Based Unsupervised Model for Entity Resolution Problems. [B. Sc. Thesis]
Code and data for the paper "Bridging the Gap between Reality and Ideality of Entity Matching: A Revisiting and Benchmark Re-Construction"
Neoplasm Entity Recognition: matching disease names to ontology classes
Libem sample datasets.
Multi-source ORM for JavaScript Client+Server
JOBSKAPE: A Framework for Generating Synthetic Job Postings to Enhance Skill Matching
Master's Degree Final Project using Python & NLP
Code for the paper "Match, Compare, or Select? An Investigation of Large Language Models for Entity Matching"
Fair Entity Matching: A Fairness Suite for Auditing Entity Matching Approaches
Libem notebooks.
An exploration of generalizable approaches to unsupervised entity matching for use in linking tabular public energy data sources.
Entity matching on music album data across two tables extracted from metacritic.com and Wikipedia.
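This repository is a supplementary resource for the research article "Deep Learning Untuk Entity Matching Produk Kamera Antar Online Store Menggunakan DeepMatcher" (in English: "Deep Learning for Entity Matching of Camera Products Across Online Stores Using DeepMatcher").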