Anphisa / valentine

A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching methods.

Schema matcher tests based on Valentine

For my master's thesis on schema integration, I did some tests on schema matching algorithms. These are based on Valentine (https://github.com/delftdata/valentine).

Three subfolders with data needed:

Valentine was extended with a JaccardLevenshtein matcher that compares column names only, ignoring column values (https://github.com/Anphisa/valentine/tree/master/valentine/algorithms/jaccard_levenshtein_colnames_only).
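To give a rough idea of what name-only Jaccard-Levenshtein matching can look like, here is a minimal, self-contained sketch. It is not the code from the linked folder and does not use Valentine's API. It assumes one plausible interpretation: each column name is split into tokens, two tokens count as equal when their normalized Levenshtein similarity reaches a threshold, and the score is the Jaccard ratio over those fuzzy-matched tokens. The function names (`levenshtein`, `similar`, `tokens`, `fuzzy_jaccard`, `match_columns`) and the default threshold of 0.8 are made up for illustration.

```python
import re


def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (row-by-row dynamic programming)."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # deletion
                current[j - 1] + 1,            # insertion
                previous[j - 1] + (ca != cb),  # substitution
            ))
        previous = current
    return previous[-1]


def similar(a: str, b: str, threshold: float) -> bool:
    """True if the normalized Levenshtein similarity reaches the threshold."""
    longest = max(len(a), len(b))
    if longest == 0:
        return True
    return 1 - levenshtein(a, b) / longest >= threshold


def tokens(name: str) -> set:
    """Split a column name on camelCase boundaries and non-alphanumerics."""
    spaced = re.sub(r"(?<=[a-z0-9])(?=[A-Z])", " ", name)
    return {t.lower() for t in re.split(r"[^A-Za-z0-9]+", spaced) if t}


def fuzzy_jaccard(name_a: str, name_b: str, threshold: float = 0.8) -> float:
    """Jaccard similarity of name tokens, with fuzzy token equality."""
    ta, tb = tokens(name_a), tokens(name_b)
    if not ta or not tb:
        return 0.0
    # Count tokens of A that have a fuzzy match in B, capped so the
    # score cannot exceed 1.
    matched = sum(any(similar(x, y, threshold) for y in tb) for x in ta)
    intersection = min(matched, len(tb))
    union = len(ta) + len(tb) - intersection
    return intersection / union


def match_columns(cols_a, cols_b, threshold: float = 0.8) -> dict:
    """Score every column pair and return the non-zero ones, best first."""
    scores = {
        (a, b): fuzzy_jaccard(a, b, threshold)
        for a in cols_a
        for b in cols_b
    }
    ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
    return {pair: score for pair, score in ranked if score > 0}


if __name__ == "__main__":
    source = ["customer_id", "adress"]
    target = ["CustomerID", "address", "phone"]
    for pair, score in match_columns(source, target).items():
        print(pair, round(score, 2))
```

Because only the names are compared, a sketch like this runs without loading any column data, but it cannot detect matches between columns whose names are unrelated.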

License: Apache License 2.0


Languages

Jupyter Notebook 98.7%, Python 1.3%