usc-isi-i2 / Web-Karma

Information Integration Tool

Home Page: http://www.isi.edu/integration/karma/

Dataset Column mappings

vagarwal77 opened this issue

Describe the bug
I have two large datasets, each with approximately 45 tables of about 20 columns each.
I would like to map the columns in one dataset's tables to the columns in the other dataset's tables, based on column-name matching, column descriptors, or some other semantic method.
Can Karma or any Python library perform this task?

Any pointers would be a great help.

You can create an ontology mapping for one dataset. Then, when you load a new dataset, Karma will recommend labels for its columns based on column similarity. Column similarity is determined from the column names, the data in the columns, and similar features.
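
For the name-matching part of the question, a rough first pass can also be done in plain Python. The sketch below is not Karma's algorithm; it only compares normalized column names with the standard library's `difflib.SequenceMatcher`. The function names (`normalize`, `name_similarity`, `match_columns`) and the `threshold` value of 0.6 are illustrative assumptions, and a real mapping would likely also need to compare column values or descriptions.

```python
from difflib import SequenceMatcher


def normalize(name):
    """Lowercase a column name and drop non-alphanumeric characters."""
    return "".join(ch for ch in name.lower() if ch.isalnum())


def name_similarity(a, b):
    """Return a similarity ratio in [0, 1] between two normalized names."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def match_columns(source_cols, target_cols, threshold=0.6):
    """Map each source column to its most similar target column by name.

    A source column maps to None when its best score is below `threshold`
    (an arbitrary cutoff chosen for illustration).
    """
    mapping = {}
    for src in source_cols:
        best = max(
            target_cols,
            key=lambda tgt: name_similarity(src, tgt),
            default=None,
        )
        if best is not None and name_similarity(src, best) >= threshold:
            mapping[src] = best
        else:
            mapping[src] = None
    return mapping


if __name__ == "__main__":
    # Hypothetical column names, for illustration only.
    source = ["Cust_ID", "order_date", "zzz"]
    target = ["customer_id", "OrderDate", "amount"]
    for src, tgt in match_columns(source, target).items():
        print(f"{src} -> {tgt}")
```

Name-only matching like this tends to miss synonyms (for example `dob` versus `birth_date`), which is where a data-driven recommender such as the one described above becomes useful.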