datahub-project / datahub

The Metadata Platform for your Data Stack

Home Page:https://datahubproject.io

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Glossary term mismatch between the classifier and UI

ms32035 opened this issue · comments

Describe the bug
When I use the Snowflake field classification feature during metadata ingestion, the assigned terms have URNs like
urn:li:glossaryTerm:IP%20Address and these are not visible in the UI.

When I manually create create a term from the UI, the URN uses UIDs, for example:
urn:li:glossaryTerm:6782cc0a-b9c3-451a-a523-f21692ef401f

As a result the whole automatic classification feature is kind of useless

To Reproduce
Ingest snowflake data with classification enabled

Expected behavior
Consistent term URNs between the ui and the classified.

Screenshots
image

Desktop (please complete the following information):

@ms32035 you can use the "advanced" section to ensure that the urns match. You can also managed the terms generated by the classifier in the UI.

image

@hsheth2 thank's for the suggestion with a custom term ID. My term had a space in the classifier, so that's a case that can't be input through the UI, but I can replace with an underscore. At the same, the autogenerated terms don't show up in the UI.

@ms32035 that piece of it should be fixed by this #10249