tensorflow / data-validation

Library for exploring and validating machine learning data

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

TFDV Anomaly for distribution constraints min_domain_mass

cah-aswini-jalla opened this issue · comments

Hi,

I was trying to relax the fraction of a string value as below. But it was causing an anomaly(attached image) which I couldn't understand. Is there a way I can set min domain mass and avoid this anomaly.

schema = tfdv.infer_schema(old_dataset_stats)
date_feature = tfdv.get_feature(schema, 'date')
date_feature.distribution_constraints.min_domain_mass = 0.5
anomalies = tfdv.validate_statistics(statistics=new_dataset_stats, schema=schema)
display_anomalies(anomalies)

Thanks.
anomaly_date