arx-deidentifier / arx

ARX is a comprehensive open source data anonymization tool that aims to provide scalability and usability. It supports various anonymization techniques, methods for analyzing data quality and re-identification risks, and well-known privacy models such as k-anonymity, l-diversity, t-closeness and differential privacy.

Home Page: http://arx.deidentifier.org/

[BUG] Setting max search space size doesn't work

prasser opened this issue

Describe the bug

I get the following message when trying to anonymize a dataset with ~900,000 (global) transformations, although I set the limit to 1,000,000 in the settings dialog:

[Image: screenshot of the error message]

A sample dataset to reproduce the problem is the American Community Survey (ACS). The CSV file can be downloaded from here.

I don't think this is a good example. First, reproducing the problem requires a dataset plus hierarchies. Second, the standard configuration that we use for SS13ACS consists of 34131947028480 (global) transformations. This is much larger than 2^31-1 (=2147483647), and ARX generally doesn't support optimal search for spaces consisting of more than 2^31-1 (global) transformations for other reasons.
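
To make the size comparison concrete, here is a minimal Java sketch. It is not ARX code: the class `SearchSpaceCheck` and its methods are hypothetical names used for illustration only. It assumes the number of global transformations is the product of the number of generalization levels per quasi-identifier, and it checks the two sizes mentioned in this thread against 2^31-1 (`Integer.MAX_VALUE`). The level counts passed to `size` in `main` are made up.

```java
import java.math.BigInteger;

// Hypothetical helper, not part of the ARX API.
class SearchSpaceCheck {

    // 2^31-1, the bound mentioned above for optimal search.
    static final BigInteger LIMIT = BigInteger.valueOf(Integer.MAX_VALUE);

    // Assumption: the search space size is the product of the number of
    // generalization levels of each quasi-identifier's hierarchy.
    static BigInteger size(int... levelsPerAttribute) {
        BigInteger product = BigInteger.ONE;
        for (int levels : levelsPerAttribute) {
            product = product.multiply(BigInteger.valueOf(levels));
        }
        return product;
    }

    // True if the given size does not exceed 2^31-1.
    static boolean fitsOptimalSearch(BigInteger size) {
        return size.compareTo(LIMIT) <= 0;
    }

    public static void main(String[] args) {
        // Made-up level counts, only to show how the product is formed.
        System.out.println(size(3, 4, 5)); // prints 60

        // ~900,000 transformations, as in the original report.
        System.out.println(fitsOptimalSearch(BigInteger.valueOf(900_000L))); // prints true

        // The SS13ACS standard configuration quoted above.
        System.out.println(fitsOptimalSearch(new BigInteger("34131947028480"))); // prints false
    }
}
```

Under these assumptions, the reported ~900,000 transformations are well below the 2^31-1 bound, while the SS13ACS configuration exceeds it by several orders of magnitude. This is why the latter cannot serve as a reproduction case for the settings-dialog limit.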