AI-SDC / ACRO

Tools for the Automatic Checking of Research Outputs. These are the tools for researchers to use as drop-in replacements for commands that produce outputs in Stata Python and R

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Adding information to flag TRE staff to look for disclosure by differencing

jim-smith opened this issue · comments

Since we have the commands and the table it should not be too hard to capture:
Each row/column name in a table:

  • and what the set of unique values for them is
  • and what the total count of records in a table is.

That would allow for some simple checks against differencing

  • at minimum we could want which outputs report on similar attributes
  • and if two tables share the same attributes but have different counts, we would know that an exclusion operation had ben applied which might leave the excluded people vulnerable
    • e.. if table cell for some combination of attributes has count 11, and in a second table with smaller total combat, the cell has value 10, then we can infer that a table made from the excluded set would have 1 in that cell.

duplicate of #117 which is more detailed