moj-analytical-services / splink

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

Home Page:https://moj-analytical-services.github.io/splink/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

[FEAT] Linkage stats

ADBond opened this issue · comments

On a semi-related note to graph metrics, it might be useful to provide a simple method to create some 'linkage-level' summary stats, e.g.:

  • total number of clusters
  • total number of nodes (original records)
  • total number of links scored