moj-analytical-services / splink_datasets

Repo containing Splink's In-built datasets

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Data previews

ADBond opened this issue · comments

commented

Might be nice to have a notebook or something showing the first few rows of each dataset to get an idea of what they look like - particularly useful for parquet files as they are not directly inspectable

commented

In particular it might be nice to generate these here for inclusion in Splink docs - downside of doing it on the Splink side is that we would need to download the data when building the docs.