This is an attempt to repeat the ChEMBL duplicate test as described in O'Boyle (2012).
To run the workflow:
-
Ensure Open Babel v2.3.2 (the version used in the publication) is installed and executable using
obabel
. -
Install
IPC::Run3
Perl module. -
Execute
make distclean
to cleanup precomputed results. -
Execute
make
(-j
should help to speed up a bit using parallel processes).
ChEMBL v13 (version used in the publication) data is downloaded on-the-fly and is licensed under CC-BY-SA 3.0 Unported. The Makefile is CC0.