Track whether input files have changed so that data are only processed when they have been updated
mstackhouse opened this issue · comments
Michael Stackhouse commented
The basic flow would need to look like:
- Capture input files
- Generate a checksum of each file
- Cache the checksums someplace readable
- On a subsequent run, compare against the pre-existing checksums
- If the input data have been updated, flag that the file has changed
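A rough sketch of that flow, in Python purely for illustration. The names `file_checksum` and `changed_inputs`, the choice of SHA-256, and the JSON cache file are all assumptions made for this example and are not part of any existing API.

```python
import hashlib
import json
from pathlib import Path


def file_checksum(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def changed_inputs(paths, cache_file):
    """Compare current checksums with the cached ones, then refresh the cache.

    Returns the paths (as strings) whose checksum is new or different.
    On the first run there is no cache, so every file is reported as changed.
    """
    cache_path = Path(cache_file)
    previous = json.loads(cache_path.read_text()) if cache_path.exists() else {}
    current = {str(p): file_checksum(p) for p in paths}
    changed = [p for p, digest in current.items() if previous.get(p) != digest]
    # Hypothetical cache location: a JSON file mapping path -> checksum.
    cache_path.write_text(json.dumps(current, indent=2))
    return changed
```

A caller could then skip processing whenever `changed_inputs(...)` returns an empty list. Checksums catch content changes even when timestamps are unreliable, at the cost of reading every file on every run.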
Maya Gans commented
Function factory?
Maya Gans commented
Dev and Prod readers with a timestamp checker: the function handles the logistics, but you can adapt and extend it.
Add vignette with examples using local data, S3, whatever...?
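To make the function factory idea concrete, here is a minimal sketch in Python, for illustration only. `make_reader`, `read_fn`, and `stamp_fn` are hypothetical names invented for this example. The default timestamp check uses the local file's modification time, so a remote source such as S3 would need its own `stamp_fn`.

```python
import os


def local_mtime(path):
    """Default timestamp check: modification time of a local file, in ns."""
    return os.stat(path).st_mtime_ns


def make_reader(read_fn, stamp_fn=local_mtime):
    """Build a reader that re-reads a source only when its timestamp changes.

    read_fn:  callable taking a path and returning the loaded data.
    stamp_fn: callable taking a path and returning a comparable timestamp
              (hypothetical hook so other backends can supply their own).
    """
    state = {}  # path -> (timestamp, cached data)

    def reader(path):
        stamp = stamp_fn(path)
        cached = state.get(path)
        if cached is not None and cached[0] == stamp:
            return cached[1], False  # unchanged: reuse the cached data
        data = read_fn(path)
        state[path] = (stamp, data)
        return data, True  # new or modified: the data were re-read

    return reader
```

Each call to `make_reader` returns an independent reader with its own cache, so a Dev reader and a Prod reader can wrap different `read_fn` and `stamp_fn` pairs while sharing the same logistics. The second element of the returned tuple is the "has changed" flag.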