project-deepform / deepform

Experimental form data extraction for journalism

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Pull PDFs on demand for annotation

moredatarequired opened this issue · comments

Instead of having to run with ~30G of attached PDFs for all of our source documents, since we only need a handful at the end of a given training run for annotation, we should keep them in a publicly-available known location and download them on-demand as needed.