project-deepform / deepform

Experimental form data extraction for journalism

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Fix 2012 duplicate data problems

jstray opened this issue · comments

New 2012 data seems to have many duplicates of some documents