pd3f's repositories
pd3f-dataset-bmjv
Dataset of (mostly German) PDFs used to develop pd3f
pd3f-results
Results with pd3f on some PDF datasets
PDF text extraction pipeline: self-hosted, local-first and Docker-based
Dataset of (mostly German) PDFs used to develop pd3f
Results with pd3f on some PDF datasets