felipeboffnunes / Rectangulum

A generative dataset for pdf content extraction.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Rectangulum

Create scientific article formatted fake pdfs with coordinates for each data type.

About

A generative dataset for pdf content extraction.


Languages

Language:TeX 98.2%Language:Python 1.8%