parsee-ai / parsee-datasets

Datasets, case studies and benchmarks for extracting structured information from PDFs, HTML files or images, created by the Parsee.ai team. Datasets also on Hugging Face: https://huggingface.co/parsee-ai

Home Page:https://parsee.ai

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Parsee.ai Datasets

Datasets, case studies and benchmarks for extracting structured information from PDFs, HTML files or images, created by the Parsee.ai team.

finRAG

Extracting revenue figures from publicly available financial reports using text or images.

Document Loader Comparisons

We will be adding datasets here on a continuing basis.

About

Datasets, case studies and benchmarks for extracting structured information from PDFs, HTML files or images, created by the Parsee.ai team. Datasets also on Hugging Face: https://huggingface.co/parsee-ai

https://parsee.ai

License:MIT License


Languages

Language:Jupyter Notebook 97.9%Language:Python 2.1%