Datalab (datalab-to)

Datalab

datalab-to

Organization data from Github https://github.com/datalab-to

Developing state of the art document intelligence models.

Location:United States of America

Home Page:https://www.datalab.to

GitHub:@datalab-to

Twitter:@datalabto

Datalab's repositories

marker

Convert PDF to markdown + JSON quickly with high accuracy

Language:PythonLicense:GPL-3.0Stargazers:29811Issues:107Issues:617

surya

OCR, layout analysis, reading order, table recognition in 90+ languages

Language:PythonLicense:GPL-3.0Stargazers:18878Issues:122Issues:264

pdftext

Extract structured text from pdfs quickly

Language:PythonLicense:Apache-2.0Stargazers:624Issues:8Issues:16

docext

An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)

Language:PythonLicense:Apache-2.0Stargazers:5Issues:0Issues:0
Language:PythonLicense:MITStargazers:5Issues:0Issues:4
Language:PythonStargazers:2Issues:0Issues:0

datalab-on-prem

Scripts to run Datalab's self-service on-prem container

Language:ShellStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0