Frank B. (frankiert)

frankiert

Geek Repo

Location:Leipzig, Germany

Twitter:@frankiert

Github PK Tool:Github PK Tool

Frank B.'s repositories

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

awesome-data-labeling

A curated list of awesome data labeling tools

Stargazers:0Issues:0Issues:0

Awesome-Table-Recognition

A curated list of resources dedicated to table recognition

Stargazers:0Issues:0Issues:0

BIG-bench-1

Beyond the Imitation Game collaborative benchmark for enormous language models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

bonito

A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

CRASS-data-set

The data for the CRASS-benchmark. See: https://www.crass.ai for further information.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

doc-hcii2022-slides

Slides to our HCII 2022 talk on "Putting users in the loop: How User Research Can Guide AI Development for a Consumer-Oriented Self-service Portal". Imported from https://git.informatik.uni-leipzig.de/smarthec/doc-hcii2022-slides

Stargazers:0Issues:1Issues:0

docquery

An easy way to extract information from documents

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

DocumentLayoutAnalysis

Document Layout Analysis resources repos for development with PdfPig.

Language:C#Stargazers:0Issues:0Issues:0

GastCluster

A set of bash scripts to spread number crunching jobs across several machines and collect the results back into a single file

License:Apache-2.0Stargazers:0Issues:1Issues:0

layout-parser

A Python Library for Document Layout Understanding

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ocrd_segment

OCR-D-compliant page segmentation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pdfix_sdk_example_cpp

Make PDF Files Accessible, Extract Data from PDF, Convert PDF to HTML, Fill-in PDF Form, Stamp PDF and more...

Stargazers:0Issues:0Issues:0

pdfix_sdk_example_python

PDFix SDK samples for Python. PDF manipulation, content extraction, conversion , accessibility and more...

Language:PythonStargazers:0Issues:0Issues:0

PLIX

PLIX (Pipeline for Information Extraction) is a Python package and command line tool for information extraction from (PDF) documents.

License:Apache-2.0Stargazers:0Issues:0Issues:0

SciTSR

Table structure recognition dataset of the paper: Complicated Table Structure Recognition

License:MITStargazers:0Issues:0Issues:0

todo.md

TODO.md file format - todomd.org

Stargazers:0Issues:0Issues:0