Thomas Flassbeck's starred repositories
table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
parsee-pdf-reader
Parsee's PDF reader, specialized in extracting tables with numeric values and in accurately extracting and preserving text paragraphs. Full support for scans and images.
parsee-datasets
Datasets, case studies and benchmarks for extracting structured information from PDFs, HTML files or images, created by the Parsee.ai team. The datasets are also available on Hugging Face: https://huggingface.co/parsee-ai
parsee-core
Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized in PDFs and HTML files. Extensive support for tabular data extraction and multimodal queries.
spring-batch-examples
Spring Batch examples in Kotlin (from simple to advanced)
vue.draggable.next
Vue 3 compatible drag-and-drop component based on Sortable.js
angu-fixed-header-table
AngularJS fixed header scrollable table directive
simfin-tutorials
Tutorials for SimFin - Simple financial data for Python
SearchEngine
Search engine implementing a web crawler, fuzzy search and a simple GUI. 1st semester project
exchangeratesapi
Exchange Rates API
pdfminer.six
Community-maintained fork of pdfminer - we fathom PDF
web-api-examples
Some examples of how the web API can be used to retrieve data from SimFin.
pdf-crawler
SimFin's open source PDF crawler
angular-payments
Module that provides AngularJS directives for formatting, validating and working with payments