MarcGrotheer's repositories
core
Collection of OCR-related Python tools and wrappers from @OCR-D
invoice2data
Extract structured data from PDF invoices
deep-learning-german-tts
The free German voice dataset.
awesome-ocr
Links to awesome OCR projects
xrechnung-visualization
XSL transformations for web rendering of German CIUS XRechnung or EN16931-1:2017
handsontable
JavaScript/HTML5 Data Grid Component with Spreadsheet Look & Feel. Available for React, Vue and Angular.
openrpa
Free Open Source Enterprise Grade RPA
PdfPig
Read and extract text and other content from PDFs in C# (port of PdfBox)
PdfPigMLNetBlockClassifier
Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a pdf document page as either title, text, list, table and image.
lopdf
A Rust library for PDF document manipulation.
clawPDF
Open Source virtual PDF printer for Windows // Print to PDF, PDF/A, PDF/X, PNG, JPEG, TIF and text
Net-Core-DocX-HTML-To-PDF-Converter
.NET Core library to create custom reports based on Word docx or HTML documents and convert to PDF
Remotely
A remote control and remote scripting solution, built with .NET Core and SignalR Core.
hocrjs
Working with hOCR in JavaScript
PdfiumViewer
PDF viewer based on Google's PDFium.
OutlookFileDrag
Drag and drop Outlook items as files into any application
isometric-contributions
Browser extension for rendering an isometric pixel art version of your GitHub contribution graph.
Hocr
C# Library for converting PDF files to Searchable PDF Files
html-invoice-generator
JavaScript tool that will transform your HTML invoice template to fully functional invoice editor
darkroomjs
[UNMAINTAINED] Extensible image editing tool in your browser
OCR-Invoice
A console application that runs on a Windows server to scan users' bills and receipts, either captured by camera or supplied as electronic files such as PDF. 1. All invoices/receipts are uploaded to a folder on the server. 2. The OCR app scans the uploaded invoices/receipts, extracts the following information, and stores it in a database table: vendor/party name, invoice date, tax amount, total amount, and line items (item name, item quantity, item rate, item tax, and item amount). 3. OCR processing should achieve 90% accuracy. 4. The application should be able to handle noise and varying quality in the uploaded invoice images.
pdfjs-viewer
A built Bower version of PDF.js
hocrimagemapper
Tool for visualizing hOCR output from Tesseract (or other OCR engines that support hOCR).
xpath-tester
A simple online tester for all your XPath queries, using a little bit of PHP and jQuery.