John (johndocs)

Company: I'm John

Location: Santa Monica

John's repositories

Language: R | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: TypeScript | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

docnet

DocNET is a fast PDF editing and reading library for modern .NET applications

Language: C# | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

finetune

Scikit-learn style model finetuning for NLP

Language: Python | License: MPL-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

go-pdfium-render

A Go library that uses pdfium (via cgo) to render PDFs to images

Language: Go | Stargazers: 0 | Issues: 0 | Issues: 0

johndocs

Config files for my GitHub profile.

Stargazers: 0 | Issues: 1 | Issues: 0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded and IoT devices)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

paperless-ng

A supercharged version of paperless: scan, index and archive all your physical documents

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pdf-anonymizer

A script to anonymize PDFs

Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

pdf-corpora

An index of PDF-centric corpora

License: CC-BY-4.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pdf-js-csv

Exploring extracting tables from a PDF to CSV using PDF.JS

Language: JavaScript | Stargazers: 0 | Issues: 0 | Issues: 0

pdf2json

A PDF file parser that converts PDF binaries to text-based JSON, powered by a fork of PDF.JS

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

pdfcpu

A PDF processor written in Go.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pdfplumber

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0