Jakob Kofoed Janot (jakobjanot)

jakobjanot

Geek Repo

Company:Schultz

Location:Copenhagen, Denmark

Github PK Tool:Github PK Tool

Jakob Kofoed Janot's repositories

dotfiles

dotfiles

Language:ShellStargazers:2Issues:1Issues:0

unpaper

Forked unpaper repository

Language:CLicense:GPL-2.0Stargazers:1Issues:1Issues:0
Language:CSSStargazers:0Issues:0Issues:0

ConceptualSearch

Train a Word2Vec model and Implement Conceptual Search\Semantic Search in Solr\Lucene - Simon Hughes Dice.com, Dice Tech Jobs

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

deskew

Library used to deskew a scanned document

License:MITStargazers:0Issues:0Issues:0

DIVAServices

Repository of the back end implementation of DivaServices

Language:TypeScriptLicense:LGPL-2.1Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
License:AGPL-3.0Stargazers:0Issues:0Issues:0

image_text_reader

The module extracts text from image using the tesseract-OCR engine. Generally, text present in the images are blur or are of uneven sizes. The image is pre-processed for better comprehension by OCR. This module first makes bounding box for text in images and then normalizes it to 300 dpi, suitable for OCR engine to read.

License:MITStargazers:0Issues:0Issues:0

ldspider

A crawler for the Linked Data web

License:NOASSERTIONStargazers:0Issues:0Issues:0

libxml-ruby

Libxml bindings for Ruby.

Language:CLicense:MITStargazers:0Issues:0Issues:0

multimarkdown-ffi

A Multimarkdown wrapper for Ruby

Language:CLicense:MITStargazers:0Issues:1Issues:0

NER-BERT-pytorch

PyTorch solution of named entity recognition task Using Google AI's pre-trained BERT model.

License:MITStargazers:0Issues:0Issues:0

ocr-conversion

Conversions between various OCR formats

Stargazers:0Issues:0Issues:0

ocr_testing

Scripts and results from our OCR roundup, available on Source

Language:RubyStargazers:0Issues:1Issues:0

presentations

Presentations

Language:HTMLStargazers:0Issues:0Issues:0

prima-page-converter

Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as well as ALTO XML, FineReader XML, and HOCR

License:Apache-2.0Stargazers:0Issues:0Issues:0

PRLib

Pre-Recognize Library - library with algorithms for improving OCR quality.

License:MITStargazers:0Issues:0Issues:0

ruby-chardet

Charset detector. A Ruby clone of Mozilla's chardet

Language:RubyLicense:LGPL-2.1Stargazers:0Issues:0Issues:0
Language:RubyStargazers:0Issues:0Issues:0
Language:RubyStargazers:0Issues:0Issues:0
Language:RubyStargazers:0Issues:0Issues:0

saxon

Ruby wrapper for Saxon

License:MPL-2.0Stargazers:0Issues:0Issues:0

servlex

Servlex, an implementation of the EXPath Webapp framework

Stargazers:0Issues:0Issues:0

SolrPlugins

Dice Solr Plugins from Simon Hughes Dice.com

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

stanford-core-nlp-jruby

A jruby wrapper of the Stanford Core NLP package

Language:RubyLicense:GPL-3.0Stargazers:0Issues:0Issues:0

style-guides-presentation

In Defence of Style Guides, presentation for Balisage 2018

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

tesseract-recognize

Tool for doing layout analysis and OCR using tesseract in Page XML format

Language:C++License:MITStargazers:0Issues:1Issues:0

XSDtoRNG

XSL stylesheet for XML Schema (XSD) to Relax NG (RNG) conversion.

License:Apache-2.0Stargazers:0Issues:0Issues:0