Bram Zijlstra's repositories
awesome-dutch-nlp
A curated list for references to Dutch NLP libraries, datasets, and interesting literature.
Opensubtitles_dataset
downloads and parses subtitle dataset from opensubtitles.org
opus-api
OPUS (opus.nlpl.eu) Python3 API
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
SynthText
Code for generating synthetic Dutch text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016. This repo is a fork