DiTo97 / neural-binarization

A toolkit for efficient document image binarization (DIB) as a semantic segmentation problem

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Binarization Segformer

A semantic segmentation model for pixel-wise document image binarization.

TODOs

  • fine-tune Segformer on 1024 $\times$ 1024 images;
  • set reduce_labels=True in Segformer processor to ignore the background;
  • compare valid DIBCO metrics with SauvolaNet's paper.

Overview

Segformer is an efficient semantic segmentation model introduced by Xie et al. in 2021.

In this repository, we will provide a fine-tuning of Segformer for pixel-wise document image binarization.

Dataset

The dataset is an ensemble of 14 datasets replicating the setting used in SauvolaNet by Li et al. in 2021.

Figure 1. An example pair from the Bickley diary dataset

For more information on the dataset, see SauvolaNet's official repository.

About

A toolkit for efficient document image binarization (DIB) as a semantic segmentation problem

License:MIT License


Languages

Language:Jupyter Notebook 70.6%Language:Python 29.4%