ustczhouyu / pan-pytorch

This is an unofficial PyTorch re-implementation of paper "Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network" published in ICCV 2019.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Pixel Aggregation Network

This is an unofficial PyTorch re-implementation of paper "Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network" published in ICCV 2019, with PyTorch >= v1.4.0.

Task

  • Backbone model
  • FPEM model
  • FFM model
  • Integrated model
  • Loss Function
  • Data preprocessing
  • Data postprocessing
  • Training pipeline
  • Inference pipeline
  • Evaluation pipeline

Command

Training

python train.py --batch 32 --epoch 5000 --dataset_type ctw --gpu True

Inference

python inference.py --input ./data/CTW1500/test/text_image --model ./outputs/model_epoch_0.pth --bbox_type poly

Results

CTW1500

Statstics for CTW training

Model Precision Recall F score FPS (CPU) + pa.py FPS (1 GPU) + pa.py FPS (1 GPU) + pa.pyx
PAN-640 0.8509 0.7927 0.8208 0.3493 4.6347 21.167

TotalText

Statstics for TT training

Model Precision Recall F score FPS (CPU) + pa.py FPS (1 GPU) + pa.py FPS (1 GPU) + pa.pyx
PAN-640 0.9011 0.8040 0.8498 0.2883 7.6481 20.390

SynthText

Statstics for SynthText training

Supported Dataset

Source

[1] Original paper: https://arxiv.org/abs/1908.05900

[2] Official PyTorch code: https://github.com/whai362/pan_pp.pytorch

About

This is an unofficial PyTorch re-implementation of paper "Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network" published in ICCV 2019.

License:MIT License


Languages

Language:Python 100.0%