Survey-of-Visual-Text-Processing

The official project of paper "Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing"

This repository contains a paper collection of recent works for visual text processing tasks.

📖 Table of Contents 👀

Text Image Super-resolution
Document Image Dewarping
Text Image Denosing
Scene Text Removal
Scene Text Editing
Scene Text Generation

Text Image Super-resolution

Boosting Optical Character Recognition: A Super-Resolution Approach (2015 arxiv) paper
Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network (2017 CVPR) paper
TextSR: Content-Aware Text Super-Resolution Guided by Recognition (2019 arxiv) paper code
Selective Super-Resolution for Scene Text Images (2019 ICDAR) paper
Text-Attentional Conditional Generative Adversarial Network for Super-Resolution of Text Images (2019 ICME) paper
Collaborative Deep Learning for Super-Resolving Blurry Text Images (2020 TCI) paper
PlugNet: Degradation Aware Scene Text Recognition Supervised by a Pluggable Super-Resolution Unit (2020 ECCV) paper
Scene Text Image Super-Resolution in the Wild (2020 ECCV) paper code
Scene Text Telescope: Text-Focused Scene Image Super-Resolution (2021 CVPR) paper
Scene Text Image Super-Resolution via Parallelly Contextual Attention Network (2021 CVPR) paper
Text Prior Guided Scene Text Image Super-Resolution (2021 TIP) paper code
A text attention network for spatial deformation robust scene text image super-resolution (2022 CVPR) paper code
C3-STISR: Scene Text Image Super-resolution with Triple Clues (2022 IJCAI) [paper]
Text gestalt: Stroke-aware scene text image super-resolution (2022 AAAI) paper code
A Benchmark for Chinese-English Scene Text Image Super-Resolution (2023 ICCV) paper code
Text Image Super-Resolution Guided by Text Structure and Embedding Priors (2023 ACM MM) paper
Improving Scene Text Image Super-Resolution via Dual Prior Modulation Network (2023 AAAI) paper code
Learning Generative Structure Prior for Blind Text Image Super-Resolution (2023 CVPR) paper code

Document Image Dewarping

A Fast Page Outline Detection and Dewarping Method Based on Iterative Cut and Adaptive Coordinate Transform (2019 ICDARW) paper
DocUNet: Document Image Unwarping via a Stacked U-Net （2018 CVPR）paper
DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks (2019 ICCV) [paper code
Document rectification and illumination correction using a patch-based CNN (2019 TOG) paper
Dewarping Document Image by Displacement Flow Estimation with Fully Convolutional Network (2020 IAPR) paper
Geometric rectification of document images using adversarial gated unwarping network (2020 PR) paper
DocScanner: Robust Document Image Rectification with Progressive Learning (2021 arxiv) paper
End-to-End Piece-Wise Unwarping of Document Images (2021 ICCV) paper
Document Dewarping with Control Points (2021 ICDAR) paper paper
DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction (2021 ACM MM) paper code
Revisiting Document Image Dewarping by Grid Regularization (2022 CVPR) paper
Fourier Document Restoration for Robust Document Dewarping and Recognition ((2022 CVPR) paper
Learning an Isometric Surface Parameterization for Texture Unwrapping (2022 ECCV) paper code
Geometric Representation Learning for Document Image Rectification (2022 ECCV) paper
Learning From Documents in the Wild to Improve Document Unwarping (2022 SIGGRAPH) paper code
Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild (2023 ACM MM) paper code
DocAligner: Annotating Real-world Photographic Document Images by Simply Taking Pictures (2023 arxiv) paper
DocMAE: Document Image Rectification via Self-supervised Representation Learning (2023 ICME*) paper
Deep Unrestricted Document Image Rectification (2023 arxiv) paper code
Layout-aware Single-image Document Flattening (2023 TOG) paper code

Text Image Denosing

Shading Removal of Illustrated Documents (2013 ICDAR) paper
Nonparametric illumination correction for scanned document images via convex hulls (2013 TPAMI) paper
Removing shadows from images of documents (2016 ACCV) paper
Document enhancement using visibility detection (2018 CVPR) paper
Water-Filling: An Efficient Algorithm for Digitized Document Shadow Removal (2018 ACCV) paper
Learning to Clean: A GAN Perspective (2018 ACCVW) paper
Deeperase: Weakly supervised ink artifact removal in document text images (2020 WACV) paper
From Shadow Segmentation to Shadow Removal (2020 ECCV) paper
BEDSR-Net: A Deep Shadow Removal Network From a Single Document Image (2020 CVPR) paper
Light-Weight Document Image Cleanup Using Perceptual Loss (2021 ICDAR) paper
RecycleNet: An Overlapped Text Instance Recovery Approach (2021 ACM MM) paper
End-to-End Unsupervised Document Image Blind Denoising (2021 ICCV) paper
Bijective mapping network for shadow removal (2022 CVPR) paper
Style-guided shadow removal (2022 ECCV) paper code
UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior (2022 ACM MM) paper code
LP-IOANet: Efficient High Resolution Document Shadow Removal (2023 ICASSP) paper
Shadow Removal of Text Document Images Using Background Estimation and Adaptive Text Enhancement (2023 ICASSP) paper
Mask-Guided Stamp Erasure for Real Document Image (2023 ICME) paper
Document Image Shadow Removal Guided by Color-Aware Background (2023 CVPR) paper
DocDiff: Document Enhancement via Residual Diffusion Models (2023 ACM MM) paper code
DocNLC: ADocument Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degradations (2024 AAAI) paper code

Scene Text Removal

Image-to-Image Translation with Conditional Adversarial Networks (2017 CVPR) paper
Scene text eraser (2017 ICDAR) paper
Automatic Semantic Content Removal by Learning to Neglect (2018 BMVC) paper
Ensnet: Ensconce text in the wild (2019 AAAI) paper code
Mtrnet: A generic scene text eraser (2019 ICDAR) paper
Erasenet: End-to-end text removal in the wild (2020 TIP) paper code
Mtrnet++: One-stage mask-based scene text eraser (2020 CVIU) paper
Erasing scene text with weak supervision (2020 WACV) paper
Stroke-Based Scene Text Erasing Using Synthetic Data for Training (2021 TIP) paper
Text region conditional generative adversarial network for text concealment in the wild (2021 TCSVT) paper
Two-Stage Seamless Text Erasing On Real-World Scene Images (2021 ICIP) paper
Scene text removal via cascaded text stroke detection and erasing (2022 CVM) paper
Self-supervised text erasing with controllable image synthesis (2022 ACM MM) paper
Multi-branch network with ensemble learning for text removal in the wild (2022 ACCV) paper
The Surprisingly Straightforward Scene Text Removal Method with Gated Attention and Region of Interest Generation: A Comprehensive Prominent Model Analysis (2022 ECCV) paper code
Don’t forget me: accurate background recovery for text removal via modeling local-global context (2022 ECCV) paper code
Psstrnet: progressive segmentation-guided scene text removal network (2022 ICME) paper
Fetnet: Feature erasing and transferring network for scene text removal (2023 PR) paper
Modeling stroke mask for end-to-end text erasing (2023 WACV) paper
Viteraser: Harnessing the power of vision transformers for scene text removal with segmim pretraining (2023 arxiv) paper code
Progressive scene text erasing with self-supervision (2023 CVIU) paper
What is the Real Need for Scene Text Removal? Exploring the Background Integrity and Erasure Exhaustivity Properties (2023 TIP) paper code
Selective scene text removal (2023 BMVC) paper code

Scene Text Editing

Scene text magnifier (2019 ICDAR) paper
Selective style transfer for text (2019 ICDAR) paper code
Editing text in the wild (2019 ACM MM) paper code
Swaptext: Image based texts transfer in scenes (2020 CVPR) paper
Scene text transfer for cross-language (2021 ICIG) paper
Mask-guided gan for robust text editing in the scene (2021 Neurocomputing) paper
Stefann: scene text editor using font adaptive neural network (2020 CVPR) paper
Deep learning-based forgery attack on document images (2021 TIP) paper
Strive: Scene text replacement in videos (2021 ICCV) paper
RewriteNet: Reliable Scene Text Editing with Implicit Decomposition of Text Contents and Styles (2022 CVPRW) paper code
Fast: Font-agnostic scene text editing (2023 arxiv) paper
Letter Embedding Guidance Diffusion Model for Scene Text Editing (2023 ICME) paper
Exploring stroke-level modifications for scene text editing (2023 AAAI) paper code
Textstylebrush: Transfer of text aesthetics from a single example (2023 TPAMI) paper
Self-Supervised Cross-Language Scene Text Editing (2023 ACM MM) paper
Scene style text editing (2023 arxiv) paper
Improving Diffusion Models for Scene Text Editing with Dual Encoders (2023 arxiv) paper code
Towards scene-text to scene-text translation (2023 arxiv) paper
DiffUTE: Universal Text Editing Diffusion Model (2023 NIPS) paper code
On manipulating scene text in the wild with diffusion models (2024 WACV) paper

Scene Text Generation

Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition (2014 arxiv) paper
Synthetic data for text localisation in natural images (2016 CVPR) paper paper code
Text detection in traffic informatory signs using synthetic data (2017 ICDAR) paper
Verisimilar image synthesis for accurate detection and recognition of texts in scenes (2018 ECCV) paper code
Spatial Fusion GAN for Image Synthesis (2019 CVPR) paper
Learning to draw text in natural images with conditional adversarial networks (2019 IJCAI) paper
ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation (2020 CVPR) paper
SynthText3D: synthesizing scene text images from 3D virtual worlds (2020 Science China Information Sciences) paper
UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World (2020 arxiv) paper code
Synthtiger: Synthetic text image generator towards better text recognition models (2021 ICDAR) paper code
Vector Quantized Diffusion Model for Text-to-Image Synthesis (2022 CVPR) paper
Photorealistic text-to-image diffusion models with deep language understanding (2022 NIPS) paper
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers (2022 arxiv) paper code
Character-Aware Models Improve Visual Text Rendering (2022 arxiv) paper
Deepfloyd (2023) code
GlyphDraw: Seamlessly Rendering Text with Intricate Spatial Structures in Text-to-Image Generation (2023 arxiv) paper code
TextDiffuser: Diffusion Models as Text Painters (2023 NIPS) paper code
Glyphcontrol: Glyph conditional control for visual text generation (2023 NIPS) paper code

Cite

If you are interested in it, please star our project! And cite our paper as follows:

@article{shu2024visual,
  title={Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing},
  author={Shu, Yan and Zeng, Weichao and Li, Zhenhang and Zhao, Fangmin and Zhou, Yu},
  journal={arXiv preprint arXiv:2402.03082},
  year={2024}
}

shuyansy / Survey-of-Visual-Text-Processing