Loovelj / scene_text

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

scene_text

Text Detection

Pyramid Mask Text Detector -sensetime, arxiv2019
Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes -baidu, arxiv2019
Character Region Awareness for Text Detection -Clova, CVPR2019
Detecting Text in the Wild with Deep Character Embedding Network -baidu, arxiv2019
TextField: Learning A Deep Direction Field for Irregular Scene Text Detection -Yongchao Xu, Yukang Wang, Wei Zhou, Yongpan Wang, Zhibo Yang, Xiang Bai, arxiv2018
TextMountain: Accurate Scene Text Detection via Instance Segmentation -Yixing Zhu, Jun Du, arxiv2018
Mask R-CNN with Pyramid Attention Network for Scene Text Detection -MSRA, arxiv2018
Scene Text Detection with Supervised Pyramid Context Network -face++, AAAI2019
Pixel-Anchor: A Fast Oriented Scene Text Detector with Combined Networks -cloudwalk, arxiv2018
Improving Rotated Text Detection with Rotation Region Proposal Networks -facebook, arxiv2018
IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection -Alibaba, IJCAI2018
TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes -peking, face++, arxiv2018
PSENET: Shape Robust Text Detection with Progressive Scale Expansion Network -deepinsight, arxiv2018
Arbitrary-Oriented Scene Text Detection via Rotation Proposals -J Ma, W Shao, H Ye, L Wang, H Wang, TMM2018
TextBoxes++: A Single-Shot Oriented Scene Text Detector -Minghui Liao, Baoguang Shi, Xiang Bai, arxiv2018 code
R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection -Samsung, arxiv2018
Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation -Pengyuan Lyu, Cong Yao, Wenhao Wu, Shuicheng Yan, Xiang Bai, arxiv2018
PixelLink: Detecting Scene Text via Instance Segmentation -Dan Deng, Haifeng Liu, Xuelong Li, Deng Cai, aaai2018
EAST: an efficient and accurate scene text detector -Megvii, cvpr2017, code
Scene text detection and segmentation based on cascaded convolution neural networks -Y Tang, X Wu, TIP2017
TextBoxes: A Fast Text Detector with a Single Deep Neural Network. -M Liao, B Shi, X Bai, X Wang, W Liu, AAAI2017, code
Deep direct regression for multi-oriented scene text detection -W He, XY Zhang, F Yin, CL Liu, ICCV2017
Detecting oriented text in natural images by linking segments -B Shi, X Bai, S Belongie, CVPR2017, code
Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection -Yuliang Liu, Lianwen Jin, CVPR2017
Feature Enhancement Network: A Refined Scene Text Detector -Sheng Zhang, Yuliang Liu, Lianwen Jin, Canjie Luo, arxiv2017
Single Shot Text Detector with Regional Attention -Pan He, Weilin Huang, Tong He, Qile Zhu, Yu Qiao, and Xiaolin Li, ICCV2017
A Convolutional Neural Network-Based Chinese Text Detection Algorithm via Text Structure Modeling -Xiaohang Ren, Yi Zhou, Jianhua He, Kai Chen, Xiaokang Yang, Jun Sun, TMM2017
Fused Text Segmentation Networks for Multi-oriented Scene Text Detection -Yuchen Dai, et al, arxiv2017
Scene Text Detection with Novel Superpixel Based Character Candidate Extraction -Cong Wang, Fei Yin, Cheng-Lin Liu, ICDAR2017
WeText: Scene Text Detection under Weak Supervision -Shangxuan Tian, Shijian Lu, Chongshou Li, ICCV2017
WordSup: Exploiting Word Annotations for Character based Text Detection -MSRA, IDL, ICCV2017
Deep Residual Text Detection Network for Scene Text -Xiangyu Zhu, et al, arxiv2017
Cascaded Segmentation-Detection Networks for Word-Level Text Spotting -Siyang Qin, Roberto Manduchi, arxiv2017
Arbitrary-Oriented Scene Text Detection via Rotation Proposals -Jianqi Ma, et al, TMM2017
Multi-oriented text detection with fully convolutional networks -Z Zhang, C Zhang, W Shen, C Yao, CVPR2016
Scene text detection via holistic, multi-channel prediction -C Yao, X Bai, N Sang, X Zhou, S Zhou, arxiv2016

Text Recognition

FACLSTM: ConvLSTM with Focused Attention for Scene Text Recognition -Qingqing Wang, et al, arxiv2019
A Multi-Object Rectified Attention Network for Scene Text Recognition -Canjie Luo, Lianwen Jin, Zenghui Sun, PR2019
Recurrent Calibration Network for Irregular Text Recognition -Hanqing Lu, arxiv2018
ESIR: End-to-end Scene Text Recognition via Iterative Image Rectification -Fangneng Zhan, Shijian Lu, arxiv2018
Synthetically Supervised Feature Learning for Scene Text Recognition -Adobe, ECCV2018
ASTER: An Attentional Scene Text Recognizer with Flexible Rectification -Baixiang, PAMI2018
Edit Probability for Scene Text Recognition -Fudan, Hikvision, cvpr2018
Scene Text Recognition from Two-Dimensional Perspective -Minghui Liao, Cong Yao, Xiang Bai, et al, arxiv2018
SqueezedText: A Real-time Scene Text Recognition by Binary Convolutional Encoder-decoder Network -Zichuan Liu, et al, AAAI2018
SCAN: Sliding Convolutional Attention Network for Scene Text Recognition -Yichao Wu, et al, arxiv2018
NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text Recognition -Fenfen Sheng, et al, arxiv2018
AON: Towards Arbitrarily-Oriented Text Recognition -Hikvision, et al, CVPR2018
An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition -B Shi, X Bai, C Yao , TPAMI2017 code
Scene Text Recognition with Sliding Convolutional Character Models -fei yin, et al, arxiv2017
Focusing Attention: Towards Accurate Text Recognition in Natural Images -Hikvision, et al, ICCV2017
AdaDNNs: Adaptive Ensemble of Deep Neural Networks for Scene Text Recognition -Chun Yang, Xu-Cheng Yin, arxiv2017
Strokelets: A learned multi-scale mid-level representation for scene text recognition -X Bai, C Yao, W Liu , TIP2016
Reading Scene Text in Deep Convolutional Sequences -P He, W Huang, Y Qiao, CC Loy, X Tang, AAAI2016
Text-Attentional Convolutional Neural Network for Scene Text Detection -Tong He, Weilin Huang, Yu Qiao, Jian Yao, TIP2016
Robust Scene Text Recognition with Automatic Rectification -Baoguang Shi, Xinggang Wang, Pengyuan Lyu, Cong Yao, Xiang Bai, CVPR2016
DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images -Zhuoyao Zhong, Lianwen Jin, Shuye Zhang, Ziyong Feng, arxiv2016
Recursive Recurrent Nets with Attention Modeling for OCR in the Wild -Yahoo, CVPR2016

End-to-End & Text Spotting

A Novel Integrated Framework for Learning both Text Detection and Recognition -alibaba, arxiv2018
TextNet: Irregular Text Reading from Images with an End-to-End Trainable Network -baidu, arxiv2018
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes -Pengyuan Lyu, Minghui Liao, Cong Yao, Wenhao Wu, Xiang Bai, arxiv2018
FOTS: Fast Oriented Text Spotting with a Unified Network -Xuebo Liu, Ding Liang, Shi Yan, Dagui Chen, Yu Qiao, Junjie Yan, CVPR2018
E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text -Yash Patel, et al, arxiv2018
SEE: Towards Semi-Supervised End-to-End Scene Text Recognition -Christian Bartz, Haojin Yang, Christoph Meinel, AAAI2018
An end-to-end TextSpotter with Explicit Alignment and Attention -Tong He, Zhi Tian, Weilin Huang, Chunhua Shen, Yu Qiao, Changming Sun, CVPR2018
Towards End-to-end Text Spotting with Convolutional Recurrent Neural Networks -Hui Li, et al, ICCV2017
Deep TextSpotter: An End-to-End Trainable Scene Text Localization and Recognition Framework -Michal Busta, et al, ICCV2017, code
Reading Text in the Wild with Convolutional Neural Networks -Max Jaderberg, et al, IJCV2016

Other

Scene Text Detection and Recognition: The Deep Learning Era -face++, arxiv2018
Text/non-text image classification in the wild with convolutional neural networks -X Bai, B Shi, C Zhang, X Cai, L Qi, PR2017
Scene text script identification with convolutional recurrent neural networks -J Mei, L Dai, B Shi, X Bai, ICPR2016

Seq2Seq

Convolutional Sequence to Sequence Learning -FAIR, ICML2017
Sequence Level Training with Recurrent Neural Networks -FAIR, ICLR2016
A Convolutional Encoder Model for Neural Machine Translation -FAIR, arxiv2016

Database & Generation

chinese

TRW15: ICDAR 2015 Text Reading in the Wild Competition
RCTW-17: ICDAR2017-Reading Chinese Text in the Wild
STV2k: A New Benchmark for Scene Text Detection and Recognition
CTW: Chinese Text in the Wild
PAL10K
COCO TS Dataset
ICPR MTWI 2018 挑战赛一:网络图像的文本识别

other

Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes -Fangneng Zhan, Shijian Lu, and Chuhui Xue, arxiv2018
Total-Text -1555 images
SCUT-CTW1500 -Curved text in the wild
MLT: Multi-lingual scene text detection and script identification -Multi-lingual text: 18,000 images, 9 different languages representing 6 different scripts
Synthetic Word Dataset, Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition
Total-text: A comprehensive dataset for scene text detection and recognition - -Chee Kheng Ch'ng, Chee Seng Chan
Street View Text(SVT)
IIIT 5k-words
MSRA-TD500
KAIST Scene_Text Database
ICDAR2011, ICDAR2013, ICDAR2015, ICDAR2017, robust reading-Focused Scene Text
ICDAR2017-ICDAR 2017 Robust Reading Challenge on Omnidirectional Video(DOST)
COCO-Text
Google French Street Name Signs (FSNS) dataset
ICDAR2017-ICDAR2017 Competition on Multi-lingual scene text detection and script identification(MLT)
ICDAR2017-Born-Digital Images (Web and Email)
Detecting Curve Text in the Wild: New Dataset and New Solution
Synthetic Word
Synthetic Data for Text Localisation in Natural Images -Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR2016

Competition

ICDAR2017 Competition on Reading Chinese Text in the Wild (RCTW-17) -B Shi, C Yao, M Liao, M Yang, P Xu, L Cui, arxiv2017
ICDAR 2015 competition on robust reading
Incidental Scene Text Understanding: Recent Progresses on ICDAR 2015 Robust Reading Competition Challenge 4 -Cong Yao, Jianan Wu, Xinyu Zhou, Chi Zhang, Shuchang Zhou, Zhimin Cao, Qi Yin

Link

awesome-deep-text-detection-recognition
Awesome-Scene-Text-Recognition

About