S.X.Zhang's repositories
TextBPN-Plus-Plus
Arbitrary Shape Text Detection via Boundary Transformer. The paper is available at https://arxiv.org/abs/2205.05320 and has been accepted by IEEE Transactions on Multimedia (T-MM 2023).
Focal-loss
A TensorFlow implementation of focal loss from "Focal Loss for Dense Object Detection": https://arxiv.org/abs/1708.02002
AnalysisEEG
2020 Graduate Mathematical Contest in Modeling, Problem C: EEG (brainwave) analysis (code and data)
TaggingTool
An annotation tool for object detection and text detection. It supports both image and video files and runs on Windows only. labelMe, Tagging, Annotation.
TextFormat_To_cocoJson
Converts detection txt files to COCO JSON format
python-Interface-Cpp
Interfacing Python code with C++; supports Python 3
MultimodalOCR
On the Hidden Mystery of OCR in Large Multimodal Models (Evaluation Pipeline)
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities