Tom's repositories
Fast-Chinese-OCR
该项目采用最前沿的AI算法,针对合同扫描文档进行识别和抽取。
ocr_detection_psenet
Text detection using psenet
QT-style-template
QT-style-template
2019-CCF-BDCI-OCR-MCZJ-fake_data_generator
2019CCF-BDCI大赛 OCR赛题第一名 天晨破晓团队 仿真数据生成方案源码
chineseocr_lite
超轻量级中文ocr,支持竖排文字识别, 支持ncnn推理 , dbnet(1.7M) + crnn(6.3M) + anglenet(1.5M) 总模型仅10M
coco2voc
Yet another coco2voc.py, convert MSCOCO json to PASCAL VOC xml in Object Detection.
ControlNet
Let us control diffusion models
DAVAR-Lab-OCR
The implementations of some works from Davar-Lab. Currently we have the code of Text Perceptron (AAAI 2020). Some works' code will be published soon, including YORO (ACMMM 2019) , TRIE (ACMMM2020), FREE(TIP 2020), SPIN (AAAI 2021), MANGO (AAAI2021), etc.
diff-pdf
Compare PDF documents using PDF Miner and print out the differences as HTML documents
fasterAPI
GIT LEARN DJANGO
InvoiceNet
Deep neural network to extract intelligent information from invoice documents.
keras-docs-zh
Chinese (zh-cn) translation of the Keras documentation.
ocr
ocr system
PDF-Resume-Information-Extraction
天池比赛作品整理。实现从pdf中提取出姓名、出生年月、性别、电话、最高学历、籍贯、落户市县、政治面貌、毕业院校、工作单位、工作内容、职务、项目名称、项目责任、学位、毕业时间、工作时间、项目时间共18个字段。
pdfannots
Extracts and formats text annotations from a PDF file
PSENet-tf2
PSEnet tf2.0 reimplementation for better training and inference and ResneSt/Mobilenet/ tensorflow2 implement/ Top 6 model in MTWI 2018 Text Detection
Table-OCR
Recognize tables from images and restore them into word.
TensorflowDeployWithCpp
Using c++ load tensorflow2.0 saved_model and do inference.
TextRecognitionDataGenerator
A synthetic data generator for text recognition
TF2-albert-NER
wrapping albert via bert-for-tf2, implementing NER task
tr
Free Offline OCR 离线的文本识别SDK