xiaoting (tink2123)

tink2123

Geek Repo

Company:baidu

Location:BeiJing

Github PK Tool:Github PK Tool

xiaoting's starred repositories

LaTeX_OCR_PRO

:art: 数学公式识别增强版:中英文手写印刷公式、支持初级符号推导(数据结构基于 LaTeX 抽象语法树)Math Formula OCR Pro, supports handwrite, Chinese-mixed formulas and simple symbol reasoning (based on LaTeX AST).

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:990Issues:0Issues:0

GPT-4V_OCR

Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)

Language:PythonStargazers:103Issues:0Issues:0

ERNIE-SDK

ERNIE Bot Agent is a Large Language Model (LLM) Agent Framework, powered by the advanced capabilities of ERNIE Bot and the platform resources of Baidu AI Studio.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:299Issues:0Issues:0

LaTeX_OCR

:gem: 数学公式识别 Math Formula OCR

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:463Issues:0Issues:0

LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Language:PythonLicense:MITStargazers:10714Issues:0Issues:0
Stargazers:72Issues:0Issues:0

TUTA_table_understanding

TUTA and ForTaP for Structure-Aware and Numerical-Reasoning-Aware Table Pre-Training

Language:PythonLicense:MITStargazers:91Issues:0Issues:0

synthtiger

Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021

Language:PythonLicense:MITStargazers:419Issues:0Issues:0

benchmarking-chinese-text-recognition

This repository contains datasets and baselines for benchmarking Chinese text recognition.

Language:PythonLicense:MITStargazers:383Issues:0Issues:0

OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Language:PythonLicense:Apache-2.0Stargazers:2320Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:35Issues:0Issues:0

Inpaint-Anything

Inpaint anything using Segment Anything and inpainting models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5454Issues:0Issues:0

Consistency_Regularization_STR

It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022.

Language:PythonLicense:MITStargazers:27Issues:0Issues:0

STR-Fewer-Labels

Scene Text Recognition (STR) methods trained with fewer real labels (CVPR 2021)

Language:Jupyter NotebookLicense:MITStargazers:170Issues:0Issues:0

Style-Transfer-in-Text

Paper List for Style Transfer in Text

Stargazers:1588Issues:0Issues:0

FastDeploy

⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.

Language:C++License:Apache-2.0Stargazers:2695Issues:0Issues:0

trivialaugment

This is the official implementation of TrivialAugment and a mini-library for the application of multiple image augmentation strategies including RandAugment and TrivialAugment.

Language:PythonLicense:MITStargazers:139Issues:0Issues:0

simclr

SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3924Issues:0Issues:0

kornia

Geometric Computer Vision Library for Spatial AI

Language:PythonLicense:Apache-2.0Stargazers:9362Issues:0Issues:0

PaddleHub

Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)

Language:PythonLicense:Apache-2.0Stargazers:12501Issues:0Issues:0

FudanOCR

A toolbox of scene text super-resolution and recognition

Language:PythonStargazers:307Issues:0Issues:0

data-augmentation-review

List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.

Stargazers:1557Issues:0Issues:0

dmfont

Official PyTorch implementation of DM-Font (ECCV 2020)

Language:PythonLicense:MITStargazers:128Issues:0Issues:0

fewshot-font-generation

The unified repository for few-shot font generation methods. This repository includes FUNIT (ICCV'19), DM-Font (ECCV'20), LF-Font (AAAI'21) and MX-Font (ICCV'21).

Language:PythonLicense:NOASSERTIONStargazers:187Issues:0Issues:0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language:PythonLicense:Apache-2.0Stargazers:38331Issues:0Issues:0

OCR_preprocessing_tool

A simple OCR preprocessing tool using Python with a GUI.

Language:PythonLicense:MITStargazers:27Issues:0Issues:0

optlab

OCR pre-processing Toolbox

Language:C++Stargazers:17Issues:0Issues:0

PaddleOCR-AutoHotkey

PaddleOCR AutoHotkey Version. PaddleOCR AHK 版。

Language:AutoHotkeyStargazers:131Issues:0Issues:0

PaddleOCR-Quicker

GUI for PaddleOCR whl based on Quicker

Stargazers:13Issues:0Issues:0