ustczhouyu's repositories

Language:PythonStargazers:0Issues:0Issues:0

AutoSTR

H. Zhang, Q. Yao, M. Yang, Y. Xu, X. Bai. AutoSTR: Efficient Backbone Search for Scene Text Recognition. European Conference on Computer Vision (ECCV). 2020.

Language:PythonStargazers:0Issues:0Issues:0

Bi-STET

Implementation of Bidirectional Scene Text Recognition with a Single Decoder

Language:PythonStargazers:0Issues:1Issues:0

contour

Learning Panoptic Segmentation from Instance Contours

Language:PythonStargazers:0Issues:1Issues:0
Stargazers:0Issues:2Issues:0

cropping-design-constraints

codes for evaluating image cropping under design constraints

Stargazers:0Issues:0Issues:0

D2Det

D2Det, CVPR2020

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0

gencrop

Code for Learning Subject-Aware Cropping by Outpainting Professional Photos

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

ICDAR-2019-SROIE

ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0

InterpAny-Clearer

Clearer anytime frame interpolation & Manipulated interpolation of anything

License:MITStargazers:0Issues:0Issues:0

kvasir-seg

2020 MediaEval Medico Challenge: Polyp Segmentation

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

Latte

Latte: Latent Diffusion Transformer for Video Generation.

License:Apache-2.0Stargazers:0Issues:0Issues:0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

line-art-colorization

Simplified alacgan-based line art colorizer

Stargazers:0Issues:0Issues:0

lineArt

A sketch like(?) effect for images

License:MITStargazers:0Issues:0Issues:0

nvist_official

(CVPR 2024) NViST: In the wild New View Synthesis from a Single Image with Transformers

License:MITStargazers:0Issues:0Issues:0

OCR-Corrector

利用语言模型,纠正OCR识别错误

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

PaddleOCR

OCR toolkit based on PaddlePaddle (基于飞桨的OCR工具库,包含总模型仅8.6M的超轻量级中文OCR,同时支持多种文本检测、文本识别的训练算法。)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

pan-pytorch

This is an unofficial PyTorch re-implementation of paper "Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network" published in ICCV 2019.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

RethinkingImageCropping-Paddle

PaddlePaddle Implementation of "Rethinking Image Cropping: Exploring Diverse Compositions from Global Views"

Stargazers:0Issues:0Issues:0

S2CNet

Official PyTorch implementation of the “Spatial-Semantic Collaborative Cropping for User Generated Content”. (AAAI24)

Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

SRN.pytorch

Unofficial PyTorch implementation of Towards Accurate Scene Text Recognition with Semantic Reasoning Networks

Language:PythonStargazers:0Issues:1Issues:0

stable-diffusion-reference-only

img2img version of stable diffusion. Anime Character Remix. Line Art Automatic Coloring. Style Transfer.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

StraightToThePoint_CVPR_2020

Original PyTorch implementation of the code for the paper "Straight to the Point: Fast-forwarding Videos via Reinforcement Learning Using Textual Data" at the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020

License:GPL-3.0Stargazers:0Issues:0Issues:0

text-recognition

Pytorch for image-based sequence recognition tasks, such as scene text recognition and OCR.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

Transformer_STR

PyTorch implementation of my new method for Scene Text Recognition (STR) based on Transformer,Equipped with Transformer, this method outperforms the best model of the aforementioned deep-text-recognition-benchmark by 7.6% on CUTE80.

Language:PythonStargazers:0Issues:1Issues:0

VideoTetris

VideoTetris: Towards Compositional Text-To-Video Generation

Stargazers:0Issues:0Issues:0