BowenYang's starred repositories
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment across server, mobile, embedded, and IoT devices)
pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
labelImg
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.
Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
awesome-satellite-imagery-datasets
🛰️ List of satellite image training datasets with annotations for computer vision and deep learning
a-PyTorch-Tutorial-to-Image-Captioning
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
CLUENER2020
CLUENER2020: Chinese Fine-Grained Named Entity Recognition
Scene-Graph-Benchmark.pytorch
A new codebase for popular Scene Graph Generation methods (2020). Visualization and Scene Graph Extraction on custom images/datasets are provided. It is also a PyTorch implementation of the paper “Unbiased Scene Graph Generation from Biased Training” (CVPR 2020).
awesome-image-captioning
A curated list of image captioning and related area resources. :-)
roLabelImg
Label rotated rectangles on images for training
TextFuseNet
A PyTorch implementation of "TextFuseNet: Scene Text Detection with Richer Fused Features".
scene_graph_benchmark
image scene graph generation benchmark
POI-Recommendation
Papers, models, and resources about POI (point-of-interest) recommendation.
DOLG-pytorch
Unofficial PyTorch Implementation of "DOLG: Single-Stage Image Retrieval with Deep Orthogonal Fusion of Local and Global Features"
awesome-few-shot-learning
A review of the latest few-shot learning works
MOSEI_UMONS
A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis
Feature-Extractors-for-Video-Steganalysis
To provide the stego community with C/C++ implementations of selected feature extractors mainly targeted at H.264 steganography.
DeeperForensicsChallengeSolution
The solution for the DeeperForensics Challenge 2020