BowenYang's starred repositories

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment across server, mobile, embedded, and IoT devices)

Language: Python | License: Apache-2.0 | Stargazers: 43211 | Issues: 443 | Issues: 9276

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language: Python | License: Apache-2.0 | Stargazers: 31801 | Issues: 312 | Issues: 916

labelImg

LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.

Language: Python | License: MIT | Stargazers: 22593 | Issues: 403 | Issues: 767

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Advances on Multimodal Large Language Models

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Language: Python | License: Apache-2.0 | Stargazers: 6558 | Issues: 95 | Issues: 680

awesome-satellite-imagery-datasets

🛰️ List of satellite image training datasets with annotations for computer vision and deep learning

towhee

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Language: Python | License: Apache-2.0 | Stargazers: 3188 | Issues: 29 | Issues: 664

a-PyTorch-Tutorial-to-Image-Captioning

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

Language: Python | License: MIT | Stargazers: 2752 | Issues: 25 | Issues: 187

OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Language: Python | License: Apache-2.0 | Stargazers: 2405 | Issues: 21 | Issues: 363

neovis.js

Neo4j + vis.js = neovis.js. Graph visualizations in the browser with data from Neo4j.

Language: TypeScript | License: Apache-2.0 | Stargazers: 1599 | Issues: 43 | Issues: 269

CLUENER2020

CLUENER2020: Chinese Fine-Grained Named Entity Recognition

Scene-Graph-Benchmark.pytorch

A new codebase for popular Scene Graph Generation methods (2020). Visualization and scene graph extraction on custom images and datasets are provided. It is also a PyTorch implementation of the paper “Unbiased Scene Graph Generation from Biased Training” (CVPR 2020).

Language: Jupyter Notebook | License: MIT | Stargazers: 1061 | Issues: 17 | Issues: 203

awesome-image-captioning

A curated list of image captioning and related area resources. :-)

roLabelImg

Label rotated rectangles on images for training

TextFuseNet

A PyTorch implementation of "TextFuseNet: Scene Text Detection with Richer Fused Features".

Language: Python | License: MIT | Stargazers: 475 | Issues: 7 | Issues: 110

scene_graph_benchmark

image scene graph generation benchmark

Language: Python | License: MIT | Stargazers: 387 | Issues: 13 | Issues: 95

MUStARD

Multimodal Sarcasm Detection Dataset

Language: OpenEdge ABL | License: MIT | Stargazers: 304 | Issues: 8 | Issues: 11

POI-Recommendation

Papers, models, and resources about point-of-interest (POI) recommendation.

DOLG-pytorch

Unofficial PyTorch Implementation of "DOLG: Single-Stage Image Retrieval with Deep Orthogonal Fusion of Local and Global Features"

Language: Python | License: MIT | Stargazers: 127 | Issues: 3 | Issues: 2

awesome-few-shot-learning

A review of the latest few-shot learning work

License: MIT | Stargazers: 123 | Issues: 5 | Issues: 0

TIM

(NeurIPS 2020) Transductive Information Maximization for Few-Shot Learning https://arxiv.org/abs/2008.11297

Language: Python | License: MIT | Stargazers: 118 | Issues: 6 | Issues: 14

MOSEI_UMONS

A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis

Language: Python | License: MIT | Stargazers: 117 | Issues: 8 | Issues: 22

ProST

Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval (ICCV 2023 Oral)

Language: Python | License: Apache-2.0 | Stargazers: 90 | Issues: 3 | Issues: 7

Feature-Extractors-for-Video-Steganalysis

To provide the stego community with C/C++ implementations of selected feature extractors mainly targeted at H.264 steganography.

ts2_net

[ECCV2022] A PyTorch implementation of TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval

trojan

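Trojan setup tutorial for circumventing internet censorship: free proxy services and VPN tools, with a beginner-friendly one-click VPN setup guide (latest 2022 tutorial)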

DeeperForensicsChallengeSolution

The solution for the DeeperForensics Challenge 2020

Language: Python | License: MIT | Stargazers: 28 | Issues: 1 | Issues: 2

PDAN

[WACV2021] Implementation of Pyramid Dilated Attention Network (PDAN)