BowenYang's starred repositories
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
a-PyTorch-Tutorial-to-Image-Captioning
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
awesome-image-captioning
A curated list of image captioning and related area resources. :-)
Feature-Extractors-for-Video-Steganalysis
To provide the stego community with C/C++ implementations of selected feature extractors mainly targeted at H.264 steganography.
CLUENER2020
CLUENER2020 中文细粒度命名实体识别 Fine Grained Named Entity Recognition
POI-Recommendation
Papers and resources about POI recommendation. | 兴趣点推荐相关论文、模型和资源。
DeeperForensicsChallengeSolution
The solution for the DeeperForensics Challenge 2020
pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
DOLG-pytorch
Unofficial PyTorch Implementation of "DOLG: Single-Stage Image Retrieval with Deep Orthogonal Fusion of Local and Global Features"
MOSEI_UMONS
A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis
Scene-Graph-Benchmark.pytorch
A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper “Unbiased Scene Graph Generation from Biased Training CVPR 2020”
scene_graph_benchmark
image scene graph generation benchmark
roLabelImg
Label Rotated Rect On Images for training
TextFuseNet
A PyTorch implementation of "TextFuseNet: Scene Text Detection with Richer Fused Features".
labelImg
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.
awesome-few-shot-learning
A review for latest few-shot learning works
awesome-satellite-imagery-datasets
🛰️ List of satellite image training datasets with annotations for computer vision and deep learning