BowenYang's starred repositories

towhee

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Language:PythonLicense:Apache-2.0Stargazers:3021Issues:0Issues:0

ProST

Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval --ICCV2023 Oral

Language:PythonLicense:Apache-2.0Stargazers:83Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.

Stargazers:9585Issues:0Issues:0

trojan

科学上网/翻墙梯子/自由上网/trojan 搭建教程 免费机场、VPN工具 小白科学上网一键搭建VPN梯子最新2022教程

Stargazers:36Issues:0Issues:0

ts2_net

[ECCV2022] A pytorch implementation for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval

Language:PythonStargazers:72Issues:0Issues:0

OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Language:PythonLicense:Apache-2.0Stargazers:2340Issues:0Issues:0

a-PyTorch-Tutorial-to-Image-Captioning

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

Language:PythonLicense:MITStargazers:2675Issues:0Issues:0

awesome-image-captioning

A curated list of image captioning and related area resources. :-)

Stargazers:1046Issues:0Issues:0

Feature-Extractors-for-Video-Steganalysis

To provide the stego community with C/C++ implementations of selected feature extractors mainly targeted at H.264 steganography.

Stargazers:80Issues:0Issues:0

CLUENER2020

CLUENER2020 中文细粒度命名实体识别 Fine Grained Named Entity Recognition

Language:PythonStargazers:1406Issues:0Issues:0

POI-Recommendation

Papers and resources about POI recommendation. | 兴趣点推荐相关论文、模型和资源。

Stargazers:198Issues:0Issues:0

DeeperForensicsChallengeSolution

The solution for the DeeperForensics Challenge 2020

Language:PythonLicense:MITStargazers:28Issues:0Issues:0

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonLicense:Apache-2.0Stargazers:30126Issues:0Issues:0

DOLG-pytorch

Unofficial PyTorch Implementation of "DOLG: Single-Stage Image Retrieval with Deep Orthogonal Fusion of Local and Global Features"

Language:PythonLicense:MITStargazers:127Issues:0Issues:0

MUStARD

Multimodal Sarcasm Detection Dataset

Language:OpenEdge ABLLicense:MITStargazers:284Issues:0Issues:0

MOSEI_UMONS

A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis

Language:PythonLicense:MITStargazers:112Issues:0Issues:0

Scene-Graph-Benchmark.pytorch

A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper “Unbiased Scene Graph Generation from Biased Training CVPR 2020”

Language:Jupyter NotebookLicense:MITStargazers:1019Issues:0Issues:0

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Language:PythonLicense:Apache-2.0Stargazers:6320Issues:0Issues:0

PDAN

[WACV2021] Implementation of Pyramid Dilated Attention Network (PDAN)

Language:PythonStargazers:18Issues:0Issues:0

scene_graph_benchmark

image scene graph generation benchmark

Language:PythonLicense:MITStargazers:376Issues:0Issues:0

VinVL

project page for VinVL

Stargazers:345Issues:0Issues:0

roLabelImg

Label Rotated Rect On Images for training

Language:PythonStargazers:778Issues:0Issues:0

TextFuseNet

A PyTorch implementation of "TextFuseNet: Scene Text Detection with Richer Fused Features".

Language:PythonLicense:MITStargazers:464Issues:0Issues:0

neovis.js

Neo4j + vis.js = neovis.js. Graph visualizations in the browser with data from Neo4j.

Language:TypeScriptLicense:Apache-2.0Stargazers:1519Issues:0Issues:0

labelImg

LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.

Language:PythonLicense:MITStargazers:21947Issues:0Issues:0

awesome-few-shot-learning

A review for latest few-shot learning works

License:MITStargazers:123Issues:0Issues:0

TIM

(NeurIPS 2020) Transductive Information Maximization for Few-Shot Learning https://arxiv.org/abs/2008.11297

Language:PythonLicense:MITStargazers:116Issues:0Issues:0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language:PythonLicense:Apache-2.0Stargazers:39146Issues:0Issues:0

awesome-satellite-imagery-datasets

🛰️ List of satellite image training datasets with annotations for computer vision and deep learning

License:MITStargazers:3504Issues:0Issues:0