ltp1995's repositories
GCAGC-CVPR2020
Co-saliency detection, GCAGC, CVPR2020, GCAGC-Inst, TMM2021. Adaptive Graph Convolutional Network with Attention Graph Clustering for Co-saliency Detection
Semantic-Cosegmentation
The codes for the ECCV16 paper: "Semantic Co-segmentation in Videos", Y.-H. Tsai*, G.Zhong* and M.-H. Yang.
video_features
Extract video features from raw videos using multiple GPUs. We support RAFT and PWC flow frames as well as I3D, R(2+1)D, VGGish, ResNet features.
Language:PythonGPL-3.0000
xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
Language:PythonNOASSERTION000