PLiang's repositories
CVPR2021-Paper-Code-Interpretation
Collection of CVPR 2021/2020/2019/2018/2017 papers, code, interpretations, and livestreams, compiled by the 极市 team
awesome-causal-vision
A curated list of research papers exploring causality in vision, with links to code where available.
awesome-visual-question-answering
A curated list of Visual Question Answering (VQA) (Image/Video Question Answering), Visual Question Generation, Visual Dialog, Visual Commonsense Reasoning, and related areas.
CSS-VQA
Counterfactual Samples Synthesizing for Robust VQA
DRN
Dense Regression Network for Video Grounding (CVPR2020)
grid-feats-vqa
Grid features pre-training code for visual question answering
hcrn-videoqa
Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)
info-ground
Learning phrase grounding from captioned images through InfoNCE bound on mutual information
LBYLNet
[CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.
maskrcnn-benchmark
Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.
NAFAE
Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Losses"
nlp-beginner
NLP beginner tutorial
nlp-in-python-tutorial
Comparing stand-up comedians using natural language processing
NMTree
Code release for Learning to Assemble Neural Module Tree Networks for Visual Grounding (ICCV 2019)
pytorch-grad-cam
Many Class Activation Map methods implemented in PyTorch for CNNs and Vision Transformers, including Grad-CAM, Grad-CAM++, Score-CAM, Ablation-CAM, and XGrad-CAM
ref-nms
Official codebase for "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding"
ReSC
Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020
SAR
Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment"
sgmn-1
Graph-Structured Referring Expressions Reasoning in The Wild, In CVPR 2020, Oral.
SLT-Net
Official implementation of "Strengthen Learning Tolerance for Weakly Supervised Object Localization", CVPR2021.
SSL-VQA
Code for our IJCAI2020 paper: Overcoming Language Priors with Self-supervised Learning for Visual Question Answering
ThesisUESTC
ThesisUESTC: graduation thesis template for the University of Electronic Science and Technology of China (UESTC)
VideoQADenseCapFrameGate-ACL2020
Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng Tang, Mohit Bansal.
vognet-pytorch
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
vqa-cp-leaderboard
A collection of papers on the VQA-CP datasets and their results
WSSTG
This repository contains the main baselines introduced in WSSTG (ACL 2019).