UPCLJ's starred repositories

ST-PlusPlus

[CVPR 2022] ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation

Language:PythonLicense:MITStargazers:231Issues:0Issues:0

structure_knowledge_distillation

The official code for the paper 'Structured Knowledge Distillation for Semantic Segmentation'. (CVPR 2019 ORAL) and extension to other tasks.

Language:PythonLicense:BSD-2-ClauseStargazers:696Issues:0Issues:0

SSKD

[ECCV2020] Knowledge Distillation Meets Self-Supervision

Language:PythonStargazers:233Issues:0Issues:0

frangi3d

Computes vesselness scores for 3-dimensional images.

Language:PythonLicense:MITStargazers:69Issues:0Issues:0

Pytorch-UNet

PyTorch implementation of the U-Net for image semantic segmentation with high quality images

Language:PythonLicense:GPL-3.0Stargazers:8823Issues:0Issues:0

Unet-Segmentation-Pytorch-Nest-of-Unets

Implementation of different kinds of Unet Models for Image Segmentation - Unet , RCNN-Unet, Attention Unet, RCNN-Attention Unet, Nested Unet

Language:PythonLicense:MITStargazers:1826Issues:0Issues:0

Swin-Transformer-Object-Detection

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

Language:PythonLicense:Apache-2.0Stargazers:1772Issues:0Issues:0

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language:PythonLicense:MITStargazers:13469Issues:0Issues:0

video-question-answering

Video Question Answering via Gradually Refined Attention over Appearance and Motion

Language:PythonLicense:MITStargazers:144Issues:0Issues:0

hcrn-videoqa

Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)

Language:PythonLicense:Apache-2.0Stargazers:129Issues:0Issues:0

SUTD-TrafficQA

[CVPR2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events

Language:JavaScriptStargazers:49Issues:0Issues:0

TVQA

[EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering

Language:PythonLicense:MITStargazers:169Issues:0Issues:0

mac-network

Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018)

Language:PythonLicense:Apache-2.0Stargazers:492Issues:0Issues:0

asg2cap

Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., CVPR 2020, Oral).

Language:PythonLicense:MITStargazers:199Issues:0Issues:0

medicat

Dataset of medical images, captions, subfigure-subcaption annotations, and inline textual references

Language:PythonLicense:Apache-2.0Stargazers:118Issues:0Issues:0

CCN

Connective Cognition Network for Directional Visual Commonsense Reasoning

Language:PythonStargazers:15Issues:0Issues:0

SEAM

Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation, CVPR 2020 (Oral)

Language:PythonLicense:MITStargazers:539Issues:0Issues:0

openvqa

A lightweight, scalable, and general framework for visual question answering research

Language:PythonLicense:Apache-2.0Stargazers:316Issues:0Issues:0

mcan-vqa

Deep Modular Co-Attention Networks for Visual Question Answering

Language:PythonLicense:Apache-2.0Stargazers:436Issues:0Issues:0

EvalAI

:cloud: :rocket: :bar_chart: :chart_with_upwards_trend: Evaluating state of the art in AI

Language:PythonLicense:NOASSERTIONStargazers:1736Issues:0Issues:0

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Language:PythonLicense:MITStargazers:8681Issues:0Issues:0

vqa.pytorch

Visual Question Answering in Pytorch

Language:PythonStargazers:711Issues:0Issues:0

tallyqacode

Official Code for "TallyQA: Answering Complex Counting Questions" published at AAAI 2018

Language:PythonStargazers:7Issues:0Issues:0

TallyQA_dataset

TallyQA: Answering Complex Counting Questions dataset

License:Apache-2.0Stargazers:19Issues:0Issues:0

PL-ZSD_Release

This repository provides training and evaluation code for paper titled "Polar Loss for Zero-Shot Object Detection." (Arxiv version) and "Improved Visual-Semantic Alignment for Zero-Shot Object Detection" (accepted in AAAI 2020)

Language:PythonLicense:MITStargazers:119Issues:0Issues:0

Mask_RCNN

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

Language:PythonLicense:NOASSERTIONStargazers:24478Issues:0Issues:0

VideoNet_Baseline

Baseline method for VideoNet Competition

Language:Jupyter NotebookLicense:MITStargazers:31Issues:0Issues:0

py-faster-rcnn

Faster R-CNN (Python implementation) -- see https://github.com/ShaoqingRen/faster_rcnn for the official MATLAB version

Language:PythonLicense:NOASSERTIONStargazers:8082Issues:0Issues:0

3D-ResNets-PyTorch

3D ResNets for Action Recognition (CVPR 2018)

Language:PythonLicense:MITStargazers:3856Issues:0Issues:0

bottom-up-attention

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

Language:Jupyter NotebookLicense:MITStargazers:1416Issues:0Issues:0