Beast code in Giters

UPCLJ's starred repositories

ST-PlusPlus

[CVPR 2022] ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation

Language:PythonMIT23100

structure_knowledge_distillation

The official code for the paper 'Structured Knowledge Distillation for Semantic Segmentation'. (CVPR 2019 ORAL) and extension to other tasks.

Language:PythonBSD-2-Clause69600

SSKD

[ECCV2020] Knowledge Distillation Meets Self-Supervision

Language:Python23300

frangi3d

Computes vesselness scores for 3-dimensional images.

Language:PythonMIT6900

Pytorch-UNet

PyTorch implementation of the U-Net for image semantic segmentation with high quality images

Language:PythonGPL-3.0882300

Unet-Segmentation-Pytorch-Nest-of-Unets

Implementation of different kinds of Unet Models for Image Segmentation - Unet , RCNN-Unet, Attention Unet, RCNN-Attention Unet, Nested Unet

Language:PythonMIT182600

Swin-Transformer-Object-Detection

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

Language:PythonApache-2.0177200

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language:PythonMIT1346900

video-question-answering

Video Question Answering via Gradually Refined Attention over Appearance and Motion

Language:PythonMIT14400

hcrn-videoqa

Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)

Language:PythonApache-2.012900

SUTD-TrafficQA

[CVPR2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events

Language:JavaScript4900

TVQA

[EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering

Language:PythonMIT16900

mac-network

Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018)

Language:PythonApache-2.049200

asg2cap

Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., CVPR 2020, Oral).

Language:PythonMIT19900

medicat

Dataset of medical images, captions, subfigure-subcaption annotations, and inline textual references

Language:PythonApache-2.011800

CCN

Connective Cognition Network for Directional Visual Commonsense Reasoning

Language:Python1500

SEAM

Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation, CVPR 2020 (Oral)

Language:PythonMIT53900

openvqa

A lightweight, scalable, and general framework for visual question answering research

Language:PythonApache-2.031600

mcan-vqa

Deep Modular Co-Attention Networks for Visual Question Answering

Language:PythonApache-2.043600

EvalAI

:cloud: :rocket: :bar_chart: :chart_with_upwards_trend: Evaluating state of the art in AI

Language:PythonNOASSERTION173600

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Language:PythonMIT868100

vqa.pytorch

Visual Question Answering in Pytorch

Language:Python71100

tallyqacode

Official Code for "TallyQA: Answering Complex Counting Questions" published at AAAI 2018

Language:Python700

TallyQA_dataset

TallyQA: Answering Complex Counting Questions dataset

Apache-2.01900

This repository provides training and evaluation code for paper titled "Polar Loss for Zero-Shot Object Detection." (Arxiv version) and "Improved Visual-Semantic Alignment for Zero-Shot Object Detection" (accepted in AAAI 2020)

Language:PythonMIT11900

Mask_RCNN

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

Language:PythonNOASSERTION2447800

VideoNet_Baseline

Baseline method for VideoNet Competition

Language:Jupyter NotebookMIT3100

py-faster-rcnn

Faster R-CNN (Python implementation) -- see https://github.com/ShaoqingRen/faster_rcnn for the official MATLAB version

Language:PythonNOASSERTION808200

3D-ResNets-PyTorch

3D ResNets for Action Recognition (CVPR 2018)

Language:PythonMIT385600

bottom-up-attention

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

Language:Jupyter NotebookMIT141600