aukhan's repositories
WSG-VQA-VLTransformers
Weakly Supervised Grounding for VQA in Vision-Language Transformers
utility-matlab-functions
This repository contains reusable functions I wrote for basic computer vision tasks; some are also useful for general-purpose work.
mobile-semantic-segmentation
Real-Time Semantic Segmentation on Mobile Devices
aishaurooj.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
clevr-dataset-gen
A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
cross-view-image-synthesis
Cross-View Image Synthesis using Conditional GANs
densecap
Dense image captioning in Torch
hello-world
Test repository
mac-network
Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018)
mac-network-pytorch
Memory, Attention and Composition (MAC) Network for CLEVR implemented in PyTorch
MultiGrounding
Repository for multi-level textual grounding
pytorch-c3d
PyTorch implementations of the C3D and R2Plus1D models for video action recognition.
Segmenting-Sky-Pixels-in-Images
Scripts and other code used in Segmenting Sky Pixels in Images
video-classification-3d-cnn-pytorch
Video classification tools using 3D ResNet