Sangmin Woo's repositories
awesome-vision-and-language
A curated list of awesome vision and language resources (still under construction... stay tuned!)
Depth_from_Focus
Conventional Depth from Focus(DfF) estimation with slight focus variations in image sequences
Explore-And-Match
Official pytorch implementation of "Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding in Videos"
RecycleNet
Attentional Learning of Trash Classification
Temporal-Span-Proposal-Network-VidVRD
What and When to look?: Temporal Span Proposal Network for Video Relation Detection
Local-to-Global-Interaction-Networks-SGG
[TNNLS 2022] Official pytorch implementation of "Tackling the Challenges in Scene Graph Generation with Local-to-Global Interactions"
Cost-Out-Multitask-Learning
[Electronics] Revisiting Dropout: Escaping Pressure for Training Neural Networks with Multiple Costs
AdaFocus
Reducing spatial redundancy in video recognition. SOTA computational efficiency.
ai-deadlines
:alarm_clock: AI conference deadline countdowns
arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
awesome-semantic-segmentation-pytorch
Semantic Segmentation on PyTorch (include FCN, PSPNet, Deeplabv3, Deeplabv3+, DANet, DenseASPP, BiSeNet, EncNet, DUNet, ICNet, ENet, OCNet, CCNet, PSANet, CGNet, ESPNet, LEDNet, DFANet)
flops-counter.pytorch
Flops counter for convolutional networks in pytorch framework
iPerceive
Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering | Python3 | PyTorch | CNNs | Causality | Reasoning | LSTMs | Transformers | Multi-Head Self Attention | Published in IEEE Winter Conference on Applications of Computer Vision (WACV) 2021
meshrcnn
code for Mesh R-CNN, ICCV 2019
mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
MultiMAE
MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022
NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
PlotNeuralNet
Latex code for making neural networks diagrams
sangminwoo.github.io
sangminwoo.github.io
SimCLR
PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
STTran
Spatial-Temporal Transformer for Dynamic Scene Graph Generation, ICCV2021
TimeSformer-pytorch
Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
vissl
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.