Ioanna Ntinou's repositories
ACAR-Net
Actor-Context-Actor Relation Network for Spatio-temporal Action Localization
AlphAction
Spatio-Temporal Action Localization System
awesome-action-recognition
A curated list of action recognition and related area resources
AWESOME-MER
π π A reading list focused on Multimodal Emotion Recognition (MER) ππ π π¬
Awesome-Open-Vocabulary-Object-Detection
A curated list of papers, datasets and resources pertaining to open vocabulary object detection.
Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Detectron
FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.
Enjoy-Hamburger
[ICLR 2021] Is Attention Better Than Matrix Decomposition?
classifier-balancing
This repository contains code for the paper "Decoupling Representation and Classifier for Long-Tailed Recognition", published at ICLR 2020
Dassl.pytorch
A PyTorch toolbox for domain generalization, domain adaptation and semi-supervised learning.
detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
EmoCommonSense
Official Repository for VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning
epic-kitchens-download-scripts
Download scripts for EPIC-KITCHENS
EVAD
[ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement
facenet-pytorch
Pretrained Pytorch face detection (MTCNN) and recognition (InceptionResnet) models
linear-attention-transformer
Transformer based on a variant of attention that is linear complexity in respect to sequence length
MeMViT
Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022
mixup-cifar10
mixup: Beyond Empirical Risk Minimization
mmaction
An open-source toolbox for action understanding based on PyTorch
Non-local_pytorch
Implementation of Non-local Block.
pytorch-cifar
95.47% on CIFAR10 with PyTorch
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
temporal-shift-module
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
video-long-term-feature-banks
Long-Term Feature Banks for Detailed Video Understanding