IoannaNti

followers

following

stars

@QMUL

Cambridge, UK

Ioanna Ntinou's repositories

BMViT

Apache-2.03 1 1

ACAR-Net

Actor-Context-Actor Relation Network for Spatio-temporal Action Localization

MIT000

AlphAction

Spatio-Temporal Action Localization System

Language:Python000

awesome-action-recognition

A curated list of action recognition and related area resources

000

AWESOME-MER

🔆 📝 A reading list focused on Multimodal Emotion Recognition (MER) 👂👄 👀 💬

MIT000

Awesome-Open-Vocabulary-Object-Detection

A curated list of papers, datasets and resources pertaining to open vocabulary object detection.

000

Awesome-Transformer-Attention

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

000

Detectron

FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

Language:PythonApache-2.0000

Enjoy-Hamburger

[ICLR 2021] Is Attention Better Than Matrix Decomposition?

Language:PythonGPL-3.0000

classifier-balancing

This repository contains code for the paper "Decoupling Representation and Classifier for Long-Tailed Recognition", published at ICLR 2020

Language:PythonNOASSERTION000

CondensedMovies

Language:Python000

Dassl.pytorch

A PyTorch toolbox for domain generalization, domain adaptation and semi-supervised learning.

MIT000

detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Language:PythonApache-2.0000

EmoCommonSense

Official Repository for VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning

000

epic-kitchens-download-scripts

Download scripts for EPIC-KITCHENS

000

EVAD

[ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement

Language:PythonNOASSERTION000

facenet-pytorch

Pretrained Pytorch face detection (MTCNN) and recognition (InceptionResnet) models

Language:PythonMIT000

lasp

MIT000

linear-attention-transformer

Transformer based on a variant of attention that is linear complexity in respect to sequence length

MIT000

MeMViT

Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022

Language:PythonNOASSERTION000

mixup-cifar10

mixup: Beyond Empirical Risk Minimization

Language:PythonNOASSERTION000

ml-afv

Language:PythonNOASSERTION000

mmaction

An open-source toolbox for action understanding based on PyTorch

Language:PythonApache-2.0000

Non-local_pytorch

Implementation of Non-local Block.

Language:PythonApache-2.0000

pytorch-cifar

95.47% on CIFAR10 with PyTorch

Language:PythonMIT000

pytorch-image-models

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

Apache-2.0000

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Language:PythonApache-2.0000

temporal-shift-module

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

Language:PythonApache-2.0000

video-long-term-feature-banks

Long-Term Feature Banks for Detailed Video Understanding

Language:PythonApache-2.0000

website

000