marcellacornia

followers

following

stars

AImageLab, University of Modena and Reggio Emilia

Modena, Italy

Organizations

aimagelab

Marcella Cornia's starred repositories

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookMIT22963 315 385

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookBSD-3-Clause9018 95 617

mdetr

Language:PythonApache-2.0952 19 96

ActivityNet-Entities

A Dataset for Grounded Video Description

Language:PythonNOASSERTION157 18 9

pacscore

Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation. CVPR 2023

Language:Python48 6 6

MLNet-Pytorch

Implementation of A Deep Multi-Level Network for Saliency Prediction in Pytorch

Language:Jupyter NotebookApache-2.030 4 4

awesome-human-visual-attention

This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, human visual search.

2100

DynamicConv-agent

PyTorch code for BMVC 2019 paper: Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters

Language:C++MIT21 40

perceive-transform-and-act

PyTorch code for the paper: "Perceive, Transform, and Act: Multi-Modal Attention Networks for Vision-and-Language Navigation"

Language:C++MIT19 4 1

MaPeT

Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training

Language:Python15 6 2

LoCoNav

Language:PythonMIT13 5 1