AImageLab (aimagelab)

AImageLab

aimagelab

Organization data from Github https://github.com/aimagelab

Location:Modena, Italy

Home Page:aimagelab.ing.unimore.it

GitHub:@aimagelab

AImageLab's repositories

mammoth

An Extendible (General) Continual Learning Framework based on Pytorch - official codebase of Dark Experience for General Continual Learning

Language:PythonLicense:MITStargazers:656Issues:13Issues:44

dress-code

Dress Code: High-Resolution Multi-Category Virtual Try-On. ECCV 2022

Language:PythonLicense:NOASSERTIONStargazers:604Issues:17Issues:35

LLaVA-MORE

LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning

Language:PythonLicense:Apache-2.0Stargazers:125Issues:6Issues:9
Language:PythonLicense:MITStargazers:80Issues:3Issues:28

pacscore

[CVPR 2023] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation

mil4wsi

DAS-MIL: Distilling Across Scales for MILClassification of Histological WSIs

Language:PythonLicense:MITStargazers:56Issues:6Issues:11

awesome-human-visual-attention

This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, human visual search.

CoDE

[ECCV'24] Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities

Language:PythonLicense:MITStargazers:34Issues:3Issues:3
Language:PythonLicense:NOASSERTIONStargazers:25Issues:4Issues:2

ReflectiVA

[CVPR 2025] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering

Language:PythonLicense:Apache-2.0Stargazers:21Issues:4Issues:1

DiCO

[BMVC 2024 Oral ✨] Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization

Language:PythonStargazers:17Issues:2Issues:0

MaPeT

Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training

ReT

[CVPR 2025] Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval

Language:PythonLicense:Apache-2.0Stargazers:14Issues:0Issues:0

HySAC

Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025

Language:PythonStargazers:12Issues:0Issues:0
Language:Jupyter NotebookLicense:NOASSERTIONStargazers:11Issues:1Issues:1

awesome-captioning-evaluation

Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives

Stargazers:6Issues:0Issues:0

COGT

[ICLR'25] Causal Graphical Models for Vision-Language Compositional Understanding

Language:PythonStargazers:6Issues:0Issues:0

LAM

The Ludovico Antonio Muratori (LAM) dataset is the largest line-level HTR dataset to date and contains 25,823 lines from Italian ancient manuscripts edited by a single author over 60 years. The dataset comes in two configurations: a basic splitting and a date-based splitting which takes into account the age of the author. The first setting is intended to study HTR on ancient documents in Italian, while the second focuses on the ability of HTR systems to recognize text written by the same writer in time periods for which training data are not available.

Language:PythonLicense:MITStargazers:4Issues:3Issues:1
Language:HTMLLicense:Apache-2.0Stargazers:4Issues:2Issues:0

fed-mammoth

General Federated Continual Learning Framework

Language:HTMLLicense:Apache-2.0Stargazers:2Issues:0Issues:0
Language:JavaScriptStargazers:0Issues:1Issues:0

coldfront

HPC Resource Allocation System

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

itserr-wp8-latin-embeddings

ITSERR WP8 - Code for Latin embeddings semantic search

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

pipelines

Pipelines: Versatile, UI-Agnostic OpenAI-Compatible Plugin Framework

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0