AImageLab's repositories
dress-code
Dress Code: High-Resolution Multi-Category Virtual Try-On. ECCV 2022
LLaVA-MORE
LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning
awesome-human-visual-attention
This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, human visual search.
ReflectiVA
[CVPR 2025] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering
awesome-captioning-evaluation
Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives
LAM
The Ludovico Antonio Muratori (LAM) dataset is the largest line-level HTR dataset to date and contains 25,823 lines from Italian ancient manuscripts edited by a single author over 60 years. The dataset comes in two configurations: a basic splitting and a date-based splitting which takes into account the age of the author. The first setting is intended to study HTR on ancient documents in Italian, while the second focuses on the ability of HTR systems to recognize text written by the same writer in time periods for which training data are not available.
fed-mammoth
General Federated Continual Learning Framework
coldfront
HPC Resource Allocation System
itserr-wp8-latin-embeddings
ITSERR WP8 - Code for Latin embeddings semantic search
open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
pipelines
Pipelines: Versatile, UI-Agnostic OpenAI-Compatible Plugin Framework