Alberto Baldrati (ABaldrati)

ABaldrati

Geek Repo

Company:University of Florence - MICC, University of Pisa

Location:Florence, Italy

Home Page:https://abaldrati.github.io

Twitter:@A_Baldrati

Github PK Tool:Github PK Tool

Alberto Baldrati's starred repositories

composed-video-retrieval

Composed Video Retrieval

Language:PythonLicense:Apache-2.0Stargazers:40Issues:0Issues:0

context-i2w

Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]

Language:ShellLicense:Apache-2.0Stargazers:36Issues:0Issues:0

iamcl2r

[CVPR 2024 Highlight] - Stationary Representations: Optimally Approximating Compatibility and Implications for Improved Model Replacements (notable top 2.8%)

Language:Jupyter NotebookLicense:MITStargazers:8Issues:0Issues:0

QualiCLIP

Quality-Aware Image-Text Alignment for Real-World Image Quality Assessment

Language:PythonLicense:NOASSERTIONStargazers:29Issues:0Issues:0

DiffAssemble

Official repository for "DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly" accepted at CVPR2024

Language:PythonStargazers:56Issues:0Issues:0

Vision_by_Language

[ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"

Language:PythonLicense:MITStargazers:36Issues:0Issues:0

revisitop

Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking

Language:PythonStargazers:249Issues:0Issues:0

SPRC

【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval

Language:PythonStargazers:57Issues:0Issues:0

simpool

This repo contains the official implementation of ICCV 2023 paper "Keep It SimPool: Who Said Supervised Transformers Suffer from Attention Deficit?"

Language:PythonLicense:Apache-2.0Stargazers:93Issues:0Issues:0

Bi-Blip4CIR

The official implementation for BLIP4CIR with bi-directional training | Bi-directional Training for Composed Image Retrieval via Text Prompt Learning (WACV 2024)

Language:PythonLicense:MITStargazers:23Issues:0Issues:0

SULAND-Dataset

Dataset for Surface Landmine detection. Videos are taken in Italy (Faculty of Engineering, Florence) and USA (Franklyn and Marshal college, Philadelphia).

Language:PythonLicense:NOASSERTIONStargazers:4Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:5Issues:0Issues:0

Land-Diffuser

The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation from raw audio inputs.

Language:PythonStargazers:11Issues:0Issues:0

mdistiller

The official implementation of [CVPR2022] Decoupled Knowledge Distillation https://arxiv.org/abs/2203.08679 and [ICCV2023] DOT: A Distillation-Oriented Trainer https://openaccess.thecvf.com/content/ICCV2023/papers/Zhao_DOT_A_Distillation-Oriented_Trainer_ICCV_2023_paper.pdf

Language:PythonStargazers:777Issues:0Issues:0

OutfitAnyone

Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person

Stargazers:5473Issues:0Issues:0

lincir

Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)

Language:PythonLicense:NOASSERTIONStargazers:92Issues:0Issues:0

PromptAlign

[NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization

Language:PythonStargazers:89Issues:0Issues:0

safe-clip

Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024

Language:PythonStargazers:31Issues:0Issues:0

sugar-crepe

[NeurIPS 2023] A faithful benchmark for vision-language compositionality

Language:PythonLicense:MITStargazers:66Issues:0Issues:0

CIRPLANT

Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models

Language:PythonLicense:MITStargazers:36Issues:0Issues:0

FriendsDontLetFriends

Friends don't let friends make certain types of data visualization - What are they and why are they bad.

Language:RLicense:MITStargazers:6252Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:8Issues:0Issues:0
Language:PythonStargazers:142Issues:0Issues:0

genecis

Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"

Language:PythonLicense:NOASSERTIONStargazers:53Issues:0Issues:0

poincare-resnet

Repository containing the code used for running the experiments of the Poincare ResNet paper

Language:PythonLicense:Apache-2.0Stargazers:19Issues:0Issues:0
Language:PythonStargazers:15Issues:0Issues:0

hyperbolic_learning_library

An extension of the PyTorch library containing various tools for performing deep learning in hyperbolic space.

Language:PythonLicense:MITStargazers:127Issues:0Issues:0

ARNIQA

[WACV 2024 Oral] - ARNIQA: Learning Distortion Manifold for Image Quality Assessment

Language:PythonLicense:NOASSERTIONStargazers:83Issues:0Issues:0

TAPE

[WACV 2024] - Reference-based Restoration of Digitized Analog Videotapes

Language:PythonLicense:NOASSERTIONStargazers:39Issues:0Issues:0

CoVR

Official PyTorch implementation of the paper "CoVR: Learning Composed Video Retrieval from Web Video Captions".

Language:PythonLicense:MITStargazers:78Issues:0Issues:0