Alberto Baldrati (ABaldrati)

ABaldrati

Geek Repo

Company:University of Florence - MICC, University of Pisa

Location:Florence, Italy

Home Page:https://abaldrati.github.io

Twitter:@A_Baldrati

Github PK Tool:Github PK Tool

Alberto Baldrati's starred repositories

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:Apache-2.0Stargazers:10828Issues:162Issues:190

OutfitAnyone

Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person

Language:Jupyter NotebookLicense:MITStargazers:2845Issues:53Issues:157

mdistiller

The official implementation of [CVPR2022] Decoupled Knowledge Distillation https://arxiv.org/abs/2203.08679 and [ICCV2023] DOT: A Distillation-Oriented Trainer https://openaccess.thecvf.com/content/ICCV2023/papers/Zhao_DOT_A_Distillation-Oriented_Trainer_ICCV_2023_paper.pdf

FRESCO

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:668Issues:10Issues:35

DriveAGI

[Incl. GenAD, CVPR 2024 Highlight] Embracing Foundation Models into Autonomous Agent and System

Language:PythonLicense:Apache-2.0Stargazers:444Issues:25Issues:5

MagicDrive

[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”

Language:PythonLicense:AGPL-3.0Stargazers:420Issues:14Issues:43

Vista

A Generalizable World Model for Autonomous Driving

Language:PythonLicense:Apache-2.0Stargazers:345Issues:18Issues:14

Awesome-World-Model

Collect some World Models for Autonomous Driving papers.

revisitop

Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking

simpool

This repo contains the official implementation of ICCV 2023 paper "Keep It SimPool: Who Said Supervised Transformers Suffer from Attention Deficit?"

Language:PythonLicense:Apache-2.0Stargazers:92Issues:2Issues:1

PromptAlign

[NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization

lincir

Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)

Language:PythonLicense:NOASSERTIONStargazers:79Issues:7Issues:12

SPRC

【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval

rscir

Official PyTorch implementation and benchmark dataset for IGARSS 2024 ORAL paper: "Composed Image Retrieval for Remote Sensing"

Language:PythonLicense:Apache-2.0Stargazers:47Issues:0Issues:0

TPD

This is the official repository for the paper "Texture-Preserving Diffusion Models for High-Fidelity Virtual Try-On". CVPR 2024

Language:PythonStargazers:46Issues:0Issues:0

DiffAssemble

Official repository for "DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly" accepted at CVPR2024

context-i2w

Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]

Language:ShellLicense:Apache-2.0Stargazers:31Issues:2Issues:7

Vision_by_Language

[ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"

composed-video-retrieval

Composed Video Retrieval

Language:PythonLicense:Apache-2.0Stargazers:29Issues:2Issues:3

QualiCLIP

Quality-Aware Image-Text Alignment for Real-World Image Quality Assessment

License:NOASSERTIONStargazers:24Issues:7Issues:0

Bi-Blip4CIR

The official implementation for BLIP4CIR with bi-directional training | Bi-directional Training for Composed Image Retrieval via Text Prompt Learning (WACV 2024)

Language:PythonLicense:MITStargazers:22Issues:3Issues:1

FG-OVD

[CVPR2024] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding."

Language:PythonLicense:MITStargazers:18Issues:1Issues:0

Land-Diffuser

The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation from raw audio inputs.

Candidate-Reranking-CIR

The official implementation for Candidate Set Re-ranking for Composed Image Retrieval (TMLR) 01/2024

Language:PythonLicense:MITStargazers:9Issues:1Issues:1

iamcl2r

[CVPR 2024 Highlight] - Stationary Representations: Optimally Approximating Compatibility and Implications for Improved Model Replacements (notable top 2.8%)

Language:Jupyter NotebookLicense:MITStargazers:8Issues:5Issues:0
Language:PythonLicense:Apache-2.0Stargazers:5Issues:2Issues:1

concon-chi_benchmark

Repository to host the code associated to the CVPR 2024 paper "ConCon-Chi: Concept-Context Chimera Benchmark for Personalized Vision-Language Tasks"

Language:PythonLicense:BSD-3-ClauseStargazers:5Issues:4Issues:0

SULAND-Dataset

Dataset for Surface Landmine detection. Videos are taken in Italy (Faculty of Engineering, Florence) and USA (Franklyn and Marshal college, Philadelphia).

Language:PythonLicense:NOASSERTIONStargazers:2Issues:0Issues:0