Beast code in Giters

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language: Python | License: CC-BY-4.0 | 1010 14 102

Paint-by-Example

Paint by Example: Exemplar-based Image Editing with Diffusion Models

Language: Python | License: NOASSERTION | 998 22 51

YOLOS

[NeurIPS 2021] You Only Look at One Sequence

Language: Jupyter Notebook | License: MIT | 811 21 29

VideoMAEv2

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Language: Python | License: MIT | 431 6 49

lamar-benchmark

Source code for the ECCV 2022 paper "Benchmarking Localization and Mapping for Augmented Reality".

Language: Python | License: CC-BY-4.0 | 359 27 36

CosPlace

Official code for CVPR 2022 paper "Rethinking Visual Geo-localization for Large-Scale Applications"

Language: Python | License: MIT | 266 7 42

DELG

PyTorch implementation of Unifying Deep Local and Global Features for Image Search (DELG)

Language: Jupyter Notebook | 173 4 21

CVNet

Official PyTorch Implementation of Correlation Verification for Image Retrieval, CVPR 2022 (Oral Presentation)

Language: Python | 166 13 11

deep-visual-geo-localization-benchmark

Official code for CVPR 2022 (Oral) paper "Deep Visual Geo-localization Benchmark"

Language: Python | License: MIT | 161 3 24

TransGeo2022

Official repository for TransGeo: Transformer Is All You Need for Cross-view Image Geo-localization

Language: Python | License: MIT | 98 3 40

mapillary_sls

Mapillary Street-level Sequences Dataset

Language: Jupyter Notebook | License: MIT | 98 13 27

SSHarmonization

[ICCV'2021] "SSH: A Self-Supervised Framework for Image Harmonization", Yifan Jiang, He Zhang, Jianming Zhang, Yilin Wang, Zhe Lin, Kalyan Sunkavalli, Simon Chen, Sohrab Amirghodsi, Sarah Kong, Zhangyang Wang

Language: Jupyter Notebook | License: NOASSERTION | 97 9 11

VIGOR

Official repository for VIGOR: Cross-View Image Geo-localization beyond One-to-one Retrieval

Language: Python | License: MIT | 71 3 12

R2Former

Official repository for R2Former: Unified Retrieval and Reranking Transformer for Place Recognition

Language: Jupyter Notebook | License: Apache-2.0 | 68 2 15

R2Former

Official repository for R2Former: Unified Retrieval and Reranking Transformer for Place Recognition

36 6 2

CrossViewMetricLocalization

ECCV2022: Visual Cross-View Metric Localization with Dense Uncertainty Estimates

Language: Python | License: GPL-3.0 | 32 1 2

Vision-DiffMask

Official PyTorch implementation of Vision DiffMask, a post-hoc interpretation method for vision models.

Language: Jupyter Notebook | License: MIT | 27 4 1

TransVPR-model-implementation

Language: Python | 24 2 8

instructpix2pix-sdxl

Training InstructPix2Pix with SDXL.

Language: Python | 17 1 3

APRILE

APRILE: A Python library for exploring Adverse Polypharmacy Reaction using Intelligent Learner and Explainer

Language: Python | License: MIT | 7 2 2

Street-to-Satellite_Image_Matching

Street-to-Satellite Image Matching thesis at the Intelligent Vehicles Group of the TU Delft.

Language: Jupyter Notebook | License: MIT | 2 10