A Paper List for Geo-localization Research.
Given a single image of streetview / scenery, Geo-localization Task is to predict the actural location (continent / country / region / city / street / geographic coordinates) of the image.
Retrieval based methods consider Geo-localization as a retrieval task. Generally, treating the input image as a query, the retrieval task is to map the image to the most similar image in a gallery of photos worldwide. Or, we can map the input image to the most similar location in a gallery of locations worldwide.
Classification based methods consider Geo-localization as a classification task. The classification method subdivids the earth’s surface into a high number of geo-cells (in different granularities, continent, country, region, city, street, ...) and assigning each input image to one geo-cell.
With multi-modal learning methods like CLIP (Contrastive Language-Image Pre-training), models can learn the relations between representations of location labels and images, to better predict the accurate location with classification or generation methods.
Title | Venue | Code | Demo |
---|---|---|---|
Image and Object Geo-Localization | IJCV 2023 | - | - |
Title | Venue | Code | Demo |
---|---|---|---|
OpenStreetView-5M: The Many Roads to Global Visual Geolocation | CVPR 2024 | Github | - |
LLMGeo: Benchmarking Large Language Models on Image Geolocation In-the-wild | CVPR 2024 Workshop | - | - |
CityBench: Evaluating the Capabilities of Large Language Model as World Model | arXiv 2024 | - | - |
Title | Link | Source Link |
---|---|---|
OpenStreetView-5M | DownloadLink | |
im2GPS3k | DownloadLink | SourceLink |
YFCC4K | DownloadLink | |
YFCC26K | DownloadLink | SourceLink1 SourceLink2 |