Img2Loc, a novel system that redefines image geolocalization as a text generation task using cutting-edge large multi-modality models (LMMs) and retrieval-augmented generation, significantly outperforms previous state-of-the-art methods without any model training.
A novel deep multi-task approach that achieves state-of-the-art performance on a wide range of image geolocalization benchmarks while being robust to distribution shifts.