Img2Loc: Leveraging Multi-Modality Foundation Models and Retrieval-Augmented Generation for Accurate Image Geolocalization
Img2Loc is a system that recasts image geolocalization as a text generation task. By combining large multi-modality models (LMMs) with retrieval-augmented generation, it significantly outperforms previous state-of-the-art methods without any model training.
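To make the idea of geolocalization as retrieval-augmented text generation concrete, the following is a minimal sketch and not the Img2Loc implementation. It assumes a small gallery of geotagged reference images that have already been embedded as vectors. The toy three-dimensional vectors, the prompt wording, and the helper names `retrieve`, `build_prompt`, and `parse_coordinates` are all hypothetical. The call to the LMM itself is left out, since the summary above does not specify a model or API.

```python
import math
import re


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def retrieve(query_vec, gallery, k=2):
    """Return the k gallery entries most similar to the query embedding."""
    ranked = sorted(gallery, key=lambda item: cosine(query_vec, item["vec"]), reverse=True)
    return ranked[:k]


def build_prompt(neighbors):
    """Turn retrieved coordinates into text context for a multi-modality model."""
    lines = [
        "Estimate the latitude and longitude of the attached image.",
        "Visually similar reference images were taken at:",
    ]
    lines += [f"- ({n['lat']:.4f}, {n['lon']:.4f})" for n in neighbors]
    lines.append("Answer in the form (latitude, longitude).")
    return "\n".join(lines)


def parse_coordinates(text):
    """Extract the first '(lat, lon)' pair from generated text, or None."""
    m = re.search(r"\(\s*(-?\d+(?:\.\d+)?)\s*,\s*(-?\d+(?:\.\d+)?)\s*\)", text)
    return (float(m.group(1)), float(m.group(2))) if m else None


# Toy gallery: hypothetical embeddings paired with known coordinates.
gallery = [
    {"vec": [0.9, 0.1, 0.0], "lat": 48.8584, "lon": 2.2945},
    {"vec": [0.8, 0.2, 0.1], "lat": 48.8606, "lon": 2.3376},
    {"vec": [0.0, 0.1, 0.9], "lat": 35.6586, "lon": 139.7454},
]
query_vec = [0.9, 0.1, 0.0]  # hypothetical embedding of the query image

neighbors = retrieve(query_vec, gallery, k=2)
prompt = build_prompt(neighbors)
# The prompt and the query image would then be sent to an LMM (not shown),
# and the model's text reply passed through parse_coordinates.
```

Because the location is produced as generated text conditioned on retrieved examples, no model weights are updated in this sketch, which is consistent with the training-free claim above.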