Enhancing Interactive Image Retrieval Through Query Rewriting Using Large Language Models and Vision Language Models
An interactive image retrieval system that refines queries based on user relevance feedback, incorporating a vision language model to enhance text-based queries and a large language model to denoise query expansions, achieving state-of-the-art performance.