Predicting Information Retrieval Performance Using Relevance Judgments Generated by Large Language Models
The core message of this paper is to propose a novel query performance prediction (QPP) framework, called QPP-GenRE, which decomposes QPP into independent subtasks of automatically generating relevance judgments using large language models (LLMs). QPP-GenRE can predict various IR evaluation measures based on the generated relevance judgments, and provides interpretable insights into QPP outputs.