The paper introduces ICXML, a two-stage framework for zero-shot extreme multi-label classification (XMC) tasks.
In the first stage, ICXML generates a set of candidate labels through in-context learning using large language models (LLMs). It does this by constructing demonstrations that capture the inherent correlation between the input text and the label space, as well as external knowledge. The generated labels are then mapped to the actual label space to create a condensed shortlist of candidate labels.
In the second stage, ICXML utilizes the LLM's ability to handle multiple labels concurrently to perform listwise reranking on the candidate label shortlist, producing the final predictions.
The authors evaluate ICXML on two diverse public benchmarks, LF-Amazon-131K and LF-WikiSeeAlso-320K, and show that it advances the state of the art in zero-shot XMC. They also provide detailed analyses to understand the contributions of different components of the framework.
The key highlights of the paper are:
toiselle kielelle
lähdeaineistosta
arxiv.org
Tärkeimmät oivallukset
by Yaxin Zhu,Ha... klo arxiv.org 04-16-2024
https://arxiv.org/pdf/2311.09649.pdfSyvällisempiä Kysymyksiä