The paper introduces ICXML, a two-stage framework for zero-shot extreme multi-label classification (XMC) tasks.
In the first stage, ICXML generates a set of candidate labels through in-context learning using large language models (LLMs). It does this by constructing demonstrations that capture the inherent correlation between the input text and the label space, as well as external knowledge. The generated labels are then mapped to the actual label space to create a condensed shortlist of candidate labels.
In the second stage, ICXML utilizes the LLM's ability to handle multiple labels concurrently to perform listwise reranking on the candidate label shortlist, producing the final predictions.
The authors evaluate ICXML on two diverse public benchmarks, LF-Amazon-131K and LF-WikiSeeAlso-320K, and show that it advances the state of the art in zero-shot XMC. They also provide detailed analyses to understand the contributions of different components of the framework.
The key highlights of the paper are:
Para outro idioma
do conteúdo fonte
arxiv.org
Principais Insights Extraídos De
by Yaxin Zhu,Ha... às arxiv.org 04-16-2024
https://arxiv.org/pdf/2311.09649.pdfPerguntas Mais Profundas