Concepts de base
A two-stage framework that leverages in-context learning to generate and rerank candidate labels for zero-shot extreme multi-label classification tasks.
Résumé
The paper introduces ICXML, a two-stage framework for zero-shot extreme multi-label classification (XMC) tasks.
In the first stage, ICXML generates a set of candidate labels through in-context learning using large language models (LLMs). It does this by constructing demonstrations that capture the inherent correlation between the input text and the label space, as well as external knowledge. The generated labels are then mapped to the actual label space to create a condensed shortlist of candidate labels.
In the second stage, ICXML utilizes the LLM's ability to handle multiple labels concurrently to perform listwise reranking on the candidate label shortlist, producing the final predictions.
The authors evaluate ICXML on two diverse public benchmarks, LF-Amazon-131K and LF-WikiSeeAlso-320K, and show that it advances the state of the art in zero-shot XMC. They also provide detailed analyses to understand the contributions of different components of the framework.
The key highlights of the paper are:
- Introducing a two-stage framework for zero-shot XMC, involving generation-based label shortlisting and label reranking.
- Advocating for a generation-based approach to yield high-quality input-label pairs, addressing the challenges posed by the absence of specific input scenarios.
- Advancing the state of the art in zero-shot XMC on two public benchmarks and providing detailed analysis for a deeper understanding of model performance.
Stats
The LF-Amazon-131K dataset has 294,805 training instances, 134,835 test instances, and 131,073 labels.
The LF-WikiSeeAlso-320K dataset has 693,082 training instances, 177,515 test instances, and 312,330 labels.
Citations
"While existing research has primarily focused on supervised XMC, real-world applications often encounter challenges in obtaining complete supervision signals."
"To address this issue, we put together the benefits of both retrieval- and generation-based approaches by introducing ICXML– a two-stage framework designed for zero-shot XMC."
"Extensive experiments suggest that ICXML advances the state of the art on two diverse public benchmarks."