核心概念
A framework to incubate text classifiers by leveraging instruction-tuned large language models, enabling the generation of customized classifiers following user preferences.
要約
The paper proposes a novel framework called "Incubator" to generate text classifiers based on user instructions. The key ideas are:
-
Instruction-Tuning: The authors collect instruction-data pairs from public classification datasets and use in-context learning (ICL) to fine-tune a large language model (LLM) as the "Incubator". This allows the Incubator to generate training data for text classifiers according to user-provided instructions.
-
Self-Diversification: To address the potential bias and lack of diversity in the generated data, the authors introduce a self-diversification technique. It utilizes a text embedder to identify semantically diverse samples and incorporates them into the instruction-tuning process.
The experiments demonstrate that the Incubator can:
- Outperform strong baselines on traditional text classification benchmarks.
- Handle complex label definitions, including "Other" classes and logical conjunctions.
- Incubate classifiers that satisfy personalized user preferences for text mining.
The authors also provide comprehensive analyses on the efficiency, robustness, and scalability of the Incubator framework.
統計
The average time for dataset generation is 67.53 seconds.
The average time for classifier incubation (fine-tuning) is 15.16 seconds per class.
引用
"We argue that the LLMs need further instruction-tuning (Ouyang et al., 2022), particularly for classification data generation."
"Our work follows this trend to instruction-tune LLMs as Incubator, which customize classifiers according to user instructions."
"Experiment results verify our Incubator to be able to (1) incubate strong text classifiers that outperform the baselines, (2) consider the label interdependency and follow the user preference in the instruction, (3) incubate multiple text classifiers and use logical conjunctions to realize advanced text mining systems."