Core Concepts
The author explores the effectiveness of zero-shot stance detection using FlanT5-XXL, demonstrating its ability to match or surpass state-of-the-art benchmarks without fine-tuning. The study delves into various factors affecting performance, including prompts, instructions, and decoding strategies.
Abstract
The study investigates zero-shot stance detection using FlanT5-XXL on Twitter datasets. It highlights the model's performance against baselines, sensitivity to prompts and instructions, and the impact of decoding strategies. Results show competitive performance and insights into optimizing zero-shot stance detection tasks.
Key points:
- Investigating zero-shot stance detection with FlanT5-XXL on Twitter datasets.
- Comparing model performance against strong baselines in SemEval 2016 Task 6A, 6B, and P-Stance.
- Analyzing sensitivity to prompts, instructions, and decoding strategies.
- Demonstrating competitive performance and potential for optimization in zero-shot stance detection tasks.
Stats
"FlanT5-XXL shows state-of-the-art performance on SemEval 2016 Task 6B."
"Performance is close to SoTA on Task 6A and exceeds it in Task 6B."
"Greedy decoding strategy offers competitive performance across different prompts."
Quotes
"The zero-shot approach can match or outperform state-of-the-art benchmarks."
"FlanT5-XXL demonstrates impressive performance across various tasks."
"The model is not sensitive to instruction paraphrasing but negatively affected by opposition or negation in prompts."