AS-ES learning introduces a new training paradigm for small models to improve CoT learning efficiency. The method involves segmenting CoT data into Extractive Segments (ES) and Abstractive Segments (AS). This approach enhances logical reasoning capabilities without altering the model or requiring extra data. Experimental results show improved performance on tasks like Math Word Problems and PET summarization. The study explores the impact of segmentation strategies, model sizes, and hyperparameters on the effectiveness of AS-ES learning.
Naar een andere taal
vanuit de broninhoud
arxiv.org
Belangrijkste Inzichten Gedestilleerd Uit
by Nuwa Xi,Yuha... om arxiv.org 03-05-2024
https://arxiv.org/pdf/2403.01969.pdfDiepere vragen