AS-ES learning is a training paradigm that helps small models learn chain-of-thought (CoT) reasoning more efficiently. The method segments CoT data into Extractive Segments (ES) and Abstractive Segments (AS). This improves logical reasoning without altering the model or requiring extra data. Experimental results show improved performance on CoT-intensive tasks such as Math Word Problems (MWP) and PET report summarization. The study also explores how segmentation strategies, model sizes, and hyperparameters affect the effectiveness of AS-ES learning.
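To make the idea of splitting a CoT into extractive and abstractive segments concrete, here is a minimal sketch. It is not the segmentation strategy from the paper: the token-overlap heuristic, the `threshold` value, and the function names `tokenize` and `label_segments` are all assumptions made for illustration. A sentence is labeled ES when most of its tokens are copied from the question, and AS otherwise.

```python
import re


def tokenize(text):
    """Lowercase the text and return its alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def label_segments(question, cot, threshold=0.6):
    """Label each CoT sentence as 'ES' or 'AS'.

    Hypothetical heuristic (an assumption, not the paper's method):
    a sentence is extractive (ES) if at least `threshold` of its
    tokens also occur in the question, otherwise abstractive (AS).
    """
    question_tokens = set(tokenize(question))
    # Split the CoT into sentences at whitespace that follows ., ! or ?
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", cot) if s.strip()]
    labeled = []
    for sentence in sentences:
        tokens = tokenize(sentence)
        if tokens:
            overlap = sum(t in question_tokens for t in tokens) / len(tokens)
        else:
            overlap = 0.0
        labeled.append(("ES" if overlap >= threshold else "AS", sentence))
    return labeled


if __name__ == "__main__":
    question = "Tom has 3 apples and buys 5 more apples. How many apples does Tom have?"
    cot = "Tom has 3 apples and buys 5 more. So the total is 3 + 5 = 8."
    for label, sentence in label_segments(question, cot):
        print(label, "|", sentence)
```

In this toy example the first sentence restates the question and the second introduces new reasoning, so the two receive different labels. A real pipeline would likely need a more robust segmentation criterion than raw token overlap.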
Key insights extracted from
by Nuwa Xi, Yuha... at arxiv.org 03-05-2024
https://arxiv.org/pdf/2403.01969.pdf
Deeper Inquiries