AS-ES learning introduces a new training paradigm for small models to improve CoT learning efficiency. The method involves segmenting CoT data into Extractive Segments (ES) and Abstractive Segments (AS). This approach enhances logical reasoning capabilities without altering the model or requiring extra data. Experimental results show improved performance on tasks like Math Word Problems and PET summarization. The study explores the impact of segmentation strategies, model sizes, and hyperparameters on the effectiveness of AS-ES learning.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Nuwa Xi,Yuha... lúc arxiv.org 03-05-2024
https://arxiv.org/pdf/2403.01969.pdfYêu cầu sâu hơn