Large Language Models Can Improve Reasoning Abilities by Learning from Correct Steps
A novel intrinsic self-correction framework, Learning from Correctness (LECO), can significantly improve the reasoning performance of large language models across various tasks by progressively accumulating correct reasoning steps without relying on external feedback or handcrafted prompts.