Enhancing Multilingual Reasoning Capabilities of Large Language Models through Question Alignment Training
The two-stage training framework of question alignment and response alignment can effectively enable large language models to leverage their English expertise to improve performance on multilingual reasoning tasks across diverse scenarios, including math reasoning with chain-of-thought, math reasoning with executable code, and common sense reasoning.