Enhancing arithmetic reasoning in Large Language Models through query-dependent prompt optimization using Offline Inverse RL.