LLMs exhibit varying levels of math reasoning ability but lack robustness in solving math word problems, especially when faced with question variations.