Probing Large Language Models for Arithmetic Reasoning Capabilities
Large language models struggle with basic arithmetic reasoning over implicitly held numerical knowledge, despite their progress in knowledge acquisition and statistical inference.