Large Language Models Struggle to Detect Unreasonable Math Problems
Large language models (LLMs) demonstrate significant capabilities in solving math problems, but they tend to produce hallucinations when given questions containing unreasonable errors.