Challenges in Multimodal Reasoning with Algorithmic Puzzles
The author introduces a novel task of multimodal puzzle solving, highlighting the limitations of large language models in solving algorithmic puzzles that require visual understanding, language comprehension, and complex reasoning.