Large language models struggle to perform human-like planning tasks, revealing the need for further research and improvement.
Language models struggle to perform human-like planning tasks, revealing the limitations of current models.