Despite industry hype, generative AI models still struggle significantly with planning tasks, exposing a gap between their perceived and actual capabilities.