This work introduces the Retrieval-Augmented Planner (RAP), a model for adaptive procedure planning in instructional videos that overcomes the limitation of fixed-length predictions and improves planning performance.
The author argues that augmenting an agent with procedural knowledge drawn from the training procedure plans can significantly improve procedure planning in instructional videos.