Heuristic-Guided Multi-Step Reasoning with Large Language Models: A POMDP Approach
We present Plan of Thoughts (PoT), a novel planning-based approach that formulates multi-step problem solving as a Partially Observable Markov Decision Process (POMDP) and uses a large language model's self-reflective reasoning capabilities to guide the search for a solution.