Retrosynthetic Planning with Dual Value Networks: Enhancing Synthesis Routes
In creating the PDVN algorithm, the authors aim to improve single-step predictors by considering complete synthesis routes through reinforcement learning. The approach involves using dual value networks to optimize synthesizability and cost predictions.