Retrosynthetic planning is crucial in drug discovery and materials design. The PDVN algorithm enhances single-step predictors by optimizing complete routes, leading to improved success rates and shorter synthesis routes on datasets like USPTO.
The combination of ML-based single-step reaction predictors with multi-step planners has shown promising results. However, existing methods often focus on optimizing single-step accuracy without considering complete routes. To address this limitation, the authors propose a novel online training algorithm called Planning with Dual Value Networks (PDVN).
PDVN leverages reinforcement learning to enhance the single-step predictor by using a tree-shaped Markov Decision Process (MDP) to optimize complete synthesis routes. The algorithm alternates between planning and updating phases, constructing two separate value networks to predict synthesizability and cost of molecules.
Experiments on the USPTO dataset demonstrate that PDVN significantly improves search success rates of existing multi-step planners like Retro* and RetroGraph. The algorithm not only increases success rates but also finds shorter synthesis routes, showcasing its effectiveness in enhancing retrosynthetic planning.
Further research could explore extending PDVN to other single-step models for broader applicability in chemical synthesis planning. Additionally, investigating the impact of different cost functions on route quality could provide valuable insights into optimizing retrosynthesis algorithms.
To Another Language
from source content
arxiv.org
Key Insights Distilled From
by Guoqing Liu,... at arxiv.org 03-05-2024
https://arxiv.org/pdf/2301.13755.pdfDeeper Inquiries