Core Concepts
Local Search GFlowNets enhances training effectiveness by leveraging local search in object space, improving mode seeking and average reward performance.
Abstract
1. Introduction:
Generative Flow Networks (GFlowNets) learn reward-proportional distributions over objects.
Emphasize exploration for diverse high-reward samples.
Struggle with over-exploration hindering training efficiency.
2. Main Intuition for LS-GFN:
Introduces a novel algorithm, LS-GFN, combining inter-mode global exploration and intra-mode local exploration.
Synergistically refines trajectories using local search to enhance training effectiveness.
3. Training Process:
Step A: Sampling complete trajectories using GFlowNet policies.
Step B: Refining trajectories through iterative backtracking and reconstruction guided by backward and forward policies.
Step C: Training GFlowNet using revised trajectories with reward prioritized sampling.
4. Experimental Results:
Significant improvement in biochemical tasks observed with LS-GFN.
Outperforms prior methods in mode seeking and average reward performance.
Stats
LS-GFNは、複数の生物化学的タスクで顕著なパフォーマンス向上を実証しています。
Quotes
"LS-GFN has the fastest mode mixing capability among GFlowNet baselines and RL baselines."
"Our method outperforms baselines and matches the target distribution in most cases."