核心概念
Proposing SMC-Wake as an alternative to RWS for fitting amortized variational approximations, providing consistent gradient estimators and accurate posterior approximations.
要約
The content introduces Sequential Monte Carlo (SMC) for inclusive KL minimization in amortized variational inference. It compares SMC-Wake to Reweighted Wake-Sleep (RWS) and discusses the challenges of minimizing the forward KL divergence. The method proposes three gradient estimators, highlighting their unbiasedness and consistency. Experimental results on simulated and real datasets demonstrate the effectiveness of SMC-Wake in fitting accurate variational distributions approximating the posterior.
1. Introduction
- Training encoder network for amortized variational inference.
- Challenges in minimizing forward KL divergence.
- Proposal of SMC-Wake as an alternative to RWS.
2. Background
- Overview of Reweighted Wake-Sleep (RWS).
- Issues with biased gradients and concentrated distributions.
- Proposal of SMC-Wake using likelihood-tempered SMC samplers.
3. Mass Concentration in RWS
- Circular pathology leading to degenerate variational distributions.
- Proposed solution: using prior as base distribution in LT-SMC.
4. SMC-WAKE
- Methodology for fitting amortized encoder with sequential Monte Carlo.
- Three gradient estimators proposed, emphasizing unbiasedness and consistency.
5. Related Work
- Comparison with VSMC, FIVO, AESMC, MSC, NASMC, AFT, CR-AFT, NVI methods.
6. Experiments
Two moons:
- Comparison of SMC-Wake with RWS on a simulated dataset.
Avoiding mode collapse in MNIST:
- Illustration of wake-phase training issues on MNIST dataset.
Transformed Gaussian:
- Comparison between MSC and SMC-PIMH-Wake on a hierarchical model.
Galaxy spectra emulator:
- Emulator training using PROVABGS simulator and comparison with MCMC results.
統計
"Reweighted Wake-Sleep (RWS) suffers from heavily biased gradients."
"SMC-Wake fits variational distributions accurately compared to existing methods."
引用
"The circular pathology can lead to degenerate mass concentration."
"SMC-Wake provides consistent gradient estimates for forward KL minimization."