Optimizing Probability Distributions with Surrogate Natural Gradients
The author proposes a novel technique for optimizing probability distribution parameters by reframing the optimization as one with respect to a surrogate distribution, making computing natural gradients easier.