State-Space Decomposition Model for Stochastic Video Prediction with Long-Term Motion Trend Consideration
We propose a state-space decomposition model that splits stochastic video prediction into two parts: deterministic appearance prediction and stochastic motion prediction. The model also incorporates the long-term motion trend to guide the generation of future frames.
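To make the decomposition concrete, the following toy sketch rolls out a two-part state: an appearance state updated by a fixed deterministic map, and a motion state that is pulled toward a long-term trend vector and perturbed by Gaussian noise. It is only an illustration under strong assumptions, not the model described here: the function `rollout`, the parameters `pull`, `decay`, and `noise_scale`, the linear transitions, and the additive "decoder" are all hypothetical stand-ins for what would be learned neural networks operating on video frames.

```python
import random


def rollout(appearance, motion, trend, steps, pull=0.5, decay=0.9,
            noise_scale=0.1, seed=0):
    """Toy rollout of a decomposed state (all names here are hypothetical).

    appearance: list of floats, updated deterministically.
    motion:     list of floats, updated stochastically.
    trend:      list of floats, a fixed long-term motion trend that
                guides the motion state at every step.
    Returns (appearance_states, motion_states, frames).
    """
    rng = random.Random(seed)
    appearance_states, motion_states, frames = [], [], []
    for _ in range(steps):
        # Stochastic motion transition: move toward the trend, then add noise.
        motion = [
            (1.0 - pull) * m + pull * t + rng.gauss(0.0, noise_scale)
            for m, t in zip(motion, trend)
        ]
        # Deterministic appearance transition: no random draw is involved.
        appearance = [decay * a for a in appearance]
        # Assumed "decoder": a frame is simply appearance plus motion.
        frame = [a + m for a, m in zip(appearance, motion)]
        appearance_states.append(appearance)
        motion_states.append(motion)
        frames.append(frame)
    return appearance_states, motion_states, frames


if __name__ == "__main__":
    # Two rollouts with different seeds share the same appearance path
    # but sample different motion paths, hence different future frames.
    a1, m1, f1 = rollout([1.0, 2.0], [0.0, 0.0], [1.0, -1.0], steps=3, seed=1)
    a2, m2, f2 = rollout([1.0, 2.0], [0.0, 0.0], [1.0, -1.0], steps=3, seed=2)
    print("same appearance:", a1 == a2)
    print("same motion:", m1 == m2)
```

In this sketch, sampling several rollouts with different seeds yields multiple plausible futures that differ only through the motion state, while the trend keeps each sampled motion path from drifting arbitrarily.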