The content presents MemFlow, a novel architecture for real-time optical flow estimation and prediction that utilizes a memory module. Key highlights:
MemFlow maintains a memory buffer to store historical motion states of the video, along with an efficient update and read-out process that retrieves useful motion information for the current frame's optical flow estimation.
MemFlow incorporates a resolution-adaptive re-scaling technique in the attention mechanism, enhancing cross-resolution generalization performance.
MemFlow achieves state-of-the-art or near-SOTA performance on various optical flow estimation benchmarks, including Sintel and KITTI, while demonstrating exceptional efficiency with minimal computational overhead.
MemFlow can be repurposed for optical flow future prediction with minimal changes, achieving competitive results in video prediction without specific training for this downstream task.
The authors make four key contributions: (1) an innovative real-time optical flow estimation architecture with a memory module, (2) a resolution-adaptive re-scaling technique, (3) superior optical flow estimation performance, and (4) future prediction capability without explicit training.
翻译成其他语言
从原文生成
arxiv.org
更深入的查询