The content presents a novel approach for learning object dynamics using point-based convolutional neural networks. The key highlights are:
The authors propose two specialized convolution operators - Object PointConv and Relational PointConv - to model within-object and between-object interactions, respectively. Object PointConv propagates effects within the same object, while Relational PointConv captures interactions across different objects.
The authors assemble these convolution operators into a U-Net architecture, which enables hierarchical feature learning and long-range interaction modeling. The U-Net encoder downsamples the point cloud while the decoder upsamples it back, allowing the network to capture both local and global scene dynamics.
For mesh-based inputs, the authors introduce an approach to compute features at interaction points on mesh faces, which are then propagated to the mesh vertices. This allows the model to reason about face-to-face collisions effectively.
Experiments on the Physion and Kubric benchmarks show that the proposed point-based approach outperforms state-of-the-art graph neural network methods, especially in scenarios involving gravity and collisions. The authors demonstrate the benefits of using continuous point convolutions over message passing in graph networks for learning object dynamics.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Chanho Kim,L... lúc arxiv.org 04-10-2024
https://arxiv.org/pdf/2404.06044.pdfYêu cầu sâu hơn