核心概念
A novel graph-matching-based approach, FreeAlign, enables robust collaborative perception without requiring external localization and clock devices.
摘要
The paper proposes a robust collaborative perception system that operates independently of external devices for localization and clock synchronization. The key module, FreeAlign, leverages graph matching techniques to identify similar geometric patterns within the perceptual data of various agents, ensuring accurate alignment in both spatial and temporal domains.
FreeAlign comprises three key components:
- Salient-object graph learning: A Graph Neural Network (GNN) is used to capture comprehensive edge features among the salient objects detected by each agent.
- Multi-anchor-based subgraph searching: FreeAlign identifies the approximate maximum common subgraph across two salient-object graphs, signifying distinct and similar geometric structures.
- Relative transformation calculation: The common subgraph is leveraged to calculate the relative pose and latency between two collaborative messages.
The proposed system offers two key advantages: 1) it provides a machine learning approach to substitute global localization and synchronized devices, substantially bolstering the robustness of collaborative perception; and 2) FreeAlign can be seamlessly integrated with numerous established methods without necessitating retraining of the collaborative perception architecture.
Extensive experiments on both simulated and real-world datasets demonstrate that FreeAlign-empowered collaborative perception systems perform comparably to those relying on precise localization and clock devices, even in the presence of pose errors, latency deviations, and malicious attacks.
統計資料
The average relative pose error between agents is 0.266m and 0.318m on the OPV2V and DAIR-V2X datasets, respectively.
The average clock deviation between agents is 22.8ms and 45.6ms on the OPV2V and DAIR-V2X datasets, respectively.
引述
"FreeAlign's independence from prior pose information makes it less susceptible to the impacts of pose noise."
"With FreeAlign's assistance, the ego vehicle successfully detects through the T-junction, where its solo detection performance is suboptimal."