The authors propose a novel Sparse Query-Centric paradigm for end-to-end Autonomous Driving (SparseAD) to address the limitations of previous dense BEV-Centric end-to-end methods.
In the Sparse Perception module, perception tasks including detection, tracking and online mapping are unified in a sparse manner, where multi-sensor features and historical memories are regarded as tokens, and object queries and map queries represent obstacles and road elements respectively. This avoids the heavy computational and memory burden of dense BEV representations.
In the Motion Planner, with sparse perception queries as environmental representation, the authors apply multi-modal motion prediction for both ego-vehicle and surrounding agents simultaneously, and fully consider driving constraints of multiple dimensions to generate the final planning.
Experiments on the nuScenes dataset show that SparseAD achieves state-of-the-art full-task performance among end-to-end methods and significantly narrows the gap between end-to-end paradigms and single-task methods, demonstrating the great potential of the sparse query-centric paradigm.
Para outro idioma
do conteúdo fonte
arxiv.org
Principais Insights Extraídos De
by Diankun Zhan... às arxiv.org 04-11-2024
https://arxiv.org/pdf/2404.06892.pdfPerguntas Mais Profundas