Improving Bird's Eye View Semantic Segmentation by Decomposing the Task into Generation and Perception
A two-stage method is proposed that decomposes the traditional end-to-end bird's eye view (BEV) semantic segmentation task into a BEV autoencoder for generation and an RGB-BEV alignment module for perception, which reduces the difficulty of the task and improves performance.
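The decomposition can be illustrated with a toy sketch. It is not the proposed model: a nearest-prototype codebook stands in for the BEV autoencoder, a nearest-centroid lookup stands in for the RGB-BEV alignment module, and all names (`ToyBEVAutoencoder`, `ToyAligner`, `train_two_stage`, `segment`) as well as the feature vectors are hypothetical. The sketch only shows the assumed data flow: stage 1 learns to reconstruct BEV maps from a latent code, stage 2 maps RGB features into that fixed latent space, and inference decodes the aligned code.

```python
from dataclasses import dataclass, field


def hamming(a, b):
    """Number of cells in which two flattened BEV maps differ."""
    return sum(x != y for x, y in zip(a, b))


def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


@dataclass
class ToyBEVAutoencoder:
    """Stage 1 (generation): reconstructs BEV maps from a latent code.

    Assumption: the latent code is an index into a prototype codebook.
    """
    codebook: list = field(default_factory=list)

    def fit(self, bev_maps):
        # Toy "training": keep each distinct BEV map as a prototype.
        self.codebook = sorted(set(bev_maps))

    def encode(self, bev):
        # Latent code = index of the closest prototype.
        return min(range(len(self.codebook)),
                   key=lambda i: hamming(self.codebook[i], bev))

    def decode(self, code):
        return self.codebook[code]


@dataclass
class ToyAligner:
    """Stage 2 (perception): maps RGB features into the stage-1 latent space."""
    centroids: dict = field(default_factory=dict)

    def fit(self, rgb_feats, codes):
        # Average the RGB features that share the same BEV latent code.
        groups = {}
        for feat, code in zip(rgb_feats, codes):
            groups.setdefault(code, []).append(feat)
        self.centroids = {
            code: tuple(sum(col) / len(feats) for col in zip(*feats))
            for code, feats in groups.items()
        }

    def predict(self, feat):
        # Pick the latent code whose centroid is closest to the feature.
        return min(self.centroids,
                   key=lambda c: sq_dist(self.centroids[c], feat))


def train_two_stage(bev_maps, rgb_feats):
    """Fit the autoencoder first, then align RGB features to its codes."""
    autoencoder = ToyBEVAutoencoder()
    autoencoder.fit(bev_maps)
    codes = [autoencoder.encode(b) for b in bev_maps]  # stage 1 is now fixed
    aligner = ToyAligner()
    aligner.fit(rgb_feats, codes)
    return autoencoder, aligner


def segment(autoencoder, aligner, rgb_feat):
    """Inference: RGB feature -> aligned latent code -> decoded BEV map."""
    return autoencoder.decode(aligner.predict(rgb_feat))
```

With three made-up training pairs, for example `bev_maps = [(0, 0, 1, 1), (1, 1, 0, 0), (0, 0, 1, 1)]` and `rgb_feats = [(0.9, 0.1), (0.1, 0.8), (1.0, 0.0)]`, calling `segment` on an unseen feature such as `(0.8, 0.2)` returns one of the stored BEV prototypes. In this toy the perception stage can only select among codes, so every output is a map that the generation stage already knows how to produce.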