Compact Occupancy TRansformer for Efficient and Accurate 3D Occupancy Prediction
The authors propose Compact Occupancy TRansformer (COTR), a method that constructs a compact and geometry-aware 3D occupancy representation through efficient explicit-implicit view transformation, and further enhances its semantic discriminability using a coarse-to-fine semantic grouping strategy.