Collaborative Semantic Occupancy Prediction with Hybrid Feature Fusion in Connected Automated Vehicles

Song, Rui; Liang, Chenwei; Cao, Hu; Yan, Zhiran; Zimmer, Walter; Gross, Markus; Festag, Andreas; Knoll, Alois

Title:: Collaborative Semantic Occupancy Prediction with Hybrid Feature Fusion in Connected Automated Vehicles
Document type:: Konferenzbeitrag
Contribution type:: Textbeitrag / Aufsatz
Author(s):: Song, Rui; Liang, Chenwei; Cao, Hu; Yan, Zhiran; Zimmer, Walter; Gross, Markus; Festag, Andreas; Knoll, Alois
Abstract:: Collaborative perception in automated vehicles leverages the exchange of information between agents, aiming to elevate perception results. Previous camera-based collaborative 3D perception methods typically employ 3D bounding boxes or bird's eye views as representations of the environment. However, these approaches fall short in offering a comprehensive 3D environmental prediction. To bridge this gap, we introduce the first method for collaborative 3D semantic occupancy prediction. Particularly, it improves local 3D semantic occupancy predictions by hybrid fusion of (i) semantic and occupancy task features, and (ii) compressed orthogonal attention features shared between vehicles. Additionally, due to the lack of a collaborative perception dataset designed for semantic occupancy prediction, we augment a current collaborative perception dataset to include 3D collaborative semantic occupancy labels for a more robust evaluation. The experimental findings highlight that: (i) our collaborative semantic occupancy predictions excel above the results from single vehicles by over 30%, and (ii) models anchored on semantic occupancy outpace state-of-the-art collaborative 3D detection techniques in subsequent perception applications, showcasing enhanced accuracy and enriched semantic-awareness in road environments. «
Collaborative perception in automated vehicles leverages the exchange of information between agents, aiming to elevate perception results. Previous camera-based collaborative 3D perception methods typically employ 3D bounding boxes or bird's eye views as representations of the environment. However, these approaches fall short in offering a comprehensive 3D environmental prediction. To bridge this gap, we introduce the first method for collaborative 3D semantic occupancy prediction. Particularly,... »
Keywords:: Autonomous Driving, Cooperative Perception, V2X, Semantic Occupancy Prediction, Semantic Scene Completion
Dewey Decimal Classification:: 000 Informatik, Wissen, Systeme
Editor:: IEEE/CVF
Book / Congress title:: 2024 IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR)
Date of congress:: 17.06.2024
Year:: 2024
Reviewed:: ja
Language:: en
TUM Institution:: Chair of Robotics, Artificial Intelligence and Real-Time Systems
Format:: Text
BibTeX