Current multi-modal object detection approaches focus on the vehicle domain and are limited in perception range and processing capability. Roadside sensor units (RSUs) introduce a new domain for perception systems and leverage altitude to observe traffic. Cameras and LiDARs mounted on gantry bridges increase the perception range and produce a full digital twin of the traffic. In this work, we introduce InfraDet3D, a multi-modal 3D object detector for roadside infrastructure sensors. We fuse two LiDARs using early fusion and further incorporate detections from monocular cameras to increase robustness and detect small objects. Our monocular 3D detection module uses HD maps to ground object yaw hypotheses, improving the final perception results. The perception framework is deployed at a real-world intersection that is part of the A9 Test Stretch in Munich, Germany. We perform several ablation studies and experiments and show that fusing two LiDARs with two cameras leads to an improvement of +1.90 mAP compared to a camera-only solution. We evaluate our results on the A9 infrastructure dataset, achieving 68.48 mAP on the test set. The dataset and code will be available at this https URL to allow the research community to further improve the perception results and make autonomous driving safer.
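The abstract does not spell out the fusion implementation; as a minimal sketch of what the early LiDAR fusion step can look like, assuming the two roadside LiDARs are registered by a known extrinsic calibration, the two scans can be transformed into a common frame and concatenated before a single 3D detector is run. The function name, signature, and transform convention below are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def early_fuse(points_a: np.ndarray, points_b: np.ndarray,
               T_b_to_a: np.ndarray) -> np.ndarray:
    """Early-fuse two LiDAR scans at the point-cloud level.

    points_a, points_b: (N, 3) xyz point clouds from the two LiDARs.
    T_b_to_a: (4, 4) homogeneous extrinsic transform mapping LiDAR B's
    frame into LiDAR A's frame (assumed known from offline calibration).
    """
    # Lift scan B to homogeneous coordinates and apply the extrinsic.
    ones = np.ones((points_b.shape[0], 1))
    b_homo = np.hstack([points_b, ones])          # (N, 4)
    b_in_a = (T_b_to_a @ b_homo.T).T[:, :3]       # (N, 3) in A's frame
    # Early fusion: one combined cloud is fed to a single 3D detector,
    # rather than fusing per-sensor detections afterwards (late fusion).
    return np.vstack([points_a, b_in_a])
```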