Simultaneous Object Detection and Distance Estimation for Indoor Autonomous Vehicles

Azurmendi, Iker; Zulueta Guerrero, Ekaitz; López Guede, José Manuel; González, Manuel

Date

2023-11-21

Electronics 12(23) : (2023) // Article ID 4719

URI

http://hdl.handle.net/10810/63391

Abstract

Object detection is an essential technology in many fields because it automatically locates and identifies objects in images or videos. Object-distance estimation, in turn, is a fundamental problem in 3D vision and scene perception. In this paper, we propose a simultaneous object-detection and distance-estimation algorithm based on YOLOv5 for obstacle detection in indoor autonomous vehicles. The method estimates the distances to the desired obstacles using a single monocular camera that does not require calibration. We train the algorithm with the KITTI dataset, an autonomous driving vision dataset that provides labels for object detection and distance prediction. In addition, we collect and label 100 images from a custom environment, and we apply data augmentation and transfer learning to generate a fast, accurate, and cost-effective model for that environment. The results show an mAP0.5:0.95 of more than 75% for object detection and a mean absolute error of 0.71 m in distance prediction, both of which could be improved by labeling a larger amount of data. Finally, we compare our method with other similar state-of-the-art approaches.
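The two metrics quoted in the abstract can be illustrated with a minimal sketch. This is not the authors' evaluation code: the function names and the sample values are hypothetical and serve only to show what a mean absolute error in metres and the IoU threshold range behind mAP0.5:0.95 refer to.

```python
# Illustrative sketch only; names and sample values are assumptions,
# not taken from the paper.

def mean_absolute_error(predicted_m, actual_m):
    """Average absolute difference between predicted and true distances (metres)."""
    if len(predicted_m) != len(actual_m) or not predicted_m:
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(abs(p - a) for p, a in zip(predicted_m, actual_m))
    return total / len(predicted_m)


def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def map_iou_thresholds():
    """IoU thresholds 0.50, 0.55, ..., 0.95 over which mAP0.5:0.95 is averaged."""
    return [round(0.5 + 0.05 * i, 2) for i in range(10)]


if __name__ == "__main__":
    # Made-up distances in metres, for illustration only.
    predicted = [2.0, 4.5, 7.0]
    actual = [2.5, 4.0, 8.0]
    print("MAE (m):", mean_absolute_error(predicted, actual))
    print("IoU:", iou((0, 0, 2, 2), (1, 1, 3, 3)))
    print("Thresholds:", map_iou_thresholds())
```

A detection counts as correct at a given threshold when its IoU with the ground-truth box meets that threshold; mAP0.5:0.95 averages the resulting average precision over all ten thresholds.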

Collections

Articles

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
