KDepthNet: Mono-Camera Based Depth Estimation for Autonomous Driving
Technical Paper
2022-01-0082
ISSN: 0148-7191, e-ISSN: 2688-3627
Language: English
Abstract
Obstacle avoidance is vital to safe autonomous driving. When a vehicle travels from an arbitrary start position to an arbitrary target position in its environment, a suitable route must avoid both static and moving obstacles. Knowing the accurate depth of each obstacle in the scene contributes to obstacle avoidance. In recent years, advances in deep neural networks and hardware have made precise depth estimation systems possible. Many depth estimation methods for autonomous vehicles use lasers, structured light, or other reflections from object surfaces to capture depth point clouds, complete surface modeling, and estimate scene depth maps. However, estimating precise depth maps this way remains challenging because of its computational complexity and processing time. In contrast, image-based depth estimation approaches have recently attracted attention and can be applied to a broad range of applications. The vast majority of camera-based depth estimation methods determine the depth map of the whole input image using binocular cameras or a 3D camera, which is also time-consuming. This paper proposes a novel approach that predicts the depth of the obstacle ahead using only a 2D mono camera. In the first stage, a deep neural network extracts the bounding boxes of obstacles. Unlike methods that calculate a depth value for every pixel of the image, this approach calculates the average depth of each bounding box and assigns it as the label. The labels and feature vectors (the four values of each bounding box) are then used as the input data of the proposed network, which maps the feature vectors from the first stage to estimated depth values. The results suggest that the model can reasonably predict the depths of obstacles on the KITTI dataset.
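The label construction described in the abstract can be illustrated with a short sketch. It is not the authors' implementation. The function names are hypothetical, the box order (x_min, y_min, x_max, y_max) is assumed, and a depth of 0 is assumed to mark a pixel with no measurement. A linear least-squares fit stands in for the paper's network, whose architecture the abstract does not describe, to show the mapping from the four box values to a depth.

```python
import numpy as np


def mean_box_depth(depth_map, box):
    """Average depth of the valid pixels inside one bounding box.

    Assumptions (not from the paper): box = (x_min, y_min, x_max, y_max)
    in pixel coordinates, and a depth of 0 means "no measurement".
    """
    x_min, y_min, x_max, y_max = box
    patch = depth_map[y_min:y_max, x_min:x_max]
    valid = patch[patch > 0]
    return float(valid.mean()) if valid.size else float("nan")


def build_training_pairs(depth_map, boxes):
    """Pair each box (4 values) with its average depth as the label."""
    features = np.asarray(boxes, dtype=float).reshape(-1, 4)
    labels = np.array([mean_box_depth(depth_map, b) for b in boxes], dtype=float)
    keep = ~np.isnan(labels)  # drop boxes that contain no valid depth
    return features[keep], labels[keep]


def fit_linear_baseline(features, labels):
    """Least-squares linear map from box values to depth.

    This is a stand-in for the paper's network, not the network itself.
    Returns 4 weights followed by 1 bias term.
    """
    design = np.hstack([features, np.ones((len(features), 1))])
    weights, *_ = np.linalg.lstsq(design, labels, rcond=None)
    return weights


def predict_depth(weights, features):
    """Apply the fitted linear map to one or more boxes."""
    features = np.atleast_2d(np.asarray(features, dtype=float))
    return features @ weights[:-1] + weights[-1]
```

In this sketch, `build_training_pairs` would be run per image on the detector's boxes, and the resulting pairs from many images would be stacked before fitting. A nonlinear regressor would replace `fit_linear_baseline` in any setup closer to the one the abstract describes.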
Citation
Tavakolian, N., Fekri, P., Zadeh, M., and Dargahi, J., "KDepthNet: Mono-Camera Based Depth Estimation for Autonomous Driving," SAE Technical Paper 2022-01-0082, 2022, https://doi.org/10.4271/2022-01-0082.