Vision Based Object Distance Estimation
Technical Paper
2017-01-0109
ISSN: 0148-7191, e-ISSN: 2688-3627
Language: English
Abstract
This work describes a single-camera object distance estimation system. As vehicle technology advances toward autonomy, knowing the locations of objects in 3D space is critical for safe vehicle behavior. Although significant progress has been made on object detection in 2D sensor space from a single camera, this work additionally estimates the distance to the detected object without requiring stereo vision or absolute knowledge of vehicle motion. The proposed system comprises three modules: vision-based ego-motion estimation, object detection, and distance estimation. In particular, we compensate for vehicle ego-motion using a pinhole camera model to increase the accuracy of the object distance estimate. In the ego-motion estimation stage, the system uses the state-of-the-art Oriented FAST and Rotated BRIEF (ORB) feature detector and descriptor to robustly establish feature correspondences between consecutive image frames. The six-degrees-of-freedom ego-motion is then estimated by decomposing the essential matrix computed from these correspondences, and the estimates are further refined by bundle adjustment within a local temporal window. Finally, a deep neural network (DNN) in the object detection module detects objects, and the distance to each detected object is estimated using the pinhole camera model. The proposed mono-camera system yields reliable distance estimates at low cost and with small overall data throughput. Estimation accuracy can be further improved by fusing additional sensors (e.g., radar, lidar, ultrasonic).
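The abstract does not spell out the distance equations, so the following Python sketch only illustrates two common ways a pinhole camera model can turn a detected bounding box into a distance. It is not the authors' implementation. The names `PinholeCamera`, `distance_from_height`, and `distance_from_ground_contact` are hypothetical, as are the assumptions of a known object height, a flat road, a known camera mounting height, and a pitch angle supplied by some ego-motion estimate.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class PinholeCamera:
    """Intrinsics of an ideal pinhole camera (lens distortion ignored)."""
    focal_px: float  # focal length, in pixels
    cx: float        # principal point, x coordinate in pixels
    cy: float        # principal point, y coordinate in pixels


def distance_from_height(cam, real_height_m, bbox_top_px, bbox_bottom_px):
    """Similar triangles: Z = f * H / h.

    H is an ASSUMED real-world object height (metres) and h is the
    bounding-box height in pixels.
    """
    pixel_height = bbox_bottom_px - bbox_top_px
    if pixel_height <= 0:
        raise ValueError("bounding box must have positive height")
    return cam.focal_px * real_height_m / pixel_height


def distance_from_ground_contact(cam, camera_height_m, bbox_bottom_px,
                                 pitch_rad=0.0):
    """Flat-road assumption: the box bottom touches the ground plane.

    The ray through the bottom edge points atan((v - cy) / f) below the
    optical axis. pitch_rad (positive = camera tilted down) is an ASSUMED
    correction, e.g. taken from an ego-motion estimate.
    """
    angle = math.atan2(bbox_bottom_px - cam.cy, cam.focal_px) + pitch_rad
    if angle <= 0:
        raise ValueError("contact point is at or above the horizon")
    return camera_height_m / math.tan(angle)


if __name__ == "__main__":
    cam = PinholeCamera(focal_px=1000.0, cx=640.0, cy=360.0)
    # A 1.5 m tall object spanning 100 pixels vertically.
    print(distance_from_height(cam, 1.5, 300.0, 400.0))
    # Camera mounted 1.2 m above the road, box bottom 100 px below centre.
    print(distance_from_ground_contact(cam, 1.2, 460.0))
```

The ground-contact variant shows why ego-motion compensation matters: a small uncorrected pitch change shifts the contact row in the image and can noticeably change the distance estimate, particularly for far objects near the horizon.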
Citation
Zhang, Y., Goh, M., and Nariyambut Murali, V., "Vision Based Object Distance Estimation," SAE Technical Paper 2017-01-0109, 2017, https://doi.org/10.4271/2017-01-0109.