This content is not included in your SAE MOBILUS subscription, or you are not logged in.
An Improved Multi-Pedestrian Tracking System Based on Deep Neural Network
ISSN: 0148-7191, e-ISSN: 2688-3627
Published April 02, 2019 by SAE International in United States
This content contains downloadable datasetsAnnotation ability available
The intelligent vehicle driverless technology has become a very hot topic in the past two decades. To solve the road safety problem, which is one of the most important factors inhibiting the development of intelligent vehicle technology, the multi-target tracking system has attracted more and more attention in recent study since it can detect and track multiple objects in traffic scene so as to help the whole driving system plan the safe route. In this work, a novel multi-pedestrian tracking system based on deep neural network is proposed to improve the tracking efficiency while providing high recognition accuracy. The proposed tracking system consists of two parts: 1) pedestrian detector, and 2) pedestrian tracker. For the detector part, we first transform the image convolution operation in spatial-temporal domain to the coefficient product operation in complex frequency domain, and then replace the maximum pooling and mean pooling operation in the traditional SSD detection network by the proposed product operation, and the modified SSD model is used as the pedestrian detector. For the tracker part, we train a corresponding appearance model and calculate the appearance similarity of the detected targets between the continuous frames with respect to the cosine distance. To evaluate the performance of our proposed system, we implement the system on the MOT16 benchmark. The detector can improve tracking accuracy by up to 8.9% and the tracker updates the speed of whole system at a rate of 126Hz, our extensions reducing the number of identity switches by 15%. The experimental results demonstrate that the proposed multi-pedestrian tracking system provides better real-time performance and accuracy than the state-of-the-art methods, which can provide more accurate object information for the following auto-driving system with higher safety.
CitationGong, Y., Chi, J., Yu, X., Wu, C. et al., "An Improved Multi-Pedestrian Tracking System Based on Deep Neural Network," SAE Technical Paper 2019-01-1055, 2019, https://doi.org/10.4271/2019-01-1055.
Data Sets - Support Documents
|[Unnamed Dataset 1]|
|[Unnamed Dataset 2]|
- Luo, W. et al., “Multiple Object Tracking: A Literature Review,” Eprint Arxiv, 2015.
- Hall, D. and Perona, P., “Online, Real-Time Tracking Using a Category-to-Individual Detector,” Lecture Notes in Computer Science 8689, 2014, 361-376.
- Benfold, B. and Reid, I., “Stable Multi-target Tracking in Real-Time Surveillance Video,” in Computer Vision and Pattern Recognition (CVPR), IEEE, 2011, 3457-3464.
- Reichlin, F. et al., “Online Multiperson Tracking-by-Detection from a Single, Uncalibrated Camera,” IEEE Transactions on Pattern Analysis & Machine Intelligence 33(9):1820-1833, 2011.
- Kuo, C.H., Huang, C., and Nevatia, R., “Multi-Target Tracking by On-line Learned Discriminative Appearance Models,” Computer Vision and Pattern Recognition IEEE685-692, 2010.
- Bochinski, E., Eiselein, V., and Sikora, T., “High-Speed Tracking-by-Detection without Using Image Information,” in IEEE International Conference on Advanced Video and Signal Based Surveillance, IEEE, 2017.
- Milan, A. et al., “Online Multi-Target Tracking Using Recurrent Neural Networks,” 2016.
- Dehghan, A., Assari, S.M., and Shah, M., “GMMCP Tracker: Globally Optimal Generalized Maximum Multi Clique Problem for Multiple Object Tracking,” IEEE International Conference on Computer Vision and Pattern Recognition, IEEE, 2015, 4091-4099.
- Huang, C., Li, Y., and Nevatia, R., “Multiple Target Tracking by Learning-Based Hierarchical Association of Detection Responses,” IEEE Transactions on Pattern Analysis & Machine Intelligence 35(4):898, 2013.
- Milan, A., Schindler, K., and Roth, S., “Detection- and Trajectory-Level Exclusion in Multiple Object Tracking,” in IEEE Conference on Computer Vision and Pattern Recognition, IEEE Computer Society, 2013, 3682-3689.
- Wen, L. et al., “Multiple Target Tracking Based on Undirected Hierarchical Relation Hypergraph,” Computer Vision and Pattern Recognition IEEE1282-1289, 2014.
- Yang, B. and Nevatia, R., “Online Learned Discriminative Part-Based Appearance Models for Multi-Human Tracking,” European Conference on Computer Vision (Berlin, Springer, 2012).
- Huang, C., Wu, B., and Nevatia, R., “Robust Object Tracking by Hierarchical Association of Detection Responses,” European Conference on Computer Vision (Berlin, Springer, 2008), 788-801.
- Liu, W. et al., “SSD: Single Shot MultiBox Detector,” in European Conference on Computer Vision, Springer International Publishing, 2016, 21-37.
- Rippel, O., Snoek, J., and Adams, R.P., “Spectral Representations for Convolutional Neural Networks,” 2015, 2449-2457.
- Munkres, J., “Algorithms for the Assignment and Transportation Problems,” Journal of the Society for Industrial & Applied Mathematics 5(1):32-38, 1957.
- Milan, A. et al., “MOT16: A Benchmark for Multi-Object Tracking,” 2016.
- Geiger, A. et al., “3D Traffic Scene Understanding from Movable Platforms,” IEEE Transactions on Pattern Analysis & Machine Intelligence 36(5):1012-1025, 2014.
- Pirsiavash, H., Ramanan, D., and Fowlkes, C.C.. “Globally-Optimal Greedy Algorithms for Tracking a Variable Number of Objects,” in IEEE Conference on Computer Vision and Pattern Recognition, IEEE Computer Society, 2011, 1201-1208.
- Dicle, C., Camps, O.I., and Sznaier, M., “The Way They Move: Tracking Multiple Targets with Similar Appearance,” in Proceedings of the IEEE International Conference on Computer Vision, 2013.
- Choi, W., “Near-Online Multi-Target Tracking with Aggregated Local Flow Descriptor,” in Proceedings of the IEEE International Conference on Computer Vision, 2015.
- Yoon, J.H. et al., “Bayesian Multi-Object Tracking Using Motion Context from Multiple Objects,” in Applications of Computer Vision (WACV), 2015 IEEE Winter Conference on, IEEE, 2015.
- Yang, M. and Jia, Y., “Temporal Dynamic Appearance Modeling for Online Multi-Person Tracking,” Computer Vision & Image Understanding 31(9):626-633, 2016.
- Xiang, Y., Alahi, A., and Savarese, S., “Learning to Track: Online Multi-Object Tracking by Decision Making,” in IEEE International Conference on Computer Vision IEEE, 2015, 4705-4713.
- Bernardin, K. and Stiefelhagen, R., “Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics,” Eurasip Journal on Image & Video Processing 1(2008):246309, 2008.
- Ren, S. et al., “Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks,” International Conference on Neural Information Processing Systems (MIT Press, 2015), 91-99.
- He, Q. et al., “SOT for MOT,” arXiv preprint arXiv:1712.01059, 2017.