ZHANG Shuo, WU Yanxia, MEN Chaoguang, et al., “Tiny YOLO Optimization Oriented Bus Passenger Object Detection,” Chinese Journal of Electronics, vol. 29, no. 1, pp. 132-138, 2020, doi: 10.1049/cje.2019.11.002
Citation: ZHANG Shuo, WU Yanxia, MEN Chaoguang, et al., “Tiny YOLO Optimization Oriented Bus Passenger Object Detection,” Chinese Journal of Electronics, vol. 29, no. 1, pp. 132-138, 2020, doi: 10.1049/cje.2019.11.002

Tiny YOLO Optimization Oriented Bus Passenger Object Detection

doi: 10.1049/cje.2019.11.002
Funds:  This work is supported by the National Key R&D Program of China (No.2016YFB1000402), the Natural Science Foundation of Heilongjiang Province (No.F2018008), the Foundation for Distinguished Young Scholars of Harbin (No.2017RAYXJ016), and the Fundamental Research Funds for the Central Universities (No.3072019CFT0602).
More Information
  • Corresponding author: WU Yanxia (corresponding author) was born in 1979. She received the B.S., M.S. and Ph.D. degrees from Harbin Engineering University, Harbin, China. Now she is an associate professor in the College of Computer Science and Technology in Harbin Engineering University. Her current research interests include compiler technology and computer architecture. (Email:wuyanxia@hrbeu.edu.cn)
  • Received Date: 2019-01-18
  • Rev Recd Date: 2019-06-27
  • Publish Date: 2020-01-10
  • The real-time collection of bus passenger object detection is an essential part of developing a smart bus system. The difficulty of object detection mainly lies in the objective factors, such as:clothing, hair style and accessories, light, etc. Traditional object detection methods with the artificial feature extraction suffers from insufficient strength in expression, generalization, and recognition rate. The object detection method based on deep learning mainly uses the convolutional neural network in deep learning to learn features from a large set of data. The learned features can describe the rich information inherent in the data, and improve the expression ability of the features as well as the recognition accuracy. Due to too many parameters of the Convolutional neural network (CNN) model, the amount of calculation is too large to be operated on the vehicle terminal. To reduce calculation burden and improve the operation speed, we employs the depthwise separable convolution method to optimize the convolutional layer of tiny YOLO network model. It decomposes a complete convolution operation into depthwise convolution and pointwise convolution, thus reducing the parameter amount of the CNN and improving the operation speed. The experiment results reveal that the speed of bus passenger object detection detected by our improved model is 4 times faster than the previous one but with the nearly same detection accuracy.
  • loading
  • R. Girshick, J. Donahue, T. Darrell, et al., "Rich feature hierarchies for accurate object detection and semantic segmentation", The IEEE Conference on Computer Vision and Pattern Recognition, Columbus, United States, Columbus, U.S., pp.580-587, 2014.
    P. Viola and M.J. Jones, "Robust real-time face detection", International Journal of Computer Vision, Vol.57, No.2, pp.137-154, 2004.
    N. Dalal and B. Triggs, "Histograms of oriented gradients for human detection", Computer Vision and Pattern Recognition, San Diego, United States, pp.886-893, 2005.
    J. Huang, V. Rathod, C. Sun, et al., "Speed/accuracy trade-offs for modern convolutional object detectors", The IEEE Conference on Computer Vision and Pattern Recognition, Hawaii, U.S., pp.7310-7311, 2017.
    S.Q. Ren, K.M. He, R. Girshick, "Faster r-cnn:Towards real-time object detection with region proposal networks", Advances in Neural Information Processing Systems 28, Montreal, Quebec, Canada, pp.91-99, 2015.
    W. Liu, D. Anguelov D. Erhan, et al., "SSD:Single shot multiBox detector", European Conference on Computer Vision, Amsterdam, The Netherlands, pp.21-37, 2016.
    J. Redmon, S. Divvala, R. Girshick, et al., "You only look once:Unified, real-Time object detection", The IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, U.S., pp.779-788, 2016.
    J. Redmon, "YOLO:Real-time object detection", https://pjreddie.com/darknet/YOLO/, 2018-3-25.
    C.E. Perez, "Building a 50 Teraflops AMD Vega Deep Learning Box for Under $3K", https://medium.com/intuitionmachine/building-a-50-teraflops-amd-vege-deep-learning-boxfor-under-3k-ebdd60d4a93c, 2017-7-14.
    X.Y. Zhang, X.Y. Zhou, M.X. Lin, et al., "ShuffleNet:An extremely efficient convolutional neural network for mobile devices", The IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, U.S., pp.6848-6856, 2018.
    G.Y Chen, S. Krishnan, Y.J Zhao, et al., "Illumination invariant face recognition", ICIC 2013:Intelligent Computing Theories, Nanning, China, pp.385-391, 2013.
    L. Sifre and S. Mallat, "Rigid-motion scattering for image classification", Ph.D.Thesis, Ecole Polytechnique, CMAP, France, 2014.
    M. Mathias, R. Benenson, M. Pedersoli, et al., "Face detection without bells and whistles", European Conference on Computer Vision, Zurich, Switzerland, pp.720-735, 2014.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views (888) PDF downloads(1076) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return