Main > Laser & Optoelectronics Progress >  Volume 57 >  Issue 24 >  Page 241008 > Article
  • Abstract
  • Abstract
  • View Summary
  • Figures (13)
  • Tables (2)
  • Equations (0)
  • References (21)
  • Get PDF(in Chinese)
  • Paper Information
  • Received: Feb. 28, 2020

    Accepted: Jun. 9, 2020

    Posted: Dec. 1, 2020

    Published Online: Nov. 19, 2020

    The Author Email: Wang Dianwei (, Fang Haoyu (

    DOI: 10.3788/LOP57.241008

  • Get Citation
  • Copy Citation Text

    Dianwei Wang, Haoyu Fang, Ying Liu, Jing Jiang, Xincheng Ren, Zhijie Xu, Yongrui Qin. Algorithm for Panoramic Video Tracking Based on Improved SiameseRPN[J]. Laser & Optoelectronics Progress, 2020, 57(24): 241008

    Download Citation

  • Category
  • Image Processing
  • Share
Laser & Optoelectronics Progress, Vol. 57, Issue 24, 241008 (2020)

Algorithm for Panoramic Video Tracking Based on Improved SiameseRPN

Wang Dianwei1,*, Fang Haoyu1,*, Liu Ying1, Jiang Jing1, Ren Xincheng2, Xu Zhijie3, and Qin Yongrui3

Author Affiliations

  • 1School of Communications and Information Engineering, Xi'an University of Posts and Telecommunications, Xi'an, Shaanxi 710121, China
  • 2School of Physics and Electronic Information, Yan'an University, Yan'an, Shaanxi 716000, China
  • 3School of Computing and Engineering, University of Huddersfield, Huddersfield HD1 3DH UK


The target tracking algorithm is suffering from low accuracy and poor applicability due to the complex lighting conditions and severe changes in scale caused by the relative lens movement during panoramic video target tracking. To address this issue, we propose an algorithm for panoramic target tracking based on the improved SiameseRPN. First, the network structure of MobileNetV3 is used to extract the deep features to make the algorithm have a better adaptability to the scene defects appearing in panoramic video sequences, and the Squeeze and Excite module is used to increase the sensitivity of the network to feature selection. Then, we propose and construct a feature fusion module based on bilinear interpolation, which is used to make the output depth features of the last three layers have the same scale, and these three layers of features are fused for network prediction. Finally, we use a classification sequence to predict the positive and negative samples in the current sequence, and adopt a regression branch to predict the position information and scale information of current output targets. Thus the target position information is outputted. The experimental results show that the proposed algorithm has better tracking accuracy and it can effectively deal with the problems of poor local image quality and scale changes in panoramic data, while maintaining the real-time tracking performance. It shows a good adaptability to small targets, target occlusion, and multi-target cross movements in target tracking, and has good visual effects and high tracking scores.


Please Enter Your Email: