Computer Vision for UAV-based Reconnaissance under Conditions of Modern Warfare
DOI:
https://doi.org/10.18372/1990-5548.86.20560Keywords:
unmanned aerial vehicle, computer vision, deep learning, object detection, military reconnaissance, neural networksAbstract
This paper considers the application of computer vision and deep learning methods for automated aerial reconnaissance using unmanned aerial vehicles under the conditions of modern warfare. The main classes of reconnaissance objects are analyzed, including military vehicles, fortifications, artillery positions, and groups of personnel. An approach to building an object detection system based on deep neural networks is proposed, in particular using YOLO-type detectors and U-Net segmentation models. The process of data preparation and augmentation with consideration of combat factors (smoke, explosions, low illumination, image shift, and noise) is described. An experimental evaluation of object detection quality under different scenarios is performed. It is shown that the use of specially adapted augmentation significantly increases the robustness of the models to interference. The limitations of the proposed approach and directions for further research are discussed.
References
J. Redmon and A. Farhadi, “YOLOv3: An Incremental Improvement,” arXiv preprint arXiv:1804.02767, 2018. https://arxiv.org/abs/1804.02767
A. Bochkovskiy, C.-Y. Wang, and H.-Y. M. Liao, “YOLOv4: Optimal Speed and Accuracy of Object Detection,” arXiv preprint arXiv:2004.10934, 2020. https://arxiv.org/abs/2004.10934
O. Ronneberger, P. Fischer, and T. Brox, “U-Net: Convolutional Networks for Biomedical Image Segmentation,” Proc. MICCAI, 2015. https://arxiv.org/abs/1505.04597 https://doi.org/10.1007/978-3-319-24574-4_28
T.-Y. Lin, P. Goyal, R. Girshick, K. He, and P. Dollár, “Focal Loss for Dense Object Detection,” IEEE International Conference on Computer Vision (ICCV), 2017. https://arxiv.org/abs/1708.02002 https://doi.org/10.1109/ICCV.2017.324
M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman, “The PASCAL Visual Object Classes (VOC) Challenge,” International Journal of Computer Vision, vol. 88, no. 2, pp. 303–338, 2010. https://link.springer.com/article/10.1007/s11263-009-0275-4
G.-S. Xia, J. Hu, F. Hu, X. Shi, X. Bai, Y. Zhong, L. Zhang, and X. Lu, “DOTA: A Large-Scale Dataset for Object Detection in Aerial Images,” Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018. https://arxiv.org/abs/1711.10398 https://doi.org/10.1109/CVPR.2018.00418
D. Lam, et al., “xView: Objects in Context in Overhead Imagery,” Proc. IEEE CVPR, 2018. https://arxiv.org/abs/1802.07856
S. Shah, D. Dey, C. Lovett, and A. Kapoor, “AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles,” Proc. IEEE CVPR, 2017. https://arxiv.org/abs/1705.05065 https://doi.org/10.1007/978-3-319-67361-5_40
A. G. Howard, et al., “MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications,” arXiv preprint arXiv:1704.04861, 2017. https://arxiv.org/abs/1704.04861
A. Garcia, A. Gordon, J. Hurst, S. Konecny, S. Kim, and K. An, “Glider: A Deep Learning Framework for Edge Inference,” arXiv preprint arXiv:2003.03119, 2020. https://arxiv.org/abs/2003.03119
C. Shorten and T. M. Khoshgoftaar, “A Survey on Image Data Augmentation for Deep Learning,” Journal of Big Data, vol. 6, no. https://journalofbigdata.springeropen.com/articles/10.1186/s40537-019-0197-0 60, 2019.
I. Goodfellow, Y. Bengio, and A. Courville, “Deep Learning,” MIT Press, Cambridge, 2016. https://www.deeplearningbook.org
Downloads
Published
How to Cite
Issue
Section
License
The scientific journal “Electronics and control systems” adheres to the principles of Open Access and provides free, immediate, and permanent access to all published materials without financial, technical, or legal barriers for readers.
All articles are published in Open Access under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.
Copyright
Authors who publish their works in the journal “Electronics and control systems”:
-
retain the copyright to their publications;
-
grant the journal the right of first publication of the article;
-
agree to the distribution of their materials under the CC BY 4.0 license;
-
have the right to reuse, archive, and distribute their works (including in institutional and subject repositories), provided that proper reference is made to the original publication in the journal.




