Analysis of modern computer vision methods for monitoring and identifying flying objects: challenges and prospects
DOI:
https://doi.org/10.18372/2073-4751.86.21283Keywords:
Computer vision, airspace monitoring, neural networks, deep learning, flying object identificationAbstract
This review article presents a comprehensive analysis of modern software and algorithmic computer vision methods used for monitoring and identifying flying objects in airspace. The relevance of the study is driven by the rapid development of unmanned aerial vehicles and the growing need for efficient, affordable, and highly accurate surveillance systems capable of operating under uncertainty, limited resources, and complex backgrounds. The classification of the main computer vision tasks is considered, including object detection, classification and identification, localization and tracking, as well as estimation of the spatial position and behavioral characteristics of targets. The evolution of detection methods from traditional approaches to modern deep learning architectures is analyzed, including convolutional neural networks, one-stage and two-stage detectors, visual transformers, and specialized approaches for detecting small-sized objects. Particular attention is paid to the problem of distinguishing birds from drones, multi-object tracking methods, data association algorithms, and the robustness of systems against adversarial attacks. A comparative analysis of the effectiveness of computer vision methods is also carried out.
References
Taha, B., & Shoufan, A. Machine learning-based drone detection and classification: State-of-the-art in research. IEEE Access, 2019, Vol. 7., p. 138669 - 138682. DOI: 10.1109/ACCESS.2019.2942944
Coluccia, A., Fascista, A., Schumann, A., et al. (2021). Drone vs. bird detection: Deep learning algorithms and results from a grand challenge. Sensors, 21(8).
Smyers E., Katz S., Corso A., Kochenderfer M. AVOIDDS: Aircraft Vision-based Intruder Detection Dataset and Simulator. URL: https://arxiv.org/abs/2306.11203
Girshick R., Donahue J., Darrell T., MalikRich J. feature hierarchies for accurate object detection and semantic segmentation. URL: https://arxiv.org/abs/1311.2524
J. Redmon, S. Divvala, R. Girshick and A. Farhadi, "You Only Look Once: Unified, Real-Time Object Detection," 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 779-788, DOI: 10.1109/CVPR.2016.91
A. Coluccia et al., "Drone-vs-Bird detection challenge at IEEE AVSS2017," 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Lecce, Italy, 2017, pp. 1-6, doi: 10.1109/AVSS.2017.8078464
Thrampoulidis C., Oymak S., Soltanolkotabi M. Theoretical Insights Into Multiclass Classification: A High-dimensional Asymptotic View. URL: https://arxiv.org/abs/2011.07729
M. Nalamati, A. Kapoor, M. Saqib, N. Sharma and M. Blumenstein, "Drone Detection in Long-Range Surveillance Videos," 2019 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Taipei, Taiwan, 2019, pp. 1-6, doi: 10.1109/AVSS.2019.8909830
Zhu P., Wen L., Du D., Bian X., Fan H., Hu Q., Ling H. Detection and Tracking Meet Drones Challenge. URL: https://arxiv.org/abs/2001.06303
Khanpour A., Wang T., Vahidi-Shams A., Ectors W., Nakhaie F., Taheri A., Claudel C. UAV-Based Intelligent Traffic Surveillance System: Real-Time Vehicle Detection, Classification, Tracking, and Behavioral Analysis. URL: https://arxiv.org/abs/2509.04624
Kaur D., Battish N., Bhavsar A., PoddarS. YOLOBirDrone: Dataset for Bird vs Drone Detection and Classification and a YOLO based enhanced learning architecture. URL: https://arxiv.org/abs/2601.08319
C. Aker and S. Kalkan, "Using deep networks for drone detection," 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Lecce, Italy, 2017, pp. 1-6, doi: 10.1109/AVSS.2017.8078539.
Xi H, Ru L, Tian J, Wang W, Zhu R, Li S, Zhang Z, Liu L, Luan X. Towards Robust Physical Adversarial Attacks on UAV Object Detection: A Multi-Dimensional Feature Optimization Approach. Machines. 2025; 13(11):1060. https://doi.org/10.3390/machines13111060
N. Dalal and B. Triggs, "Histograms of oriented gradients for human detection," 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), San Diego, CA, USA, 2005, pp. 886-893 vol. 1, doi: 10.1109/CVPR.2005.177.
T. -Y. Lin, P. Goyal, R. Girshick, K. He and P. Dollár, "Focal Loss for Dense Object Detection," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 42, no. 2, pp. 318-327, 1 Feb. 2020, doi: 10.1109/TPAMI.2018.2858826.
Varghese, R., & M., S. (2024). YOLOv8: A Novel Object Detection Algorithm with Enhanced Performance and Robustness. In 2024 International Conference on Advances in Data Engineering and Intelligent Computing Systems (ADICS) (pp. 1-6).
Ren, S., He, K., Girshick, R., & Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. Advances in Neural Information Processing Systems, Vol. 28. Curran Associates, Inc. 2015.
Yang M. Visual Transformer for Object Detection. URL: https://arxiv.org/abs/2206.06323
Shah, S., Tembhurne, J. Object detection using convolutional neural networks and transformer-based models: a review. Journal of Electrical Systems and Inf Technol 10, 54 (2023). https://doi.org/10.1186/s43067-023-00123-z
M. Ali A., Benjdira B., Koubaa A., El-Shafai W., Khan Z., Boulila W. Vision Transformers in Image Restoration: A Survey. https://www.researchgate.net/publication/368691069_Vision_Transformers_in_Image_Restoration_A_Survey
Brais Bosquet, Manuel Mucientes, & Víctor M. Brea (2021). STDnet-ST: Spatio-temporal ConvNet for small object detection. Pattern Recognition, 116, 107929.
T. -Y. Lin, P. Dollár, R. Girshick, K. He, B. Hariharan and S. Belongie, "Feature Pyramid Networks for Object Detection," 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 2017, pp. 936-944, doi: 10.1109/CVPR.2017.106.
Xiao, Y., Li, Z., & Zhang, Z. (2024). Enhancing Image Perception Quality: Exploring Loss Function Variants in SRCNN. In Proceedings of the 2024 7th International Conference on Computer Information Science and Artificial Intelligence (pp. 352–356). Association for Computing Machinery.
Zhang, H., Hao, C., Song, W., Jiang, B., & Li, B. (2023). Adaptive Slicing-Aided Hyper Inference for Small Object Detection in High-Resolution Remote Sensing Images. Remote Sensing, 15(5).
Nassi, B., Mirsky, Y., Nassi, D., Ben-Netanel, R., Drokin, O., & Elovici, Y. (2020). Phantom of the ADAS: Securing Advanced Driver-Assistance Systems from Split-Second Phantom Attacks. In Proceedings of the 2020 ACM SIGSAC Conference on Computer and Communications Security (pp. 293–308). Association for Computing Machinery.
Tan, M., & Le, Q. (2019). EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. In Proceedings of the 36th International Conference on Machine Learning (pp. 6105–6114). PMLR.
M. Sandler, A. Howard, M. Zhu, A. Zhmoginov and L. -C. Chen, "MobileNetV2: Inverted Residuals and Linear Bottlenecks," 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 2018, pp. 4510-4520, doi: 10.1109/CVPR.2018.00474.
Koch G., Zemel R. Siamese Neural Networks for One-shot Image Recognition. URL: https://www.cs.cmu.edu/~rsalakhu/papers/oneshot1.pdf
Nesteruk, A., Nikitin, V., Albrekht, Y., Ścisło, Ł., Grela, D., & Król, P. (2026). Distinguishing a Drone from Birds Based on Trajectory Movement and Deep Learning. Sensors, 26(3).
Khan, S., Naseer, M., Hayat, M., Zamir, S., Khan, F., & Shah, M. (2022). Transformers in Vision: A Survey. ACM Comput. Surv., 54(10s).
Y. Wu, J. Lim and M. -H. Yang, "Online Object Tracking: A Benchmark," 2013 IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, 2013, pp. 2411-2418, doi: 10.1109/CVPR.2013.312.
M. S. Arulampalam, S. Maskell, N. Gordon and T. Clapp, "A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking," in IEEE Transactions on Signal Processing, vol. 50, no. 2, pp. 174-188, Feb. 2002, doi: 10.1109/78.978374.
Henriques, J., Caseiro, R., Martins, P., & Batista, J. (2015). High-Speed Tracking with Kernelized Correlation Filters. IEEE Trans. Pattern Anal. Mach. Intell., 37(3), 583–596.
N. Wojke, A. Bewley and D. Paulus, "Simple online and realtime tracking with a deep association metric," 2017 IEEE International Conference on Image Processing (ICIP), Beijing, China, 2017, pp. 3645-3649, doi: 10.1109/ICIP.2017.8296962.
Zhang, Y., Sun, P., Jiang, Y., Yu, D., Weng, F., Yuan, Z., Luo, P., Liu, W., & Wang, X. (2022). ByteTrack: Multi-object Tracking by Associating Every Detection Box. In Computer Vision – ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXII (pp. 1–21). Springer-Verlag.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
The scientific journal adheres to the principles of Open Access and provides free, immediate, and permanent access to all published materials without financial, technical, or legal barriers for readers.
All articles are published in Open Access under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.
Copyright
Authors who publish their works in the journal:
-
retain the copyright to their publications;
-
grant the journal the right of first publication of the article;
-
agree to the distribution of their materials under the CC BY 4.0 license;
-
have the right to reuse, archive, and distribute their works (including in institutional and subject repositories), provided that proper reference is made to the original publication in the journal.




