Interface architecture of a modular video analysis platform for autonomous uav control

Authors

DOI:

https://doi.org/10.18372/2073-4751.86.21278

Keywords:

UAV, computer vision, modular architecture, video analysis, MAVLink, MAVSDK, software interface, object tracking

Abstract

The purpose of this work is to present the interface architecture of a modular software platform for video analysis in autonomous UAV control systems, with emphasis on extensibility and component replaceability.
The platform is organized around three independent modularity levels. The first is the video source contract (CameraSource): regardless of the camera type, each source runs a background capture thread and writes frames to a shared buffer. Three implementations are available – USBCameraSource via OpenCV VideoCapture, PiCameraSource using the Picamera2 library for Raspberry Pi CSI cameras (including infrared models), and VideoFileSource for offline development with recorded footage. Switching between them requires changing a single line in the server configuration. The second level is the frame processing contract (FrameProcessor), built on duck typing rather than formal inheritance. A researcher implements one mandatory method – photo_processing(frame) – which receives the current BGR frame as a NumPy array. Three optional methods extend the interaction: catch_cmd(roi) is called when the operator clicks on the video stream to select a region of interest; customize_processing(id) responds to one of four control panel buttons, enabling in-flight algorithm parameterization; getPreparedVector() closes the autonomous control loop by returning velocity components vx, vy and execution time t. The third level covers flight controller connectivity: the FlightController class wraps MAVSDK and accepts a MAVLink connection string as its only configuration parameter, enabling transparent switching between Pixhawk, SpeedyBee, Matek hardware and the SITL software simulator. Four FrameProcessor implementations – a CSRT tracker with SIFT initialization, SIFT homography-based position holding, YOLO object detection, and a minimal template – were integrated and validated on the platform, each implemented as a single class without touching any other component.
The proposed architecture lowers the barrier for integrating new video processing algorithms to implementing a minimal class interface. A researcher working with the platform needs to focus only on the algorithm itself – camera handling, telemetry logging, and flight command execution are already provided. This makes the platform a practical and reproducible baseline for vision-based UAV autonomy research.

References

Prystavka P., Cholyshkina O. Estimation of the aircraft's position based on optical channel data. CEUR Workshop Proceedings. 2024. Vol. 3925. P. 93-105.

Ruzhentsev N. et al. Radio-heat contrasts of UAVs and their weather variability at 12 GHz, 20 GHz, 34 GHz, and 94 GHz frequencies. ECTI Transactions on Electrical Engineering, Electronics, and Communications. 2022. Vol. 20. P. 163-173. DOI: 10.37936/ecti-eec.2022202.246878

Liu S., Li X., Lu H., He Y. Multi-object tracking meets moving UAV. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2022. P. 8866-8875. DOI: 10.1109/CVPR52688.2022.00867

Mohsan S.A.H. et al. Unmanned aerial vehicles (UAVs): practical aspects, applications, open challenges, security issues, and future trends. Intelligent Service Robotics. 2023. Vol. 16. P. 109-137. DOI: 10.1007/s11370-022-00452-4

Hong T. et al. A real-time tracking algorithm for multi-target UAV based on deep learning. Remote Sensing. 2023. Vol. 15, No. 1. DOI: 10.3390/rs15010002

Arafat M.Y., Alam M.M., Moh S.

Vision-based navigation techniques for unmanned aerial vehicles: Review and challenges. Drones. 2023. Vol. 7. DOI: 10.3390/drones7020089

Koubaa A. et al. Micro Air Vehicle Link (MAVLink) in a nutshell: A survey. IEEE Access. 2019. Vol. 7. P. 87658-87680. DOI: 10.1109/ACCESS.2019.2924410

Ortega L.D., Olivares-Mendez M.A., Campoy P. Low-cost computer-vision-based embedded systems for UAVs. Robotics. 2023. Vol. 12. DOI: 10.3390/robotics12060145

Lukash Y., Prystavka P. A research platform for vision-based UAV autonomy: Architecture and implementation. CEUR Workshop Proceedings. 2025. Vol. 4024. P. 250-259.

Жуков І., Лукаш Ю. Використання алгоритму трекінгу об’єкту по відеозображенню для реалізації автономної функції слідування за ціллю для БПЛА. Проблеми інформатизації та управління. 2024. Т. 2, № 78. С. 14-17. DOI: 10.18372/2073-4751.78.18956

Published

2026-05-30

How to Cite

Lukash, Y. (2026). Interface architecture of a modular video analysis platform for autonomous uav control. Problems of Informatization and Control, 2(86), 89–96. https://doi.org/10.18372/2073-4751.86.21278

Issue

Section

Статті