Interface architecture of a modular video analysis platform for autonomous uav control
DOI:
https://doi.org/10.18372/2073-4751.86.21278Keywords:
UAV, computer vision, modular architecture, video analysis, MAVLink, MAVSDK, software interface, object trackingAbstract
The purpose of this work is to present the interface architecture of a modular software platform for video analysis in autonomous UAV control systems, with emphasis on extensibility and component replaceability.
The platform is organized around three independent modularity levels. The first is the video source contract (CameraSource): regardless of the camera type, each source runs a background capture thread and writes frames to a shared buffer. Three implementations are available – USBCameraSource via OpenCV VideoCapture, PiCameraSource using the Picamera2 library for Raspberry Pi CSI cameras (including infrared models), and VideoFileSource for offline development with recorded footage. Switching between them requires changing a single line in the server configuration. The second level is the frame processing contract (FrameProcessor), built on duck typing rather than formal inheritance. A researcher implements one mandatory method – photo_processing(frame) – which receives the current BGR frame as a NumPy array. Three optional methods extend the interaction: catch_cmd(roi) is called when the operator clicks on the video stream to select a region of interest; customize_processing(id) responds to one of four control panel buttons, enabling in-flight algorithm parameterization; getPreparedVector() closes the autonomous control loop by returning velocity components vx, vy and execution time t. The third level covers flight controller connectivity: the FlightController class wraps MAVSDK and accepts a MAVLink connection string as its only configuration parameter, enabling transparent switching between Pixhawk, SpeedyBee, Matek hardware and the SITL software simulator. Four FrameProcessor implementations – a CSRT tracker with SIFT initialization, SIFT homography-based position holding, YOLO object detection, and a minimal template – were integrated and validated on the platform, each implemented as a single class without touching any other component.
The proposed architecture lowers the barrier for integrating new video processing algorithms to implementing a minimal class interface. A researcher working with the platform needs to focus only on the algorithm itself – camera handling, telemetry logging, and flight command execution are already provided. This makes the platform a practical and reproducible baseline for vision-based UAV autonomy research.
References
Prystavka P., Cholyshkina O. Estimation of the aircraft's position based on optical channel data. CEUR Workshop Proceedings. 2024. Vol. 3925. P. 93-105.
Ruzhentsev N. et al. Radio-heat contrasts of UAVs and their weather variability at 12 GHz, 20 GHz, 34 GHz, and 94 GHz frequencies. ECTI Transactions on Electrical Engineering, Electronics, and Communications. 2022. Vol. 20. P. 163-173. DOI: 10.37936/ecti-eec.2022202.246878
Liu S., Li X., Lu H., He Y. Multi-object tracking meets moving UAV. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2022. P. 8866-8875. DOI: 10.1109/CVPR52688.2022.00867
Mohsan S.A.H. et al. Unmanned aerial vehicles (UAVs): practical aspects, applications, open challenges, security issues, and future trends. Intelligent Service Robotics. 2023. Vol. 16. P. 109-137. DOI: 10.1007/s11370-022-00452-4
Hong T. et al. A real-time tracking algorithm for multi-target UAV based on deep learning. Remote Sensing. 2023. Vol. 15, No. 1. DOI: 10.3390/rs15010002
Arafat M.Y., Alam M.M., Moh S.
Vision-based navigation techniques for unmanned aerial vehicles: Review and challenges. Drones. 2023. Vol. 7. DOI: 10.3390/drones7020089
Koubaa A. et al. Micro Air Vehicle Link (MAVLink) in a nutshell: A survey. IEEE Access. 2019. Vol. 7. P. 87658-87680. DOI: 10.1109/ACCESS.2019.2924410
Ortega L.D., Olivares-Mendez M.A., Campoy P. Low-cost computer-vision-based embedded systems for UAVs. Robotics. 2023. Vol. 12. DOI: 10.3390/robotics12060145
Lukash Y., Prystavka P. A research platform for vision-based UAV autonomy: Architecture and implementation. CEUR Workshop Proceedings. 2025. Vol. 4024. P. 250-259.
Жуков І., Лукаш Ю. Використання алгоритму трекінгу об’єкту по відеозображенню для реалізації автономної функції слідування за ціллю для БПЛА. Проблеми інформатизації та управління. 2024. Т. 2, № 78. С. 14-17. DOI: 10.18372/2073-4751.78.18956
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
The scientific journal adheres to the principles of Open Access and provides free, immediate, and permanent access to all published materials without financial, technical, or legal barriers for readers.
All articles are published in Open Access under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.
Copyright
Authors who publish their works in the journal:
-
retain the copyright to their publications;
-
grant the journal the right of first publication of the article;
-
agree to the distribution of their materials under the CC BY 4.0 license;
-
have the right to reuse, archive, and distribute their works (including in institutional and subject repositories), provided that proper reference is made to the original publication in the journal.




