Algorithm for load balancing in distributed control systems for large-scale data streams in real time

Authors

DOI:

https://doi.org/10.18372/2073-4751.86.21269

Keywords:

load balancing, distributed systems, data flows, real time, adaptive algorithm, delay, throughput, rebalancing

Abstract

The article analyses algorithmic methods of load balancing in distributed control systems that process large volumes of data streams under real-time constraints. As flow rates and architectural complexity increase, the operational goal is to achieve throughput and limited latency while ensuring even utilization of heterogeneous computing resources. Previous studies show that in real-time environments, misallocation of work leads to queues, long wait times, and delays in message processing. The main causes of imbalance are peak loads, rapid changes in flow intensity, concentration of events around “hot” buttons, and hardware heterogeneity in nodes. State-of-the-art load balancing strategies are systematized, including static, dynamic, and adaptive algorithms, as well as feedback-based mechanisms. The limitations of static policies in streaming settings are emphasized, and the foundations of algorithms capable of quickly responding to system state changes are established. A conceptual adaptive approach to load balancing based on online node performance indicators, including task queue occupancy, CPU load, and task execution time, is proposed. The performance of the balancing strategies is evaluated using throughput, p95 and p99 delay quantiles, node-level load uniformity, and the proportion of lost messages. The results show that adaptive load balancing improves the reliability of distributed systems by reducing the risk of congestion and stabilizing queuing delay during heavy flow processing. Further research should focus on workload prediction, context-sensitive threshold adaptation, and environment-sensitive reallocation policy design for streaming manufacturing platforms.

References

Вовченко Д. С., Олещенко Л. М. Аналіз алгоритмів балансування вузлів в програмних системах розподіленої обробки великих даних. Вчені записки ТНУ імені В. І. Вернадського. Серія: Технічні науки. 2025. Т. 36 (75), № 2. DOI: https://doi.org/10.32782/2663-5941/2025.2.2/06

Складанний П. М., Костюк Ю. В., Рзаєва С. Л., Мазур Н. П. Паралельна обробка даних у розширювальних хеш-структурах та оцінка їх продуктивності. Кібербезпека: освіта, наука, техніка. 2025. № 3 (31). С. 243–269. DOI: https://doi.org/10.28925/2663-4023.2025.31.1015

Sun X. Dynamic distributed scheduling for data stream computing: balancing task delay and load efficiency. Journal of Computer Technology and Software. 2025. Vol. 4, № 1. DOI: https://doi.org/10.5281/zenodo.14785261

Singh N., Hamid Y., Juneja S. Load balancing and service discovery using Docker Swarm for microservice based big data applications. Journal of Cloud Computing. 2023. Vol. 12. Art. 4. DOI: https://doi.org/10.1186/s13677-022-00358-7

Pasham S. D. Graph-Based Algorithms for Optimizing Data Flow in Distributed Cloud Architectures. International Journal of Acta Informatica. 2022. Vol. 1, № 1. P. 67–95. URL: https://www.yuktabpublisher.com/index.php/IJAI/article/view/173/129

Wei C., Hou J., Ma D., Zhao J., Sun Y. Design and implementation of a TCP long connection load balancing algorithm based on negative feedback mechanism. Journal of Physics: Conference Series. 2020. Vol. 1659. Art. 012001. DOI: https://doi.org/10.1088/1742-6596/1659/1/012001

Kashani M. H., Mahdipour E. Load balancing algorithms in fog computing. IEEE Transactions on Services Computing. 2022. Vol. 16, № 2. P. 1505–1521. DOI: https://doi.org/10.1109/TSC.2022.3174475

Enjam G. R. Energy-Efficient Load Balancing in Distributed Insurance Systems Using AI-Optimized Switching Techniques. International Journal of Artificial Intelligence, Data Science, and Machine Learning. 2022. Vol. 3, № 4. P. 68–76. DOI: https://doi.org/10.63282/3050-9262.IJAIDSML-V3I4P108

Zhang H., Jia X., Chen C. et al. Deep learning-based real-time data quality assessment and anomaly detection for large-scale distributed data streams. International Journal of Medical and All Body Health Research. 2025. Vol. 6, № 1. P. 1–11. DOI: https://doi.org/10.54660/IJMBHR.2025.6.1.01-11

Sah D. K., Nguyen T. N., Cengiz K. et al. Load-balance scheduling for intelligent sensors deployment in industrial internet of things. Cluster Computing. 2022. Vol. 25. P. 1715–1727. DOI: https://doi.org/10.1007/s10586-021-03316-1

Published

2026-05-30

How to Cite

Havrylenko, D. (2026). Algorithm for load balancing in distributed control systems for large-scale data streams in real time. Problems of Informatization and Control, 2(86), 15–22. https://doi.org/10.18372/2073-4751.86.21269

Issue

Section

Статті