INCREASING THE EFFICIENCY OF CLOUD RESOURCE MANAGEMENT WITH THE HELP OF DISTRIBUTED MONITORING
DOI:
https://doi.org/10.32689/maup.it.2025.2.11Keywords:
cloud technologies, monitoring, management, parallel computing, cloud computing securityAbstract
The purpose of this work is to develop an architecture for a distributed monitoring system capable of adaptively responding to variable load, partial failures, and network degradation to improve the efficiency of cloud computing resource management.The research methodology is based on the development of a hierarchical architecture that includes monitoring agents with adaptive sampling rates, a fog layer with local ML models for load forecasting, and a central coordinator that performs global analytics and strategic management. During the study, an experimental prototype of the system was implemented based on a containerized environment using Docker, Apache Kafka, Python, Random Forest, and SHAP interpretation. A series of experiments were conducted under five scenarios: from nominal operation to stress conditions with loss of connectivity, overloads, and combined threats.Scientific novelty – for the first time, a combination of agent monitoring with a dynamic sampling rate and ML analytics at the fog level has been implemented to ensure high speed and accuracy of response in decentralized management conditions.The proposed model outperforms traditional centralized approaches in both qualitative and quantitative indicators.Conclusions. According to the results of experiments, it was found that the proposed architecture allows reducing the average system response time to events by 60–70 %, reducing telemetry data losses by 3–4 %, increasing the accuracy of load forecasting by up to 30 %, and reducing the number of erroneous actions in dynamic infrastructure conditions. In addition, the distributed nature of information processing provides a higher level of system stability by reducing fluctuations in resource allocation. The results obtained confirm the feasibility of implementing the proposed architecture in systems that require high reliability, adaptability and scalability – in particular, in edge/fog environments, industrial Internet of Things, smart data centers and critical real-time services.
References
Улізько В. О. Інтелектуальна система управління ресурсами в хмарних середовищах на основі машинного навчання. 2024. URL: https://ela.kpi.ua/handle/123456789/72021 (дата звернення: 10.06.2025).
Almutairi S., Alghanmi N., Monowar M. M. Survey of centralized and decentralized access control models in cloud computing. International Journal of Advanced Computer Science and Applications. 2021. Vol. 12, No. 2.
Barua B., Kaiser M. S. AI-driven resource allocation framework for microservices in hybrid cloud platforms. arXiv preprint. 2024. arXiv:2412.02610. https://doi.org/10.48550/arXiv.2412.02610.
Ilager S., Muralidhar R., Buyya R. Artificial intelligence (AI)-centric management of resources in modern distributed computing systems. 2020 IEEE Cloud Summit. 2020. P. 1–10. DOI: 10.1109/IEEECloudSummit48914.2020.00007.
Kotliarskyi A., Petrashenko A. Spobib pidvyshchennia efektyvnosti vykorystannia khmarnykh resursiv – A method for improving the efficiency of cloud resource usage. Computer-integrated technologies: education, science, production. 2024. No. 54. P. 125–129. https://doi.org/10.36910/6775-2524-0560-2024-54-14.
Li L., Bell J., Coppola M., Lomonaco V. Adaptive AI-based decentralized resource management in the cloud-edge continuum. arXiv preprint. 2025. arXiv:2501.15802. https://doi.org/10.48550/arXiv.2501.15802.
Marques G., Senna C., Sargento S., Carvalho L., Pereira L., Matos R. Proactive resource management for cloud of services environments. Future Generation Computer Systems. 2024. Vol. 150. P. 90–102. https://doi.org/10.1016/j.future.2023.08.005.
Moghaddam S.K., Buyya R., Ramamohanarao K. Performance-aware management of cloud resources: a taxonomy and future directions. ACM Computing Surveys (CSUR). 2019. Vol. 52, No. 4. P. 1–37. https://doi.org/10.1145/3337956.
Ravichandran N., Inaganti A. C., Muppalaneni R., Nersu S.R.K. AI-driven self-healing IT systems: automating incident detection and resolution in cloud environments. Artificial Intelligence and Machine Learning Review. 2020. Vol. 1, No. 4. P. 1–11. https://doi.org/10.69987/.
Tuli S., Gill S. S., Xu M., Garraghan P., Bahsoon R., Dustdar S., Jennings N. R. HUNTER: AI-based holistic resource management for sustainable cloud computing. Journal of Systems and Software. 2022. Vol. 184. P. 111124. https://doi.org/10.1016/j.jss.2021.111124.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Андрій КОБИЛЮК

This work is licensed under a Creative Commons Attribution 4.0 International License.