IMPLEMENTATION OF ARTIFICIAL INTELLIGENCE INTO THE INCIDENT MANAGEMENT PROCESS
DOI:
https://doi.org/10.32689/maup.it.2024.4.9Keywords:
AIOps, incident management, Artificial Intelligence, ITSM, process implementationAbstract
The rapid advancement of technology necessitates the transformation of IT Service Management (ITSM) practices. This article examines the integration of Artificial Intelligence for IT Operations (AIOps) and intelligent automation within ITSM, focusing on their impact on incident management, operational efficiency, and overall service quality. Based on a review of recent literature and case studies, the article aims to provide insights into the benefits, challenges, and future directions of these technologies in improving IT operations. The findings highlight the significant potential of AIOps and intelligent automation to enhance IT service management efficiency; however, their implementation requires careful planning and consideration of specific factors. The proposed implementation methodology can be widely utilized by organizations to further advance AIOps adoption. The goal of the study is to develop a methodology for integrating AI-driven automations into an organization’s technical and business processes. The tasks required to achieve this include researching existing approaches to the classification and application of AI automations in the field, analyzing current systems and implementation experiences, and describing the methodology for integrating AI automations. The methodology includes a review of literature and an analysis of existing use cases for automation in processes. Using system analysis tools, a methodology for integrating such automations into business processes has been developed. The scientific novelty lies in the adaptation of cutting-edge approaches in incident management to the current processes of technology-driven organizations. Conclusions. This study proposes a step-by-step methodology for implementing AI-based automation in business processes through the deployment of AIOps infrastructure. Applying the developed methodology and conducting further case studies to identify potential organizational process nuances represent promising directions for future research.
References
Databricks. Effective AIOps with Open Source Software in a Week. Youtube, 2021. URL: https://www.youtube.com/watch?v=NuL1u_CIkQw (дата звернення: 20.11.2024).
Diachenko Maksym, Roskladka Andrii. 2023. Approaches in managing IT services and implementation of AI automations. X INTERNATIONAL CONFERENCE Information Technology and Implementation (Satellite).C. 233.
Google Cloud Tech. Build an AIOps platform at enterprise scale with Google Cloud. Youtube. 2023 URL: https://www.youtube.com/watch?v=UdVaexipP6w (дата звернення: 20.11.2024).
Ira Cohen, Jeffrey S Chase, Moises Goldszmidt, Terence Kelly, and Julie Symons. 2004. Correlating Instrumentation Data to System States: A Building Block for Automated Diagnosis and Control. In OSDI, Vol. 4. 16–16. URL: https://dl.acm.org/doi/10.5555/1251254.1251270 (дата звернення: 20.11.2024).
Jiaxi Xu, Fei Wang, Jun Ai. Defect prediction with semantics and context features of codes based on graph representation learning. IEEE Transactions on Reliability 70. 2020. 613–625. URL: https://doi.org/10.1109/TR.2020.3040191 (дата звернення: 20.11.2024).
Junjie Chen, Shu Zhang, Xiaoting He, Qingwei Lin, Hongyu Zhang, Dan Hao, Yu Kang, Feng Gao, Zhangwei Xu, Yingnong Dang, et al. 2020. How incidental are the incidents? characterizing and prioritizing incidents for largescale online service systems. In Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering. C. 373–384. URL: https://doi.org/10.1145/3324884.3416624 (дата звернення: 20.11.2024).
Junjie Chen, Xiaoting He, Qingwei Lin, Hongyu Zhang, Dan Hao, Feng Gao, Zhangwei Xu, Yingnong Dang, Dongmei Zhang. 2019. Continuous incident triage for large-scale online service systems. 34th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE. C. 364–375. URL: https://doi.org/10.1109/ASE.2019.00042 (дата звернення: 20.11.2024).
Justin Meza, Qiang Wu,Sanjev Kumar, and Onur Mutlu. 2015. A large-scale studyof flash memory failures in the field. ACM SIGMETRICS Performance Evaluation Review 43, 1. C.177–190. URL: https://doi.org/10.1145/2796314.2745848 (дата звернення: 20.11.2024).
Kasem Khalil, Omar Eldash, Ashok Kumar, Magdy Bayoumi. 2020. Machine learning-based approach for hardware faults prediction. IEEE Transactions on Circuits and Systems I: Regular Papers 67, 11. C. 3880–3892. URL: https://doi.org/10.1109/TCSI.2020.3010743 (дата звернення: 20.11.2024).
Md Nasir Uddin, Bixin Li, Zafar Ali, Pavlos Kefalas, Inayat Khan, Islam Zada. 2022. Software defect prediction employing BiLSTM and BERT-based semantic feature. Soft Computing 26, 16. C.7877–7891. URL: https://doi.org/10.1007/s00500-022-06830-5 (дата звернення: 20.11.2024).
Nagappan Nachiappan, Ball Thomas, Zeller Andreas. 2006. Mining metrics to predict component failures. In Proceedings of the 28th international conference on Software engineering. C. 452–461. URL: http://dx.doi.org/10.1145/1134349 (дата звернення: 20.11.2024).
Nengwen Zhao, Junjie Chen, Zhou Wang, Xiao Peng, Gang Wang, Yong Wu, Fang Zhou, Zhen Feng, Xiaohui Nie, Wenchi Zhang, et al. 2020. Real-time incident prediction for online service systems. In Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering.C. 315–326. URL: https://doi.org/10.1145/3368089.3409672 (дата звернення: 20.11.2024).
Qingwei Lin, Ken Hsieh, Yingnong Dang, Hongyu Zhang, Kaixin Sui, Yong Xu, Jian-Guang Lou, Chenggang Li, Youjiang Wu, Randolph Yao, et al. 2018. Predicting node failure in cloud service systems. In Proceedings of the 2018 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. С. 480–490. URL: https://doi.org/10.1145/3236024.3236060 (дата звернення: 20.11.2024).
Reiter Lena. 2021. AIOps – A Systematic Literature Review. Seminar IT-Management in the Digital Age. FH Wedel, Germany. URL: https://www.fh-wedel.de/fileadmin/Mitarbeiter/Records/Reiter_2021_-_AIOps_-_A_Systematic_Literature_Review.pdf (дата звернення: 20.11.2024).
Remil Youcef, et al. 2024. Aiops solutions for incident management: Technical guidelines and a comprehensive literature review. URL: https://arxiv.org/abs/2404.01363 (дата звернення: 20.11.2024).
Sabharwal N. 2022. Hands-on AIOps. Springer.
Stephen Elliot. 2014. Dev Ops and the cost of downtime: Fortune 1000 best practice metrics quantified. International Data Corporation (IDC).
Sun-Ro Lee, Min-Jae Heo, Chan-Gun Lee, Milhan Kim, Gaeul Jeong. 2017. Applying deep learning based automatic bug triager to industrial projects. In ESEC/FSE. ACM. C.926–931. URL: https://doi.org/10.1145/3106237.3117776 (дата звернення: 20.11.2024).
Wubai Zhou, Liang Tang, Chunqiu Zeng, Tao Li, Larisa Shwartz, and Genady Ya Grabarnik. 2016. Resolution recommendation for event tickets in service management. IEEE Transactions on Network and Service Management 13, 4 (2016), 954–967. URL: https://doi.org/10.1109/INM.2015.7140303 (дата звернення: 20.11.2024).
Xianping Quand Jingjing Ha. 2017. Next generation of devops: Aiops in practice. SREcon17.
Ying Zhao, Xiang Liu, Siqing Gan, and Weimin Zheng. 2010. Predicting disk failures with HMM-and HSMM-based approaches. In Advances in Data Mining. Applications and Theoretical Aspects: 10th Industrial Conference, ICDM 2010, Berlin, Germany, July 12-14. Proceedings 10. Springer. C. 390–404. URL: https://doi.org/10.1007/978-3-642-14400-4_30 (дата звернення: 20.11.2024).
Yingnong Dang, Qingwei Lin, and Peng Huang. AIOps: real-world challenges and research innovations. In 2019 IEEE/ACM 41st International Conference on Software Engineering: Companion Proceedings (ICSE-Companion). IEEE, 2019. C. 4–5. URL: https://doi.org/10.1109/ICSE-Companion.2019.00023 (дата звернення: 20.11.2024).