Knowledge-based Intelligent System for IT Incident DevOps

Salman Ahmed, Muskaan Singh, Brendan Doherty, Effirul Ramlan, Kathryn Harkin, Magda Bucholc, Damien Coyle

Research output: Chapter or section in a book/report/conference proceedingChapter in a published conference proceeding

2 Citations (SciVal)

Abstract

The automation of IT incident management (i.e., handling of any unusual events that hamper the quality of IT services) is a main focus in Artificial Intelligence for IT Operations (AIOPS). The success and reputation of large-scale firms depend on their customer service and helpdesk system. These systems tend to handle client requests and track customer service agent interactions. In this research, we present a complete knowledge-based system that automates two core components of IT incident service management (ITSM): (1) Ticket Assignment Group(TAG) and (2) Incident Resolution (IR). Our proposed system bypasses the 4 core steps of the traditional ITSM process, including data investigation, event correlation, situation room collaboration, and probable root cause. It provides immediate solutions that can save companies key performance indicator(KPIs) resources and reduce the mean time to resolution (MTTR). The experiment used an industrial, real-time ITSM dataset from a prominent IT organization comprising 500,000 real-time incident descriptions with encoded labels. Furthermore, our systems are then evaluated with an open-source dataset. Compared to the existing benchmark methodologies, there is a 5 % improvement in terms of Accuracy score. The study demonstrates AI automation capabilities in incident handling (TAG and IR) for large real- world IT systems.

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE/ACM International Workshop on Cloud Intelligence and AIOps, AIOps 2023
PublisherIEEE
Pages1-7
Number of pages7
ISBN (Electronic)9798350323740
DOIs
Publication statusPublished - 25 Jul 2023
Event2023 IEEE/ACM International Workshop on Cloud Intelligence and AIOps, AIOps 2023 - Melbourne, Australia
Duration: 15 May 2023 → …

Publication series

NameProceedings - 2023 IEEE/ACM International Workshop on Cloud Intelligence and AIOps, AIOps 2023

Conference

Conference2023 IEEE/ACM International Workshop on Cloud Intelligence and AIOps, AIOps 2023
Country/TerritoryAustralia
CityMelbourne
Period15/05/23 → …

Bibliographical note

Funding Information:
VII. ACKNOWLEDGMENT We are grateful for access to the Tier 2 High-Performance Computing resources provided by the Northern Ireland High- Performance Computing (NI-HPC) facility funded by the UK Engineering and Physical Sciences Research Council (EP-SRC), Grant Nos. EP/T022175/ and EP/W03204X/1. Damien Coyle is supported by the UKRI Turing AI Fellowship 2021-2025 funded by the EPSRC (grant number EP/V025724/1). Salman Ahmed is supported by a Dr. George Moore Ph.D. scholarship.

Funding

VII. ACKNOWLEDGMENT We are grateful for access to the Tier 2 High-Performance Computing resources provided by the Northern Ireland High- Performance Computing (NI-HPC) facility funded by the UK Engineering and Physical Sciences Research Council (EP-SRC), Grant Nos. EP/T022175/ and EP/W03204X/1. Damien Coyle is supported by the UKRI Turing AI Fellowship 2021-2025 funded by the EPSRC (grant number EP/V025724/1). Salman Ahmed is supported by a Dr. George Moore Ph.D. scholarship.

Keywords

  • Artificial Intelligence for IT Operations (AIOPS)
  • Assignment Group
  • Dataset Imbalance
  • Information Technology Infrastructure Library (ITIL)
  • IT Incidents
  • IT Service Management (ITSM)
  • Risk prediction
  • Text Resolution

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Safety, Risk, Reliability and Quality

Fingerprint

Dive into the research topics of 'Knowledge-based Intelligent System for IT Incident DevOps'. Together they form a unique fingerprint.

Cite this