Explain to Gain: Introspective Reinforcement Learning for Enhanced Performance

Santiago Quintana-Amate, Delaney Stevens, Varniethan Ketheeswaran, Patrick Capaldo, Dylan Sheldon, Mark Hall

Research output: Chapter or section in a book/report/conference proceedingChapter in a published conference proceeding

Abstract

This article presents a new method for leveraging explainable reinforcement learning (XRL) knowledge to enhance the performance of reinforcement learning (RL) agents. Although current XRL approaches are mainly focused on improving interpretability and user trust by providing explanations for agent actions, their ability to guide and optimise RL agent’s training is under-explored. To address this gap, we extend an existing introspective analysis framework by integrating XRL metrics directly into the training pipelines of model-free RL algorithms. This integration allows dynamic adjustments of algorithm-specific parameters based on real-time feedback from XRL metrics. The proposed methodology is validated across diverse OpenAI Gym environments (CartPole and Taxi). By evaluating both on-policy and off-policy approaches, we demonstrate that incorporating XRL insights leads to significant improvements in agent performance. The analysis of the results highlights the benefits regarding enhanced explainability and optimised decision-making. This work contributes in XRL research area by aligning interpretability with actionable performance gains, paving the way for more reliable and transparent RL systems in complex, real-world applications.

Original languageEnglish
Title of host publicationExplainable Artificial Intelligence - 3rd World Conference, xAI 2025, Proceedings
EditorsRiccardo Guidotti, Ute Schmid, Luca Longo
Place of PublicationCham, Switzerland
PublisherSpringer
Pages247-270
Number of pages24
ISBN (Electronic)9783032083241
ISBN (Print)9783032083234
DOIs
Publication statusAcceptance date - 31 Jul 2009
Externally publishedYes
Event3rd World Conference on Explainable Artificial Intelligence, xAI 2025 - Istanbul, Turkey
Duration: 9 Jul 202511 Jul 2025

Publication series

NameCommunications in Computer and Information Science
Volume2577 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference3rd World Conference on Explainable Artificial Intelligence, xAI 2025
Country/TerritoryTurkey
CityIstanbul
Period9/07/2511/07/25

Keywords

  • Dynamic Algorithm Adjustment
  • Explainability Metrics
  • Explainable autonomous agents
  • Explainable Reinforcement Learning (XRL)
  • Exploration-Exploitation Trade-off
  • Introspective Reinforcement Learning (IxDRL)

ASJC Scopus subject areas

  • General Computer Science
  • General Mathematics

Fingerprint

Dive into the research topics of 'Explain to Gain: Introspective Reinforcement Learning for Enhanced Performance'. Together they form a unique fingerprint.

Cite this