Abstract
Reinforcement learning has achieved remarkable success in complex decision-making environments, yet its lack of transparency limits its deployment in practice, especially in safety-critical settings. Shapley values from cooperative game theory provide a principled framework for explaining reinforcement learning; however, the computational cost of Shapley explanations is an obstacle to their use. We introduce FastSVERL, a scalable method for explaining reinforcement learning by approximating Shapley values. FastSVERL is designed to handle the unique challenges of reinforcement learning, including temporal dependencies across multi-step trajectories, learning from off-policy data, and adapting to evolving agent behaviours in real time. FastSVERL introduces a practical, scalable approach for principled and rigorous interpretability in reinforcement learning.
| Original language | English |
|---|---|
| Journal | Advances in Neural Information Processing Systems |
| Early online date | 3 Dec 2025 |
| Publication status | Published - 3 Dec 2025 |
Bibliographical note
Camera-ready version. Published at the Conference on Neural Information Processing Systems (NeurIPS 2025)Keywords
- cs.LG
Fingerprint
Dive into the research topics of 'Approximating Shapley Explanations in Reinforcement Learning'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS