Abstract
Traditionally, explainable artificial intelligence seeks to provide explanation and interpretability of high-performing black-box models such as deep neural networks. Interpretation of such models remains difficult, because of their high complexity. An alternative method is to instead force a deep-neural network to use human-intelligible features as the basis for its decisions. We tested this approach using the natural category domain of rock types. We compared the performance of a black-box implementation of transfer-learning using Resnet50 to that of a network first trained to predict expert-identified features and then forced to use these features to categorise rock images. The performance of this feature-constrained network was virtually identical to that of the unconstrained network. Further, a partially constrained network forced to condense down to a small number of features that was not trained with expert features did not result in these abstracted features being intelligible; nevertheless, an affine transformation of these features could be found that aligned well with expert-intelligible features. These findings show that making an AI intrinsically intelligible need not be at the cost of performance.
| Original language | English |
|---|---|
| Title of host publication | Procedia Computer Science |
| Subtitle of host publication | 26th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems (KES 2022) |
| Editors | Matteo Cristani, Carlos Toro, Cecilia Zanni-Merk, Robert J. Howlett, Lakhmi C. Jain |
| Place of Publication | Procedia Computer Science |
| Publisher | Elsevier |
| Pages | 3600-3607 |
| Number of pages | 8 |
| Volume | 207 |
| Edition | 2022 |
| DOIs | |
| Publication status | Published - 26 Jan 2023 |
| Event | 26th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems - Verona, Italy Duration: 7 Sept 2022 → 9 Sept 2022 Conference number: 26 http://kes2022.kesinternational.org/ |
Conference
| Conference | 26th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems |
|---|---|
| Abbreviated title | KES-2022 |
| Country/Territory | Italy |
| City | Verona |
| Period | 7/09/22 → 9/09/22 |
| Internet address |
Acknowledgements
This work was funded by an internal grant from the Cardiff University Centre for Artificial Intelligence, Robotics and Human-Machine Systems (IROHMS) and conducted in the School of Psychology IROHMS Simulation Laboratory as part of the Human Factors Excellence (HuFEx) research group.Funding
This work was funded by an internal grant from the Cardiff University Centre for Artificial Intelligence, Robotics and Human-Machine Systems (IROHMS) and conducted in the School of Psychology IROHMS Simulation Laboratory as part of the Human Factors Excellence (HuFEx) research group.
Keywords
- Explainable AI
- Human Factors
- Deep Neural Networks
- Self-explanatory AI
- Deep neural networks
- Transfer learning
- XAI
- Category learning
- Trustworthy AI
- Transparent AI
- Responsible AI