Abstract
Traditionally, explainable artificial intelligence seeks to provide explanation and interpretability of high-performing black-box models such as deep neural networks. Interpretation of such models remains difficult, because of their high complexity. An alternative method is to instead force a deep-neural network to use human-intelligible features as the basis for its decisions. We tested this approach using the natural category domain of rock types. We compared the performance of a black-box implementation of transfer-learning using Resnet50 to that of a network first trained to predict expert-identified features and then forced to use these features to categorise rock images. The performance of this feature-constrained network was virtually identical to that of the unconstrained network. Further, a partially constrained network forced to condense down to a small number of features that was not trained with expert features did not result in these abstracted features being intelligible; nevertheless, an affine transformation of these features could be found that aligned well with expert-intelligible features. These findings show that making an AI intrinsically intelligible need not be at the cost of performance.
Original language | English |
---|---|
Title of host publication | Procedia Computer Science |
Publisher | Elsevier |
Number of pages | 8 |
Volume | 207 |
Edition | 2022 |
DOIs | |
Publication status | Published - 26 Jan 2023 |
Event | 26th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems - Verona, Italy Duration: 7 Sept 2022 → 9 Sept 2022 Conference number: 26 http://kes2022.kesinternational.org/ |
Conference
Conference | 26th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems |
---|---|
Abbreviated title | KES-2022 |
Country/Territory | Italy |
City | Verona |
Period | 7/09/22 → 9/09/22 |
Internet address |
Keywords
- Explainable AI
- Human Factors
- Deep Neural Networks
- Self-explanatory AI
- Deep neural networks
- Transfer learning
- XAI
- Category learning
- Trustworthy AI
- Transparent AI
- Responsible AI