Explainable AI needs formal notions of explanation correctness

Haufe, Stefan; Wilming, Rick; Clark, Benedict; Zhumagambetov, Rustam; Panknin, Danny; Boubekki, Ahcène

Computer Science > Machine Learning

arXiv:2409.14590 (cs)

[Submitted on 22 Sep 2024 (v1), last revised 23 Nov 2024 (this version, v3)]

Title:Explainable AI needs formal notions of explanation correctness

Authors:Stefan Haufe, Rick Wilming, Benedict Clark, Rustam Zhumagambetov, Danny Panknin, Ahcène Boubekki

View PDF HTML (experimental)

Abstract:The use of machine learning (ML) in critical domains such as medicine poses risks and requires regulation. One requirement is that decisions of ML systems in high-risk applications should be human-understandable. The field of "explainable artificial intelligence" (XAI) seemingly addresses this need. However, in its current form, XAI is unfit to provide quality control for ML; it itself needs scrutiny. Popular XAI methods cannot reliably answer important questions about ML models, their training data, or a given test input. We recapitulate results demonstrating that popular XAI methods systematically attribute importance to input features that are independent of the prediction target. This limits their utility for purposes such as model and data (in)validation, model improvement, and scientific discovery. We argue that the fundamental reason for this limitation is that current XAI methods do not address well-defined problems and are not evaluated against objective criteria of explanation correctness. Researchers should formally define the problems they intend to solve first and then design methods accordingly. This will lead to notions of explanation correctness that can be theoretically verified and objective metrics of explanation performance that can be assessed using ground-truth data.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2409.14590 [cs.LG]
	(or arXiv:2409.14590v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2409.14590

Submission history

From: Stefan Haufe [view email]
[v1] Sun, 22 Sep 2024 20:47:04 UTC (840 KB)
[v2] Thu, 26 Sep 2024 12:29:45 UTC (840 KB)
[v3] Sat, 23 Nov 2024 23:02:49 UTC (819 KB)

Computer Science > Machine Learning

Title:Explainable AI needs formal notions of explanation correctness

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Explainable AI needs formal notions of explanation correctness

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators