Advanced search
1 file | 6.00 MB Add to list

Evaluating feature attribution methods in the image domain

(2024) MACHINE LEARNING. 113. p.6019-6064
Author
Organization
Project
Abstract
Feature attribution maps are a popular approach to highlight the most important pixels in an image for a given prediction of a model. Despite a recent growth in popularity and available methods, the objective evaluation of such attribution maps remains an open problem. Building on previous work in this domain, we investigate existing quality metrics and propose new variants of metrics for the evaluation of attribution maps. We confirm a recent finding that different quality metrics seem to measure different underlying properties of attribution maps, and extend this finding to a larger selection of attribution methods, quality metrics, and datasets. We also find that metric results on one dataset do not necessarily generalize to other datasets, and methods with desirable theoretical properties do not necessarily outperform computationally cheaper alternatives in practice. Based on these findings, we propose a general benchmarking approach to help guide the selection of attribution methods for a given use case. Implementations of attribution metrics and our experiments are available online (https://github.com/arnegevaert/benchmark-general-imaging).
Keywords
Explainability, Interpretability, Benchmark, Feature attribution, Saliency maps

Downloads

  • DS768.pdf
    • full text (Published version)
    • |
    • open access
    • |
    • PDF
    • |
    • 6.00 MB

Citation

Please use this url to cite or link to this publication:

MLA
Gevaert, Arne, et al. “Evaluating Feature Attribution Methods in the Image Domain.” MACHINE LEARNING, vol. 113, 2024, pp. 6019–64, doi:10.1007/s10994-024-06550-x.
APA
Gevaert, A., Rousseau, A.-J., Becker, T., Valkenborg, D., De Bie, T., & Saeys, Y. (2024). Evaluating feature attribution methods in the image domain. MACHINE LEARNING, 113, 6019–6064. https://doi.org/10.1007/s10994-024-06550-x
Chicago author-date
Gevaert, Arne, Axel-Jan Rousseau, Thijs Becker, Dirk Valkenborg, Tijl De Bie, and Yvan Saeys. 2024. “Evaluating Feature Attribution Methods in the Image Domain.” MACHINE LEARNING 113: 6019–64. https://doi.org/10.1007/s10994-024-06550-x.
Chicago author-date (all authors)
Gevaert, Arne, Axel-Jan Rousseau, Thijs Becker, Dirk Valkenborg, Tijl De Bie, and Yvan Saeys. 2024. “Evaluating Feature Attribution Methods in the Image Domain.” MACHINE LEARNING 113: 6019–6064. doi:10.1007/s10994-024-06550-x.
Vancouver
1.
Gevaert A, Rousseau A-J, Becker T, Valkenborg D, De Bie T, Saeys Y. Evaluating feature attribution methods in the image domain. MACHINE LEARNING. 2024;113:6019–64.
IEEE
[1]
A. Gevaert, A.-J. Rousseau, T. Becker, D. Valkenborg, T. De Bie, and Y. Saeys, “Evaluating feature attribution methods in the image domain,” MACHINE LEARNING, vol. 113, pp. 6019–6064, 2024.
@article{01HZS1BVAPHWG4GRM3DQVMNRWD,
  abstract     = {{Feature attribution maps are a popular approach to highlight the most important pixels in an image for a given prediction of a model. Despite a recent growth in popularity and available methods, the objective evaluation of such attribution maps remains an open problem. Building on previous work in this domain, we investigate existing quality metrics and propose new variants of metrics for the evaluation of attribution maps. We confirm a recent finding that different quality metrics seem to measure different underlying properties of attribution maps, and extend this finding to a larger selection of attribution methods, quality metrics, and datasets. We also find that metric results on one dataset do not necessarily generalize to other datasets, and methods with desirable theoretical properties do not necessarily outperform computationally cheaper alternatives in practice. Based on these findings, we propose a general benchmarking approach to help guide the selection of attribution methods for a given use case. Implementations of attribution metrics and our experiments are available online (https://github.com/arnegevaert/benchmark-general-imaging).}},
  author       = {{Gevaert, Arne and Rousseau, Axel-Jan and Becker, Thijs and Valkenborg, Dirk and De Bie, Tijl and Saeys, Yvan}},
  issn         = {{0885-6125}},
  journal      = {{MACHINE LEARNING}},
  keywords     = {{Explainability,Interpretability,Benchmark,Feature attribution,Saliency maps}},
  language     = {{eng}},
  pages        = {{6019--6064}},
  title        = {{Evaluating feature attribution methods in the image domain}},
  url          = {{http://doi.org/10.1007/s10994-024-06550-x}},
  volume       = {{113}},
  year         = {{2024}},
}

Altmetric
View in Altmetric
Web of Science
Times cited: