Cristin result ID: 1894489
Last modified: 9 March 2021, 14:37
NVI reporting year: 2020
Academic chapter/article/conference paper

Explaining Predictive Models with Mixed Features Using Shapley Values and Conditional Inference Trees

  • Annabelle Alice Redelmeier
  • Martin Jullum
  • Kjersti Aas


Lecture Notes in Computer Science (LNCS)
  • 978-3-030-58805-2


NVI level 1


Lecture Notes in Computer Science (LNCS)
ISSN 0302-9743
e-ISSN 1611-3349
NVI level 1

About the result

Academic chapter/article/conference paper
Publication year: 2020
Issue: 12377
Pages: 117 - 137
  • 978-3-030-58805-2
Open Access


Field (NPI)

Field: ICT
- Subject area: Science and technology

Description



It is becoming increasingly important to explain complex, black-box machine learning models. Although there is an expanding literature on this topic, Shapley values stand out as a sound method to explain predictions from any type of machine learning model. The original development of Shapley values for prediction explanation relied on the assumption that the features being described were independent. This methodology was then extended to explain dependent features with an underlying continuous distribution. In this paper, we propose a method to explain mixed (i.e. continuous, discrete, ordinal, and categorical) dependent features by modeling the dependence structure of the features using conditional inference trees. We demonstrate our proposed method against the current industry standards in various simulation studies and find that our method often outperforms the other approaches. Finally, we apply our method to a real financial data set used in the 2018 FICO Explainable Machine Learning Challenge and show how our explanations compare to the FICO challenge Recognition Award winning team.
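As a point of reference for the abstract, the following is a minimal sketch of exact Shapley values computed under the feature-independence assumption that the abstract attributes to the original development. It is not the authors' method: it does not model feature dependence and uses no conditional inference trees. The names `toy_model`, `background`, `value`, and `shapley_values` are hypothetical and chosen only for illustration.

```python
from itertools import combinations
from math import factorial


def value(model, x, background, subset):
    """Estimate v(S): the mean prediction when the features in `subset` are
    fixed to their values in x and the rest are taken from each background
    row. Mixing rows this way assumes the features are independent."""
    total = 0.0
    for row in background:
        mixed = [x[j] if j in subset else row[j] for j in range(len(x))]
        total += model(mixed)
    return total / len(background)


def shapley_values(model, x, background):
    """Exact Shapley values by enumerating every subset of the other features.
    The cost grows as 2^m, so this is only practical for a few features."""
    m = len(x)
    phi = [0.0] * m
    for j in range(m):
        others = [k for k in range(m) if k != j]
        for size in range(m):
            # Shapley weight |S|! (m - |S| - 1)! / m!
            weight = factorial(size) * factorial(m - size - 1) / factorial(m)
            for subset in combinations(others, size):
                s = set(subset)
                gain = (value(model, x, background, s | {j})
                        - value(model, x, background, s))
                phi[j] += weight * gain
    return phi


def toy_model(features):
    # Hypothetical model: one additive term plus one interaction term.
    return 2.0 * features[0] + features[1] * features[2]


# Hypothetical background data (illustrative values, not from the paper).
background = [[0.0, 1.0, 0.0], [1.0, 0.0, 1.0], [2.0, 1.0, 1.0], [1.0, 2.0, 0.0]]
x = [3.0, 2.0, 1.0]

phi = shapley_values(toy_model, x, background)
baseline = value(toy_model, x, background, set())
print(phi, baseline)
```

The Shapley values sum to the prediction for `x` minus the baseline (the mean prediction over the background rows). A dependence-aware approach such as the one the abstract proposes would replace `value` with an estimate of the conditional expectation given the fixed features.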


Annabelle Alice Redelmeier

  • Affiliated with:
    Department of Statistical Analysis and Machine Learning for User-Motivated Applications (SAMBA) at Norsk Regnesentral

Martin Jullum

  • Affiliated with:
    Department of Statistical Analysis and Machine Learning for User-Motivated Applications (SAMBA) at Norsk Regnesentral

Kjersti Aas

  • Affiliated with:
    Department of Statistical Analysis and Machine Learning for User-Motivated Applications (SAMBA) at Norsk Regnesentral

The result is part of

Lecture Notes in Computer Science (LNCS).

Miesenberger, Klaus; Manduchi, Roberto; Rodriguez, Mario Covarrubias; Peňáz, Petr. 2020, Springer. Academic anthology/conference series