News

New measures for evaluating CNA models

Luna De Souter identifies shortcomings of the two main measures for evaluating CNA models, consistency and coverage, and introduces two new evaluation measures.

Photo:

JCI_title

Main content

Published: 15.01.2024

Abstract

Configurational Comparative Methods (CCMs) aim to learn causal structures from datasets by exploiting Boolean sufficiency and necessity relationships. One important challenge for these methods is that such Boolean relationships are often not satisfied in real-life datasets, as these datasets usually contain noise. Hence, CCMs infer models that only approximately fit the data, introducing a risk of inferring incorrect or incomplete models, especially when data are also fragmented (have limited empirical diversity). To minimize this risk, evaluation measures for sufficiency and necessity should be sensitive to all relevant evidence. This article points out that the standard evaluation measures in CCMs, consistency and coverage, neglect certain evidence for these Boolean relationships. Correspondingly, two new measures, contrapositive consistency and contrapositive coverage, which are equivalent to the binary classification measures specificity and negative predictive value, respectively, are introduced to the CCM context as additions to consistency and coverage. A simulation experiment demonstrates that the introduced contrapositive measures indeed help to identify correct CCM models.

Luna De Souter, (2024), Evaluating Boolean relationships in Configurational Comparative Methods, Journal of Causal Inference, vol. 12, no. 1, 2024, doi:10.1515/jci-2023-0014