In this paper, we present and discuss two new measures of inter- and intra-rater agreement to assess the reliability of the raters, and hence of their labeling, in multi-rater settings, which are common in the production of ground truth for machine learning models. Our proposal is more conservative than other existing agreement measures, as it considers a more articulated notion of agreement by chance, based on an empirical estimation of the precision (or reliability) of the individual raters involved. We discuss the measures in light of a realistic annotation task that involved 13 expert radiologists in labeling the MRNet dataset.
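For context, the sketch below shows the standard form of a chance-corrected agreement coefficient (the kappa family that chance-corrected measures build on) for two raters on binary labels. The function name and toy data are illustrative only, and the chance term uses the usual marginal-frequency assumption rather than the precision-based estimation proposed in the paper.

```python
from collections import Counter

def chance_corrected_agreement(labels_a, labels_b):
    """Return (po - pe) / (1 - pe) for two equal-length label sequences."""
    assert len(labels_a) == len(labels_b) and labels_a
    n = len(labels_a)

    # Observed agreement: fraction of items labeled identically by both raters.
    po = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Expected agreement by chance, estimated here from each rater's marginal
    # label distribution (the standard kappa assumption; the paper's measures
    # replace this with an estimate based on the raters' empirical precision).
    freq_a = Counter(labels_a)
    freq_b = Counter(labels_b)
    categories = set(freq_a) | set(freq_b)
    pe = sum((freq_a[c] / n) * (freq_b[c] / n) for c in categories)

    return (po - pe) / (1 - pe) if pe < 1 else 1.0

# Hypothetical example: two raters labeling 8 exams as positive (1) or negative (0).
rater_1 = [1, 1, 0, 1, 0, 0, 1, 1]
rater_2 = [1, 0, 0, 1, 0, 1, 1, 1]
print(chance_corrected_agreement(rater_1, rater_2))
```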