Author
Listed:
- Jessie J. J. Gommers
(Department of Medical Imaging, Radboud University Medical Center, Nijmegen, The Netherlands)
- Craig K. Abbey
(Department of Psychological and Brain Sciences, University of California, Santa Barbara, CA, USA)
- Fredrik Strand
(Department of Oncology-Pathology, Karolinska Institute, Stockholm, Sweden
Breast Radiology, Karolinska University Hospital, Stockholm, Sweden)
- Sian Taylor-Phillips
(Warwick Medical School, University of Warwick, Coventry, UK)
- David J. Jenkinson
(Warwick Medical School, University of Warwick, Coventry, UK)
- Marthe Larsen
(Section for Breast Cancer Screening, Cancer Registry of Norway, Oslo, Norway)
- Solveig Hofvind
(Section for Breast Cancer Screening, Cancer Registry of Norway, Oslo, Norway
Department of Health and Care Sciences, UiT The Arctic University of Norway, Tromsø, Norway)
- Mireille J. M. Broeders
(Dutch Expert Center for Screening (LRCB), Nijmegen, The Netherlands
IQ Health Science Department, Radboud University Medical Center, Nijmegen, The Netherlands)
- Ioannis Sechopoulos
(Department of Medical Imaging, Radboud University Medical Center, Nijmegen, The Netherlands
Dutch Expert Center for Screening (LRCB), Nijmegen, The Netherlands
Technical Medicine Center, University of Twente, Enschede, The Netherlands)
Abstract
Purpose To develop a model that simulates radiologist assessments and use it to explore whether pairing readers based on their individual performance characteristics could optimize screening performance. Methods Logistic regression models were designed and used to model individual radiologist assessments. For model evaluation, model-predicted individual performance metrics and paired disagreement rates were compared against the observed data using Pearson correlation coefficients. The logistic regression models were subsequently used to simulate different screening programs with reader pairing based on individual true-positive rates (TPR) and/or false-positive rates (FPR). For this, retrospective results from breast cancer screening programs employing double reading in Sweden, England, and Norway were used. Outcomes of random pairing were compared against those composed of readers with similar and opposite TPRs/FPRs, with positive assessments defined by either reader flagging an examination as abnormal. Results The analysis data sets consisted of 936,621 (Sweden), 435,281 (England), and 1,820,053 (Norway) examinations. There was good agreement between the model-predicted and observed radiologists’ TPR and FPR ( r  ≥ 0.969). Model-predicted negative-case disagreement rates showed high correlations ( r  ≥ 0.709), whereas positive-case disagreement rates had lower correlation levels due to sparse data ( r  ≥ 0.532). Pairing radiologists with similar FPR characteristics (Sweden: 4.50% [95% confidence interval: 4.46%–4.54%], England: 5.51% [5.47%–5.56%], Norway: 8.03% [7.99%–8.07%]) resulted in significantly lower FPR than with random pairing (Sweden: 4.74% [4.70%–4.78%], England: 5.76% [5.71%–5.80%], Norway: 8.30% [8.26%–8.34%]), reducing examinations sent to consensus/arbitration while the TPR did not change significantly. Other pairing strategies resulted in equal or worse performance than random pairing. Conclusions Logistic regression models accurately predicted screening mammography assessments and helped explore different radiologist pairing strategies. Pairing readers with similar modeled FPR characteristics reduced the number of examinations unnecessarily sent to consensus/arbitration without significantly compromising the TPR. Highlights A logistic-regression model can be derived that accurately predicts individual and paired reader performance during mammography screening reading. Pairing screening mammography radiologists with similar false-positive characteristics reduced false-positive rates with no significant loss in true positives and may reduce the number of examinations unnecessarily sent to consensus/arbitration.
Suggested Citation
Jessie J. J. Gommers & Craig K. Abbey & Fredrik Strand & Sian Taylor-Phillips & David J. Jenkinson & Marthe Larsen & Solveig Hofvind & Mireille J. M. Broeders & Ioannis Sechopoulos, 2024.
"Modeling Radiologists’ Assessments to Explore Pairing Strategies for Optimized Double Reading of Screening Mammograms,"
Medical Decision Making, , vol. 44(7), pages 828-842, October.
Handle:
RePEc:sae:medema:v:44:y:2024:i:7:p:828-842
DOI: 10.1177/0272989X241264572
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:medema:v:44:y:2024:i:7:p:828-842. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.