• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

建立放射科医生的评估模型,探索优化筛查性乳房 X 光照片双读的配对策略。

Modeling Radiologists' Assessments to Explore Pairing Strategies for Optimized Double Reading of Screening Mammograms.

机构信息

Department of Medical Imaging, Radboud University Medical Center, Nijmegen, The Netherlands.

Department of Psychological and Brain Sciences, University of California, Santa Barbara, CA, USA.

出版信息

Med Decis Making. 2024 Oct;44(7):828-842. doi: 10.1177/0272989X241264572. Epub 2024 Jul 30.

DOI:10.1177/0272989X241264572
PMID:39077968
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11490068/
Abstract

PURPOSE

To develop a model that simulates radiologist assessments and use it to explore whether pairing readers based on their individual performance characteristics could optimize screening performance.

METHODS

Logistic regression models were designed and used to model individual radiologist assessments. For model evaluation, model-predicted individual performance metrics and paired disagreement rates were compared against the observed data using Pearson correlation coefficients. The logistic regression models were subsequently used to simulate different screening programs with reader pairing based on individual true-positive rates (TPR) and/or false-positive rates (FPR). For this, retrospective results from breast cancer screening programs employing double reading in Sweden, England, and Norway were used. Outcomes of random pairing were compared against those composed of readers with similar and opposite TPRs/FPRs, with positive assessments defined by either reader flagging an examination as abnormal.

RESULTS

The analysis data sets consisted of 936,621 (Sweden), 435,281 (England), and 1,820,053 (Norway) examinations. There was good agreement between the model-predicted and observed radiologists' TPR and FPR ( ≥ 0.969). Model-predicted negative-case disagreement rates showed high correlations ( ≥ 0.709), whereas positive-case disagreement rates had lower correlation levels due to sparse data ( ≥ 0.532). Pairing radiologists with similar FPR characteristics (Sweden: 4.50% [95% confidence interval: 4.46%-4.54%], England: 5.51% [5.47%-5.56%], Norway: 8.03% [7.99%-8.07%]) resulted in significantly lower FPR than with random pairing (Sweden: 4.74% [4.70%-4.78%], England: 5.76% [5.71%-5.80%], Norway: 8.30% [8.26%-8.34%]), reducing examinations sent to consensus/arbitration while the TPR did not change significantly. Other pairing strategies resulted in equal or worse performance than random pairing.

CONCLUSIONS

Logistic regression models accurately predicted screening mammography assessments and helped explore different radiologist pairing strategies. Pairing readers with similar modeled FPR characteristics reduced the number of examinations unnecessarily sent to consensus/arbitration without significantly compromising the TPR.

HIGHLIGHTS

A logistic-regression model can be derived that accurately predicts individual and paired reader performance during mammography screening reading.Pairing screening mammography radiologists with similar false-positive characteristics reduced false-positive rates with no significant loss in true positives and may reduce the number of examinations unnecessarily sent to consensus/arbitration.

摘要

目的

开发一种模拟放射科医生评估的模型,并利用该模型探讨根据个体表现特征对读者进行配对是否可以优化筛查性能。

方法

设计了逻辑回归模型来模拟个体放射科医生的评估。为了模型评估,使用皮尔逊相关系数将模型预测的个体性能指标和配对不一致率与观察数据进行比较。随后,使用逻辑回归模型根据个体的真阳性率(TPR)和/或假阳性率(FPR)模拟不同的筛查方案,并进行读者配对。为此,使用了瑞典、英国和挪威乳腺癌筛查项目的回顾性结果。将随机配对的结果与具有相似和相反 TPR/FPR 的读者配对的结果进行比较,阳性评估由任一读者将检查标记为异常来定义。

结果

分析数据集包括 936621 例(瑞典)、435281 例(英国)和 1820053 例(挪威)检查。模型预测的放射科医生 TPR 和 FPR 与观察结果具有很好的一致性(≥0.969)。模型预测的阴性病例不一致率具有很高的相关性(≥0.709),而阳性病例不一致率由于数据稀疏,相关性水平较低(≥0.532)。配对具有相似 FPR 特征的放射科医生(瑞典:4.50%[95%置信区间:4.46%-4.54%],英国:5.51%[5.47%-5.56%],挪威:8.03%[7.99%-8.07%])比随机配对(瑞典:4.74%[4.70%-4.78%],英国:5.76%[5.71%-5.80%],挪威:8.30%[8.26%-8.34%])显著降低了 FPR,同时 TPR 没有显著变化。其他配对策略的表现与随机配对相同或更差。

结论

逻辑回归模型准确预测了筛查乳房 X 线摄影评估,并有助于探讨不同的放射科医生配对策略。配对具有相似模型化 FPR 特征的读者可以减少不必要地发送到共识/仲裁的检查数量,而不会显著降低 TPR。

重点

可以得出一个能够准确预测乳房 X 线摄影筛查阅读中个体和配对读者表现的逻辑回归模型。

配对乳房 X 线摄影筛查放射科医生具有相似的假阳性特征,可以降低假阳性率,而不会显著降低真阳性率,并可能减少不必要地发送到共识/仲裁的检查数量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/1446d425f186/10.1177_0272989X241264572-fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/6f87d720268c/10.1177_0272989X241264572-img2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/606d3ea6e5cb/10.1177_0272989X241264572-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/f314a8372878/10.1177_0272989X241264572-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/422fef53cf2f/10.1177_0272989X241264572-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/7aa3c21bf355/10.1177_0272989X241264572-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/170f3c40a4a4/10.1177_0272989X241264572-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/e892fde47336/10.1177_0272989X241264572-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/efcfd783d3eb/10.1177_0272989X241264572-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/1446d425f186/10.1177_0272989X241264572-fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/6f87d720268c/10.1177_0272989X241264572-img2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/606d3ea6e5cb/10.1177_0272989X241264572-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/f314a8372878/10.1177_0272989X241264572-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/422fef53cf2f/10.1177_0272989X241264572-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/7aa3c21bf355/10.1177_0272989X241264572-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/170f3c40a4a4/10.1177_0272989X241264572-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/e892fde47336/10.1177_0272989X241264572-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/efcfd783d3eb/10.1177_0272989X241264572-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d60/11490068/1446d425f186/10.1177_0272989X241264572-fig8.jpg

相似文献

1
Modeling Radiologists' Assessments to Explore Pairing Strategies for Optimized Double Reading of Screening Mammograms.建立放射科医生的评估模型,探索优化筛查性乳房 X 光照片双读的配对策略。
Med Decis Making. 2024 Oct;44(7):828-842. doi: 10.1177/0272989X241264572. Epub 2024 Jul 30.
2
Optimizing the Pairs of Radiologists That Double Read Screening Mammograms.优化双人读片筛查乳腺 X 光片的放射科医生组合。
Radiology. 2023 Oct;309(1):e222691. doi: 10.1148/radiol.222691.
3
Influence of Mammography Volume on Radiologists' Performance: Results from BreastScreen Norway.乳腺钼靶摄影检查容积对放射科医生表现的影响:来自挪威乳腺筛查项目的结果。
Radiology. 2019 Aug;292(2):289-296. doi: 10.1148/radiol.2019182684. Epub 2019 May 28.
4
Effect of human variability on independent double reading in screening mammography.人类个体差异对乳腺钼靶筛查中独立双读的影响。
Acad Radiol. 1996 Nov;3(11):891-7. doi: 10.1016/s1076-6332(96)80296-0.
5
Evaluation of Combined Artificial Intelligence and Radiologist Assessment to Interpret Screening Mammograms.联合人工智能和放射科医生评估解读筛查性乳房 X 光照片的效果。
JAMA Netw Open. 2020 Mar 2;3(3):e200265. doi: 10.1001/jamanetworkopen.2020.0265.
6
Effect of radiologists' diagnostic work-up volume on interpretive performance.放射科医生的诊断检查工作量对解读表现的影响。
Radiology. 2014 Nov;273(2):351-64. doi: 10.1148/radiol.14132806. Epub 2014 Jun 24.
7
Discordant and false-negative interpretations at digital breast tomosynthesis in the prospective Oslo Tomosynthesis Screening Trial (OTST) using independent double reading.前瞻性奥斯陆断层合成筛查试验(OTST)中使用独立双读时数字乳腺断层合成的不一致和假阴性解读。
Eur Radiol. 2024 Jun;34(6):3912-3923. doi: 10.1007/s00330-023-10400-0. Epub 2023 Nov 8.
8
Double versus single reading of mammograms in a breast cancer screening programme: a cost-consequence analysis.乳腺癌筛查项目中乳腺钼靶片的双人阅片与单人阅片:成本效益分析
Eur Radiol. 2016 Sep;26(9):3262-71. doi: 10.1007/s00330-015-4175-4. Epub 2016 Jan 8.
9
Effect of integrating digital breast tomosynthesis (3D-mammography) with acquired or synthetic 2D-mammography on radiologists' true-positive and false-positive detection in a population screening trial: A descriptive study.数字乳腺断层合成摄影术(三维乳腺 X 线摄影术)与获取或合成的二维乳腺 X 线摄影术联合应用对人群筛查试验中放射科医生真阳性和假阳性检出的影响:一项描述性研究。
Eur J Radiol. 2018 Sep;106:26-31. doi: 10.1016/j.ejrad.2018.07.008. Epub 2018 Jul 10.
10
Performance of Radiologists and Radiographers in Double Reading Mammograms: The UK National Health Service Breast Screening Program.放射科医生和放射技师在双重读取乳腺 X 光片中的表现:英国国家医疗服务体系乳腺筛查计划。
Radiology. 2023 Jan;306(1):102-109. doi: 10.1148/radiol.212951. Epub 2022 Sep 13.

本文引用的文献

1
Optimizing the Pairs of Radiologists That Double Read Screening Mammograms.优化双人读片筛查乳腺 X 光片的放射科医生组合。
Radiology. 2023 Oct;309(1):e222691. doi: 10.1148/radiol.222691.
2
Possible strategies for use of artificial intelligence in screen-reading of mammograms, based on retrospective data from 122,969 screening examinations.基于 122969 次筛查检查的回顾性数据,探讨人工智能在乳腺 X 线摄影筛查阅读中的应用策略。
Eur Radiol. 2022 Dec;32(12):8238-8246. doi: 10.1007/s00330-022-08909-x. Epub 2022 Jun 15.
3
Screen-detected and interval breast cancer after concordant and discordant interpretations in a population based screening program using independent double reading.
基于独立双读的人群筛查项目中,对一致和不一致解读后的屏幕检出和间期乳腺癌。
Eur Radiol. 2022 Sep;32(9):5974-5985. doi: 10.1007/s00330-022-08711-9. Epub 2022 Apr 2.
4
A machine learning model based on readers' characteristics to predict their performances in reading screening mammograms.基于读者特征的机器学习模型,用于预测其阅读筛查性乳房 X 光片的表现。
Breast Cancer. 2022 Jul;29(4):589-598. doi: 10.1007/s12282-022-01335-3. Epub 2022 Feb 5.
5
Artificial intelligence (AI) to enhance breast cancer screening: protocol for population-based cohort study of cancer detection.人工智能(AI)增强乳腺癌筛查:基于人群队列研究癌症检出的方案。
BMJ Open. 2022 Jan 3;12(1):e054005. doi: 10.1136/bmjopen-2021-054005.
6
External Evaluation of 3 Commercial Artificial Intelligence Algorithms for Independent Assessment of Screening Mammograms.商业人工智能算法独立评估筛查性乳房 X 光照片的外部评估。
JAMA Oncol. 2020 Oct 1;6(10):1581-1588. doi: 10.1001/jamaoncol.2020.3321.
7
Range of Radiologist Performance in a Population-based Screening Cohort of 1 Million Digital Mammography Examinations.在 100 万例数字乳腺摄影筛查队列中放射科医生表现的范围。
Radiology. 2020 Oct;297(1):33-39. doi: 10.1148/radiol.2020192212. Epub 2020 Jul 28.
8
A Multi-million Mammography Image Dataset and Population-Based Screening Cohort for the Training and Evaluation of Deep Neural Networks-the Cohort of Screen-Aged Women (CSAW).一个包含数百万张乳腺 X 光图像的数据集和一个基于人群的筛查队列,用于深度学习神经网络的培训和评估 - 筛查年龄段女性队列(CSAW)。
J Digit Imaging. 2020 Apr;33(2):408-413. doi: 10.1007/s10278-019-00278-0.
9
Can eye-tracking metrics be used to better pair radiologists in a mammogram reading task?眼动追踪指标能否用于更好地在乳腺 X 光阅读任务中配对放射科医生?
Med Phys. 2018 Nov;45(11):4844-4856. doi: 10.1002/mp.13161. Epub 2018 Oct 1.
10
Benefits of Independent Double Reading in Digital Mammography: A Theoretical Evaluation of All Possible Pairing Methodologies.数字乳腺摄影独立双读的优势:所有可能配对方法的理论评估。
Acad Radiol. 2019 Jun;26(6):717-723. doi: 10.1016/j.acra.2018.06.017. Epub 2018 Jul 29.