Suppr超能文献

乳腺钼靶检查数据集读数产生的某些性能值与临床审计结果高度相关。

Certain performance values arising from mammographic test set readings correlate well with clinical audit.

作者信息

Soh BaoLin Pauline, Lee Warwick Bruce, Mello-Thoms Claudia, Tapia Kriscia, Ryan John, Hung Wai Tak, Thompson Graham, Heard Rob, Brennan Patrick

机构信息

Department of Diagnostic Radiology, Singapore General Hospital, Singapore.

Cancer Institute NSW, Sydney, New South Wales, Australia.

出版信息

J Med Imaging Radiat Oncol. 2015 Aug;59(4):403-410. doi: 10.1111/1754-9485.12301. Epub 2015 Apr 1.

Abstract

INTRODUCTION

Test sets have been increasingly utilised to augment clinical audit in breast screening programmes; however, their relationship has never been satisfactorily understood. This study examined the relationship between mammographic test set performance and clinical audit data.

METHODS

Clinical audit data over a 2-year period was generated for each of 20 radiologists. Sixty mammographic examinations, consisting of 40 normal and 20 cancer cases, formed the test set. Readers located any identifiable cancer, and levels of confidence were scored from 2 to 5, where a score of 3 and above is considered a recall rating. Jackknifing free response operating characteristic (JAFROC) figure-of-merit (FOM), location sensitivity and specificity were calculated for individual readers and then compared with clinical audit values using Spearman's rho.

RESULTS

JAFROC FOM showed significant correlations to: recall rate at a first round of screening (r = 0.51; P = 0.02); rate of small invasive cancers per 10 000 reads (r = 0.5; P = 0.02); percentage of all cancers read that were not recalled (r = -0.51; P = 0.02); and sensitivity (r = 0.51; P = 0.02). Location sensitivity demonstrated significant correlations with: rate of small invasive cancers per 10 000 reads (r = 0.46; P = 0.04); rate of DCIS (ductal carcinoma in situ) per 10 000 reads (r = 0.44; P = 0.05); detection rate of all invasive cancers and DCIS per 10 000 reads (r = 0.54; P = 0.01); percentage of all cancers read that were not recalled (r = -0.57; P = 0.009); and sensitivity (r = 0.57; P = 0.009). No other significant relationships were noted.

CONCLUSION

Performance indicators from test set demonstrate significant correlations with specific aspects of clinical performance, although caution needs to be exercised when generalising test set specificity to the clinical situation.

摘要

引言

测试集已越来越多地用于加强乳腺筛查项目中的临床审计;然而,它们之间的关系从未得到令人满意的理解。本研究调查了乳房X光检查测试集性能与临床审计数据之间的关系。

方法

为20位放射科医生中的每一位生成了为期2年的临床审计数据。由40例正常病例和20例癌症病例组成的60次乳房X光检查构成了测试集。阅片者找出任何可识别的癌症,并将信心水平从2到5进行评分,其中3分及以上被视为召回评级。计算每位阅片者的留一法自由反应操作特征(JAFROC)品质因数(FOM)、定位敏感性和特异性,然后使用斯皮尔曼等级相关系数与临床审计值进行比较。

结果

JAFROC FOM与以下各项显示出显著相关性:第一轮筛查时的召回率(r = 0.51;P = 0.02);每10000次阅片中微小浸润癌的发生率(r = 0.5;P = 0.02);所有已阅癌症中未被召回的百分比(r = -0.51;P = 0.02);以及敏感性(r = 0.51;P = 0.02)。定位敏感性与以下各项显示出显著相关性:每10000次阅片中微小浸润癌的发生率(r = 0.46;P = 0.04);每10000次阅片中导管原位癌(DCIS)的发生率(r = 0.44;P = 0.05);每10000次阅片中所有浸润癌和DCIS的检出率(r = 0.54;P = 0.01);所有已阅癌症中未被召回的百分比(r = -0.57;P = 0.009);以及敏感性(r = 0.57;P = 0.009)。未发现其他显著关系。

结论

测试集的性能指标与临床性能的特定方面显示出显著相关性,尽管在将测试集特异性推广到临床情况时需要谨慎。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验