Cross-sectional relatedness between sentences in breast radiology reports: development of an SVM classifier and evaluation against annotations of five breast radiologists.

Affiliation

Clinical Informatics, Interventional and Translational Solutions, Philips Research North America, 345 Scarborough Road, Briarcliff Manor, NY, 10510, USA.

Publication information

J Digit Imaging. 2013 Oct;26(5):977-88. doi: 10.1007/s10278-013-9612-9.

DOI:10.1007/s10278-013-9612-9

PMID:23817629

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3782592/

Abstract

This study introduces the notion of cross-sectional relatedness, an informational dependence relation between sentences in the conclusion section of a breast radiology report and sentences in the findings section of the same report. It assesses inter-rater agreement among breast radiologists, and develops and evaluates a support vector machine (SVM) classifier for automatically detecting cross-sectional relatedness. A standard reference was manually created from 444 breast radiology reports by the first author. A subset of 37 reports was annotated by five breast radiologists. Inter-rater agreement was computed among their annotations and the standard reference. Thirteen numerical features were developed to characterize pairs of sentences; the optimal feature set was sought through forward selection. Inter-rater agreement had an F-measure of 0.623. The SVM classifier achieved an F-measure of 0.699 in a 12-fold cross-validation protocol against the standard reference. Report length did not correlate with the classifier's performance (correlation coefficient = -0.073). The SVM classifier had an average F-measure of 0.505 against the annotations by the breast radiologists. The mediocre inter-rater agreement is possibly caused by (1) a definition that is insufficiently actionable, (2) the fine-grained nature of cross-sectional relatedness at the sentence level rather than, for instance, the paragraph level, and (3) the higher-than-average complexity of the 37-report sample. The SVM classifier performs better against the standard reference than against the breast radiologists' annotations, which supports (3). The SVM's performance on the standard reference is satisfactory. Since the optimal feature set is not breast specific, the results may transfer to non-breast anatomies. Applications include a smart report viewing environment and data mining.
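To make the two main computations in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes each annotation is a set of (conclusion sentence index, findings sentence index) pairs marked as related, that agreement among several annotators is averaged over all annotator pairs (the abstract does not say how the 0.623 figure was aggregated), and that forward selection greedily adds the feature that most improves a score. The names `f_measure`, `mean_pairwise_f`, `forward_selection`, and the `score` callable are all hypothetical; in the study the score would presumably be the cross-validated F-measure of the SVM, which is not reproduced here.

```python
from itertools import combinations
from typing import Callable, List, Sequence, Set, Tuple

# Hypothetical representation: (conclusion sentence index, findings sentence index)
Pair = Tuple[int, int]


def f_measure(predicted: Set[Pair], reference: Set[Pair]) -> float:
    """Balanced F-measure: harmonic mean of precision and recall."""
    if not predicted and not reference:
        return 1.0  # assumption: two empty annotations count as full agreement
    true_positives = len(predicted & reference)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(reference)
    return 2 * precision * recall / (precision + recall)


def mean_pairwise_f(annotations: Sequence[Set[Pair]]) -> float:
    """Average F-measure over all unordered pairs of annotators (an assumed aggregation)."""
    scores = [f_measure(a, b) for a, b in combinations(annotations, 2)]
    return sum(scores) / len(scores)


def forward_selection(
    features: Sequence[str], score: Callable[[List[str]], float]
) -> List[str]:
    """Greedy forward selection: add the best feature until the score stops improving."""
    selected: List[str] = []
    remaining = list(features)
    best_score = float("-inf")
    while remaining:
        # Pick the remaining feature that gives the highest score when added.
        candidate = max(remaining, key=lambda f: score(selected + [f]))
        candidate_score = score(selected + [candidate])
        if candidate_score <= best_score:
            break  # no remaining feature improves the score
        selected.append(candidate)
        remaining.remove(candidate)
        best_score = candidate_score
    return selected
```

Because the balanced F-measure is symmetric in its two arguments, it can be used between two annotators without designating either one as the gold standard, which is why it is usable as an agreement measure as well as a classifier metric.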
