Clinical Informatics, Interventional and Translational Solutions, Philips Research North America, 345 Scarborough Road, Briarcliff Manor, NY, 10510, USA,
J Digit Imaging. 2013 Oct;26(5):977-88. doi: 10.1007/s10278-013-9612-9.
Introduce the notion of cross-sectional relatedness as an informational dependence relation between sentences in the conclusion section of a breast radiology report and sentences in the findings section of the same report. Assess inter-rater agreement of breast radiologists. Develop and evaluate a support vector machine (SVM) classifier for automatically detecting cross-sectional relatedness. A standard reference is manually created from 444 breast radiology reports by the first author. A subset of 37 reports is annotated by five breast radiologists. Inter-rater agreement is computed among their annotations and standard reference. Thirteen numerical features are developed to characterize pairs of sentences; the optimal feature set is sought through forward selection. Inter-rater agreement is F-measure 0.623. SVM classifier has F-measure of 0.699 in the 12-fold cross-validation protocol against standard reference. Report length does not correlate with the classifier's performance (correlation coefficient = -0.073). SVM classifier has average F-measure of 0.505 against annotations by breast radiologists. Mediocre inter-rater agreement is possibly caused by: (1) definition is insufficiently actionable, (2) fine-grained nature of cross-sectional relatedness on sentence level, instead of, for instance, on paragraph level, and (3) higher-than-average complexity of 37-report sample. SVM classifier performs better against standard reference than against breast radiologists's annotations. This is supportive of (3). SVM's performance on standard reference is satisfactory. Since optimal feature set is not breast specific, results may transfer to non-breast anatomies. Applications include a smart report viewing environment and data mining.
介绍横断面相关性的概念,它是乳腺影像学报告的结论部分的句子与同一报告的发现部分的句子之间的一种信息依赖关系。评估乳腺放射科医生的组内一致性。开发和评估支持向量机(SVM)分类器,用于自动检测横断面相关性。第一作者通过 444 份乳腺放射学报告手动创建了一个标准参考。五位乳腺放射科医生对其中 37 份报告进行了注释。计算他们的注释和标准参考之间的组内一致性。开发了 13 个数值特征来描述句子对;通过前向选择寻求最佳特征集。组内一致性的 F 度量为 0.623。在针对标准参考的 12 折交叉验证协议中,SVM 分类器的 F 度量为 0.699。报告长度与分类器的性能不相关(相关系数=−0.073)。SVM 分类器对乳腺放射科医生注释的平均 F 度量为 0.505。中等的组内一致性可能是由于:(1)定义不够可操作,(2)句子层面上的横断面相关性的细粒度性质,而不是例如在段落层面上,以及(3)37 份报告样本的平均复杂度较高。SVM 分类器对标准参考的性能优于对乳腺放射科医生注释的性能。这支持(3)。SVM 在标准参考上的性能令人满意。由于最佳特征集不是专门针对乳腺的,因此结果可能会转移到非乳腺解剖结构。应用包括智能报告查看环境和数据挖掘。