Kendall Edward J, Flynn Matthew T
Discipline of Radiology, Memorial University of Newfoundland, St. John's, Newfoundland and Labrador, Canada.
PLoS One. 2014 Mar 14;9(3):e91015. doi: 10.1371/journal.pone.0091015. eCollection 2014.
This work aimed to improve breast screening program accuracy using automated classification. The goal was to determine if whole image features represented in the discrete cosine transform would provide a basis for classification. Priority was placed on avoiding false negative findings.
Online datasets were used for this work. No informed consent was required. Programs were developed in Mathematica and, where necessary to improve computational performance ported to C++. The use of a discrete cosine transform to separate normal from cancerous breast tissue was tested. Features (moments of the mean) were calculated in square sections of the transform centered on the origin. K-nearest neighbor and naive Bayesian classifiers were tested.
Forty-one features were generated and tested singly, and in combination of two or three. Using a k-nearest neighbor classifier, sensitivities as high as 98% with a specificity of 66% were achieved. With a naive Bayesian classifier, sensitivities as high as 100% were achieved with a specificity of 64%.
Whole image classification based on discrete cosine transform (DCT) features was effectively implemented with a high level of sensitivity and specificity achieved. The high sensitivity attained using the DCT generated feature set implied that these classifiers could be used in series with other methods to increase specificity. Using a classifier with near 100% sensitivity, such as the one developed in this project, before applying a second classifier could only boost the accuracy of that classifier.
本研究旨在通过自动分类提高乳腺筛查程序的准确性。目标是确定离散余弦变换中表示的全图像特征是否能为分类提供基础。重点是避免假阴性结果。
本研究使用在线数据集。无需知情同意。程序在Mathematica中开发,并在必要时移植到C++以提高计算性能。测试了使用离散余弦变换将正常乳腺组织与癌性乳腺组织分离的方法。在以原点为中心的变换方形区域中计算特征(均值矩)。测试了k近邻和朴素贝叶斯分类器。
生成并单独测试了41个特征,以及两个或三个特征的组合。使用k近邻分类器,灵敏度高达98%,特异性为66%。使用朴素贝叶斯分类器,灵敏度高达100%,特异性为64%。
基于离散余弦变换(DCT)特征的全图像分类得以有效实现,实现了较高的灵敏度和特异性。使用DCT生成的特征集获得的高灵敏度意味着这些分类器可与其他方法串联使用以提高特异性。在应用第二个分类器之前使用灵敏度接近100%的分类器(如本项目开发的分类器)只会提高该分类器的准确性。