Delogu Pasquale, Evelina Fantacci Maria, Kasae Parnian, Retico Alessandra
Dipartimento di Fisica dell'Università di Pisa and INFN Sezionedi Pisa, Largo Pontecorvo 3, 56127 Pisa, Italy.
Comput Biol Med. 2007 Oct;37(10):1479-91. doi: 10.1016/j.compbiomed.2007.01.009. Epub 2007 Mar 26.
Computerized methods have recently shown a great potential in providing radiologists with a second opinion about the visual diagnosis of the malignancy of mammographic masses. The computer-aided diagnosis (CAD) system we developed for the mass characterization is mainly based on a segmentation algorithm and on the neural classification of several features computed on the segmented mass. Mass-segmentation plays a key role in most computerized systems. Our technique is a gradient-based one, showing the main characteristic that no free parameters have been evaluated on the data set used in this analysis, thus it can directly be applied to data sets acquired in different conditions without any ad hoc modification. A data set of 226 masses (109 malignant and 117 benign) has been used in this study. The segmentation algorithm works with a comparable efficiency both on malignant and benign masses. Sixteen features based on shape, size and intensity of the segmented masses are extracted and analyzed by a multi-layered perceptron neural network trained with the error back-propagation algorithm. The capability of the system in discriminating malignant from benign masses has been evaluated in terms of the receiver-operating characteristic (ROC) analysis. A feature selection procedure has been carried out on the basis of the feature discriminating power and of the linear correlations interplaying among them. The comparison of the areas under the ROC curves obtained by varying the number of features to be classified has shown that 12 selected features out of the 16 computed ones are powerful enough to achieve the best classifier performances. The radiologist assigned the segmented masses to three different categories: correctly-, acceptably- and non-acceptably-segmented masses. We initially estimated the area under ROC curve only on the first category of segmented masses (the 88.5% of the data set), then extending the classification to the second subclass (reaching the 97.8% of the data set) and finally to the whole data set, obtaining A(z)=0.805+/-0.030, 0.787+/-0.024 and 0.780+/-0.023, respectively.
计算机化方法近来在为放射科医生提供关于乳腺钼靶肿块恶性程度视觉诊断的第二种意见方面显示出巨大潜力。我们开发的用于肿块特征描述的计算机辅助诊断(CAD)系统主要基于一种分割算法以及对在分割后的肿块上计算出的多个特征进行神经分类。肿块分割在大多数计算机化系统中起着关键作用。我们的技术是基于梯度的,其主要特点是在本分析中使用的数据集上未评估任何自由参数,因此它可以直接应用于在不同条件下获取的数据集,而无需任何特殊修改。本研究使用了一个包含226个肿块(109个恶性和117个良性)的数据集。该分割算法在恶性和良性肿块上的工作效率相当。基于分割后肿块的形状、大小和强度提取了16个特征,并通过使用误差反向传播算法训练的多层感知器神经网络进行分析。该系统区分恶性和良性肿块的能力已通过接收者操作特征(ROC)分析进行评估。基于特征的辨别力及其相互之间的线性相关性进行了特征选择过程。通过改变要分类的特征数量获得的ROC曲线下面积的比较表明,从计算出的16个特征中选择的12个特征足以实现最佳分类器性能。放射科医生将分割后的肿块分为三个不同类别:正确分割、可接受分割和不可接受分割的肿块。我们最初仅在第一类分割后的肿块(数据集的88.5%)上估计ROC曲线下面积,然后将分类扩展到第二个子类(达到数据集的97.8%),最后扩展到整个数据集,分别得到A(z)=0.805±0.030、0.787±0.024和0.780±0.023。