Tomatis Stefano, Bono Aldo, Bartoli Cesare, Carrara Mauro, Lualdi Manuela, Tragni Gabrina, Marchesini Renato
Department of Medical Physics, Istituto Nazionale per lo Studio e la Cura dei Tumori, Milan, Italy.
Med Phys. 2003 Feb;30(2):212-21. doi: 10.1118/1.1538230.
Our aim in the present research is to investigate the diagnostic performance of artificial neural networks (ANNs) applied to multispectral images of cutaneous pigmented skin lesions as well as to compare this approach to a standard traditional linear classification method, such as discriminant function analysis. This study involves a series of 534 patients with 573 cutaneous pigmented lesions (132 melanomas and 441 nonmelanoma lesions). Each lesion was analyzed by a telespectrophotometric system (TS) in vivo, before surgery. The system is able to acquire a set of 17 images at selected wavelengths from 400 to 1040 nm. For each wavelength, five lesion descriptors were extracted, related to the criteria of the ABCD (for asymmetry, border, color, and dimension) clinical guide for melanoma diagnosis. These variables were first reduced in dimension by the use of factor analysis techniques and then used as input data in an ANN. Multivariate discriminant analysis (MDA) was also performed on the same dataset. The whole dataset was split into two independent groups: i.e., train (the first 400 cases, 95 melanomas) and verification set (last 173 cases, 37 melanomas). Factor analysis was able to summarize the data structure into ten variables, accounting for at least 90% of the original parameters variance. After proper training, the ANN was able to classify the population with 80% sensitivity, 72% specificity, and 78% sensitivity, 76% specificity for the train and validation set, respectively. Following ROC analysis, area under curve (AUC) was 0.852 (train) and 0.847 (verify). Sensitivity and specificity values obtained by the standard discriminant analysis classifier resulted in a figure of 80% sensitivity, 60% specificity and 76% sensitivity, 57% specificity for the train and validation set, respectively. AUC for MDA was 0.810 and 0.764 for the train and verify set, respectively. Classification results were significantly different between the two methods both for diagnostic scores and model stability, which was worse for MDA.
我们当前研究的目的是调查人工神经网络(ANN)应用于皮肤色素沉着病变多光谱图像的诊断性能,并将此方法与标准的传统线性分类方法(如判别函数分析)进行比较。本研究涉及534例患者的573处皮肤色素沉着病变(132例黑色素瘤和441例非黑色素瘤病变)。在手术前,通过遥测分光光度系统(TS)对每个病变进行体内分析。该系统能够在400至1040nm的选定波长下采集一组17张图像。对于每个波长,提取了五个与黑色素瘤诊断的ABCD(不对称、边界、颜色和尺寸)临床指南标准相关的病变描述符。这些变量首先通过因子分析技术进行降维,然后用作人工神经网络的输入数据。还对同一数据集进行了多变量判别分析(MDA)。整个数据集被分为两个独立的组:即训练组(前400例,95例黑色素瘤)和验证组(最后173例,37例黑色素瘤)。因子分析能够将数据结构总结为十个变量,占原始参数方差的至少90%。经过适当训练后,人工神经网络对训练组和验证组人群的分类灵敏度分别为80%、特异性为72%,以及灵敏度为78%、特异性为76%。经过ROC分析,曲线下面积(AUC)在训练组为0.852,在验证组为0.847。标准判别分析分类器获得的灵敏度和特异性值在训练组分别为80%灵敏度、60%特异性,在验证组分别为76%灵敏度、57%特异性。MDA在训练组和验证组的AUC分别为0.810和0.764。两种方法在诊断分数和模型稳定性方面的分类结果均存在显著差异,MDA的模型稳定性更差。