Miller Ian J, Stapelberg Michael, Hudson Jeremy, Coxon Paul, Milani Nathaniel, Rosic Nedeljka, Furness James, Walsh Joe, Climstein Mike
Aquatic Based Research, Faculty of Health, Southern Cross University, Bilinga, Qld, Australia.
Skin Clinic Robina, Robina, Qld, Australia.
PeerJ. 2025 Aug 28;13:e19876. doi: 10.7717/peerj.19876. eCollection 2025.
Artificial intelligence (AI) is poised to revolutionise how melanoma is detected in clinical practice, yet few studies have been published with patient data at the forefront.
The primary aim of this study was to investigate the clinical performance of a market-approved convolutional neural network (CNN) to better differentiate skin lesions suspicious of being malignant melanoma (MM). A secondary aim of this study was to compare the diagnostic performance of the CNN across two separate general practices, that are skin cancer focused clinics.
Multicentre, cross-sectional study using a commercially available CNN on 373 melanocytic lesions (114 melanoma, 259 non-melanoma) from participants attending a skin examination within two Australian specialised, general practice clinics. Performance metrics included sensitivity, specificity, predictive values, diagnostic odds ratios, accuracy and area under the curve (AUC) of receiver operating characteristics (ROC) used for classification of images.
The CNN average sensitivity [Gold Coast Townsville] was calculated as 63.2% [61.5% 68.6%], specificity as 53.9% [52.5% 55.1%], positive predictive value as 37.8% [28.9% 44.0%] and negative predictive value as 76.8% [71.4% 84.2%]. Likelihood ratios were 1.4 for positive likelihood ratio, 0.7 for negative likelihood ratio and a diagnostic odds ratio of 2.0 across both clinics. Accuracy was calculated as 56.6% [56.1% 57.5%] and the AUC of ROC for both clinics was 0.602 and 0.615 for Townsville and Gold Coast, respectively.
Improvement of the performance of this CNN for the classification of images, particularly when suspecting MM is necessary before it may be used in a clinical setting in Australia. Other validated AI systems used internationally may also require review for use in an Australian setting.
人工智能(AI)有望彻底改变临床实践中黑色素瘤的检测方式,但以患者数据为核心的已发表研究却很少。
本研究的主要目的是调查一种市场认可的卷积神经网络(CNN)的临床性能,以更好地区分疑似恶性黑色素瘤(MM)的皮肤病变。本研究的次要目的是比较该CNN在两个独立的、以皮肤癌为重点的普通诊所的诊断性能。
多中心横断面研究,在澳大利亚两家专门的普通诊所中,使用市售的CNN对373例黑素细胞病变(114例黑色素瘤,259例非黑色素瘤)进行研究。性能指标包括用于图像分类的敏感性、特异性、预测值、诊断比值比、准确性以及接受者操作特征曲线(ROC)下的面积(AUC)。
CNN的平均敏感性[黄金海岸 汤斯维尔]计算为63.2%[61.5% 68.6%],特异性为53.9%[52.5% 55.1%],阳性预测值为37.8%[28.9% 44.0%],阴性预测值为76.8%[71.4% 84.2%]。两家诊所的阳性似然比为1.4,阴性似然比为0.7,诊断比值比为2.0。准确性计算为56.6%[56.1% 57.5%],汤斯维尔和黄金海岸两家诊所的ROC曲线下面积分别为0.602和0.615。
在澳大利亚将该CNN用于临床之前,有必要提高其图像分类性能,尤其是在怀疑为MM时。国际上使用的其他经过验证的AI系统在澳大利亚使用时可能也需要进行评估。