School of Computing Science, Simon Fraser University, Burnaby, BC V5A 1S6, Canada.
Sci Rep. 2021 Apr 8;11(1):7769. doi: 10.1038/s41598-021-87064-7.
Automated machine learning approaches to skin lesion diagnosis from images are approaching dermatologist-level performance. However, current machine learning approaches that suggest management decisions rely on predicting the underlying skin condition to infer a management decision without considering the variability of management decisions that may exist within a single condition. We present the first work to explore image-based prediction of clinical management decisions directly without explicitly predicting the diagnosis. In particular, we use clinical and dermoscopic images of skin lesions along with patient metadata from the Interactive Atlas of Dermoscopy dataset (1011 cases; 20 disease labels; 3 management decisions) and demonstrate that predicting management labels directly is more accurate than predicting the diagnosis and then inferring the management decision ([Formula: see text] and [Formula: see text] improvement in overall accuracy and AUROC respectively), statistically significant at [Formula: see text]. Directly predicting management decisions also considerably reduces the over-excision rate as compared to management decisions inferred from diagnosis predictions (24.56% fewer cases wrongly predicted to be excised). Furthermore, we show that training a model to also simultaneously predict the seven-point criteria and the diagnosis of skin lesions yields an even higher accuracy (improvements of [Formula: see text] and [Formula: see text] in overall accuracy and AUROC respectively) of management predictions. Finally, we demonstrate our model's generalizability by evaluating on the publicly available MClass-D dataset and show that our model agrees with the clinical management recommendations of 157 dermatologists as much as they agree amongst each other.
基于图像的皮肤病变诊断自动化机器学习方法已接近皮肤科医生的水平。然而,目前的机器学习方法在建议管理决策时依赖于预测潜在的皮肤状况,从而推断出管理决策,而没有考虑到单一条件下可能存在的管理决策的可变性。我们首次探索了直接基于图像预测临床管理决策的方法,而无需明确预测诊断。具体来说,我们使用来自 Interactive Atlas of Dermoscopy 数据集的皮肤病变的临床和皮肤镜图像以及患者元数据(1011 例;20 种疾病标签;3 种管理决策),并证明直接预测管理标签比预测诊断然后推断管理决策更准确(整体准确性和 AUROC 分别提高了[Formula: see text]和[Formula: see text]),在[Formula: see text]上具有统计学意义。与从诊断预测推断的管理决策相比,直接预测管理决策还大大降低了过度切除率(错误预测需要切除的病例减少了 24.56%)。此外,我们还表明,训练一个模型同时预测七点标准和皮肤病变的诊断可以提高管理预测的准确性(整体准确性和 AUROC 分别提高了[Formula: see text]和[Formula: see text])。最后,我们通过评估公开可用的 MClass-D 数据集展示了我们模型的泛化能力,并表明我们的模型与 157 位皮肤科医生的临床管理建议的一致性与他们之间的一致性一样高。