Department of Electrical Engineering, Stanford University, Stanford, California.
Department of Dermatology, Stanford School of Medicine, Redwood City, California.
JAMA Dermatol. 2023 May 1;159(5):496-503. doi: 10.1001/jamadermatol.2023.0091.
Telemedicine use accelerated during the COVID-19 pandemic, and skin conditions were a common use case. However, many images submitted may be of insufficient quality for making a clinical determination.
To determine whether an artificial intelligence (AI) decision support tool, a machine learning algorithm, could improve the quality of images submitted for telemedicine by providing real-time feedback and explanations to patients.
DESIGN, SETTING, AND PARTICIPANTS: This quality improvement study with an AI performance component and single-arm clinical pilot study component was conducted from March 2020 to October 2021. After training, the AI decision support tool was tested on 357 retrospectively collected telemedicine images from Stanford telemedicine from March 2020 to June 2021. Subsequently, a single-arm clinical pilot study was conducted to assess feasibility with 98 patients in the Stanford Department of Dermatology across 2 clinical sites from July 2021 to October 2021. For the clinical pilot study, inclusion criteria for patients included being adults (aged ≥18 years), presenting to clinic for a skin condition, and being able to photograph their own skin with a smartphone.
During the clinical pilot study, patients were given a handheld smartphone device with a machine learning algorithm interface loaded and were asked to take images of any lesions of concern. Patients were able to review and retake photos prior to submitting, so each submitted photo met the patient's assumed standard of clinical acceptability. A machine learning algorithm then gave the patient feedback on whether the image was acceptable. If the image was rejected, the patient was provided a reason by the AI decision support tool and allowed to retake the photos.
The main outcome of the retrospective image analysis was the receiver operator curve area under the curve (ROC-AUC). The main outcome of the clinical pilot study was the image quality difference between the baseline images and the images approved by AI decision support.
Of the 98 patients included, the mean (SD) age was 49.8 (17.6) years, and 50 (51%) of the patients were male. On retrospective telemedicine images, the machine learning algorithm effectively identified poor-quality images (ROC-AUC of 0.78) and the reason for poor quality (blurry ROC-AUC of 0.84; lighting issues ROC-AUC of 0.70). The performance was consistent across age and sex. In the clinical pilot study, patient use of the machine learning algorithm was associated with improved image quality. An AI algorithm was associated with reduction in the number of patients with a poor-quality image by 68.0%.
In this quality improvement study, patients use of the AI decision support with a machine learning algorithm was associated with improved quality of skin disease photographs submitted for telemedicine use.
远程医疗在 COVID-19 大流行期间得到了加速应用,皮肤病是一个常见的应用案例。然而,许多提交的图像可能质量不够,无法做出临床诊断。
确定一种人工智能 (AI) 决策支持工具,一种机器学习算法,是否可以通过向患者提供实时反馈和解释来提高远程医疗提交图像的质量。
设计、设置和参与者:这是一项具有 AI 性能组件和单臂临床试点研究组件的质量改进研究,于 2020 年 3 月至 2021 年 10 月进行。在培训后,该 AI 决策支持工具在 2020 年 3 月至 2021 年 6 月斯坦福远程医疗的 357 张回顾性远程医疗图像上进行了测试。随后,2021 年 7 月至 2021 年 10 月,斯坦福皮肤科的 98 名患者在两个临床地点进行了单臂临床试点研究,以评估其可行性。对于临床试点研究,患者的纳入标准包括成年人(年龄≥18 岁)、因皮肤状况就诊和能够使用智能手机拍摄自己的皮肤。
在临床试点研究期间,患者获得了一台带有机器学习算法界面的手持智能手机,并被要求拍摄任何关注的病变的图像。患者可以在提交前查看和重新拍摄照片,因此提交的每张照片都符合患者假设的临床可接受标准。然后,机器学习算法会根据图像的质量向患者提供反馈。如果图像被拒绝,AI 决策支持工具会向患者提供拒绝的原因,并允许患者重新拍摄照片。
回顾性图像分析的主要结果是接收者操作曲线下的面积(ROC-AUC)。临床试点研究的主要结果是 AI 决策支持工具批准的基线图像和图像之间的图像质量差异。
在 98 名纳入的患者中,平均(SD)年龄为 49.8(17.6)岁,50 名(51%)患者为男性。在回顾性远程医疗图像中,机器学习算法能够有效地识别出低质量图像(ROC-AUC 为 0.78)和低质量的原因(模糊 ROC-AUC 为 0.84;光照问题 ROC-AUC 为 0.70)。该性能在年龄和性别方面表现一致。在临床试点研究中,患者使用机器学习算法与改善皮肤病照片的质量有关。AI 算法将图像质量差的患者数量减少了 68.0%。
在这项质量改进研究中,患者使用带有机器学习算法的 AI 决策支持与提高远程医疗使用的皮肤病照片质量有关。