Institute of Applied Health Research, University of Birmingham, Edgbaston, Birmingham B15 2TT, UK.
Warwick Medical School, University of Warwick, Coventry, UK.
BMJ. 2020 Feb 10;368:m127. doi: 10.1136/bmj.m127.
To examine the validity and findings of studies that examine the accuracy of algorithm based smartphone applications ("apps") to assess risk of skin cancer in suspicious skin lesions.
Systematic review of diagnostic accuracy studies.
Cochrane Central Register of Controlled Trials, MEDLINE, Embase, CINAHL, CPCI, Zetoc, Science Citation Index, and online trial registers (from database inception to 10 April 2019).
Studies of any design that evaluated algorithm based smartphone apps to assess images of skin lesions suspicious for skin cancer. Reference standards included histological diagnosis or follow-up, and expert recommendation for further investigation or intervention. Two authors independently extracted data and assessed validity using QUADAS-2 (Quality Assessment of Diagnostic Accuracy Studies 2 tool). Estimates of sensitivity and specificity were reported for each app.
Nine studies that evaluated six different identifiable smartphone apps were included. Six verified results by using histology or follow-up (n=725 lesions), and three verified results by using expert recommendations (n=407 lesions). Studies were small and of poor methodological quality, with selective recruitment, high rates of unevaluable images, and differential verification. Lesion selection and image acquisition were performed by clinicians rather than smartphone users. Two CE (Conformit Europenne) marked apps are available for download. SkinScan was evaluated in a single study (n=15, five melanomas) with 0% sensitivity and 100% specificity for the detection of melanoma. SkinVision was evaluated in two studies (n=252, 61 malignant or premalignant lesions) and achieved a sensitivity of 80% (95% confidence interval 63% to 92%) and a specificity of 78% (67% to 87%) for the detection of malignant or premalignant lesions. Accuracy of the SkinVision app verified against expert recommendations was poor (three studies).
Current algorithm based smartphone apps cannot be relied on to detect all cases of melanoma or other skin cancers. Test performance is likely to be poorer than reported here when used in clinically relevant populations and by the intended users of the apps. The current regulatory process for awarding the CE marking for algorithm based apps does not provide adequate protection to the public.
PROSPERO CRD42016033595.
评估基于算法的智能手机应用程序(“应用程序”)评估可疑皮肤病变皮肤癌风险的准确性的研究的有效性和结果。
诊断准确性研究的系统评价。
Cochrane 对照试验中心注册库、MEDLINE、Embase、CINAHL、CPCI、Zetoc、科学引文索引和在线试验登记处(从数据库创建到 2019 年 4 月 10 日)。
评估基于算法的智能手机应用程序以评估可疑皮肤癌皮肤病变图像的任何设计的研究。参考标准包括组织学诊断或随访,以及专家建议进一步调查或干预。两位作者独立提取数据并使用 QUADAS-2(诊断准确性研究的质量评估工具 2)评估有效性。为每个应用程序报告了敏感性和特异性的估计值。
共纳入了 9 项评估 6 种不同可识别智能手机应用程序的研究。6 项研究通过组织学或随访验证了结果(n=725 个病变),3 项研究通过专家建议验证了结果(n=407 个病变)。研究规模较小,方法学质量较差,存在选择性招募、高比例的不可评估图像以及差异验证。病变选择和图像采集由临床医生而不是智能手机用户进行。有两个 CE(欧洲合格评定)标记的应用程序可供下载。SkinScan 在一项研究中进行了评估(n=15,5 个黑色素瘤),黑色素瘤的检测敏感性为 0%,特异性为 100%。SkinVision 在两项研究中进行了评估(n=252,61 个恶性或癌前病变),对恶性或癌前病变的检测敏感性为 80%(95%置信区间 63%至 92%),特异性为 78%(67%至 87%)。SkinVision 应用程序针对专家建议的准确性评估结果不佳(三项研究)。
目前基于算法的智能手机应用程序不能依靠来检测所有黑色素瘤或其他皮肤癌病例。当在临床上相关的人群和应用程序的预期用户中使用时,测试性能可能比这里报告的要差。目前用于授予基于算法的应用程序 CE 标志的监管程序并不能为公众提供充分的保护。
PROSPERO CRD42016033595。