Google Health, Palo Alto, California.
Google Health via Advanced Clinical, Deerfield, Illinois.
JAMA Netw Open. 2021 Apr 1;4(4):e217249. doi: 10.1001/jamanetworkopen.2021.7249.
Most dermatologic cases are initially evaluated by nondermatologists such as primary care physicians (PCPs) or nurse practitioners (NPs).
To evaluate an artificial intelligence (AI)-based tool that assists with diagnoses of dermatologic conditions.
DESIGN, SETTING, AND PARTICIPANTS: This multiple-reader, multiple-case diagnostic study developed an AI-based tool and evaluated its utility. Primary care physicians and NPs retrospectively reviewed an enriched set of cases representing 120 different skin conditions. Randomization was used to ensure each clinician reviewed each case either with or without AI assistance; each clinician alternated between batches of 50 cases in each modality. The reviews occurred from February 21 to April 28, 2020. Data were analyzed from May 26, 2020, to January 27, 2021.
An AI-based assistive tool for interpreting clinical images and associated medical history.
The primary analysis evaluated agreement with reference diagnoses provided by a panel of 3 dermatologists for PCPs and NPs. Secondary analyses included diagnostic accuracy for biopsy-confirmed cases, biopsy and referral rates, review time, and diagnostic confidence.
Forty board-certified clinicians, including 20 PCPs (14 women [70.0%]; mean experience, 11.3 [range, 2-32] years) and 20 NPs (18 women [90.0%]; mean experience, 13.1 [range, 2-34] years) reviewed 1048 retrospective cases (672 female [64.2%]; median age, 43 [interquartile range, 30-56] years; 41 920 total reviews) from a teledermatology practice serving 11 sites and provided 0 to 5 differential diagnoses per case (mean [SD], 1.6 [0.7]). The PCPs were located across 12 states, and the NPs practiced in primary care without physician supervision across 9 states. The NPs had a mean of 13.1 (range, 2-34) years of experience and practiced in primary care without physician supervision across 9 states. Artificial intelligence assistance was significantly associated with higher agreement with reference diagnoses. For PCPs, the increase in diagnostic agreement was 10% (95% CI, 8%-11%; P < .001), from 48% to 58%; for NPs, the increase was 12% (95% CI, 10%-14%; P < .001), from 46% to 58%. In secondary analyses, agreement with biopsy-obtained diagnosis categories of maglignant, precancerous, or benign increased by 3% (95% CI, -1% to 7%) for PCPs and by 8% (95% CI, 3%-13%) for NPs. Rates of desire for biopsies decreased by 1% (95% CI, 0-3%) for PCPs and 2% (95% CI, 1%-3%) for NPs; the rate of desire for referrals decreased by 3% (95% CI, 1%-4%) for PCPs and NPs. Diagnostic agreement on cases not indicated for a dermatologist referral increased by 10% (95% CI, 8%-12%) for PCPs and 12% (95% CI, 10%-14%) for NPs, and median review time increased slightly by 5 (95% CI, 0-8) seconds for PCPs and 7 (95% CI, 5-10) seconds for NPs per case.
Artificial intelligence assistance was associated with improved diagnoses by PCPs and NPs for 1 in every 8 to 10 cases, indicating potential for improving the quality of dermatologic care.
大多数皮肤科病例最初由非皮肤科医生(如初级保健医生(PCP)或执业护士(NP))进行评估。
评估一种人工智能(AI)辅助工具,该工具可协助诊断皮肤科疾病。
设计、设置和参与者:本多读者、多案例诊断研究开发了一种人工智能工具,并评估了其效用。初级保健医生和执业护士回顾了一组代表 120 种不同皮肤状况的丰富病例。为确保每个临床医生都以或不使用 AI 辅助的方式查看每个病例,采用了随机化方法;每个临床医生在每个模式下交替进行 50 例批次。审查于 2020 年 2 月 21 日至 4 月 28 日进行。数据分析于 2020 年 5 月 26 日至 2021 年 1 月 27 日进行。
用于解释临床图像和相关病史的基于 AI 的辅助工具。
主要分析评估了与由 3 名皮肤科医生组成的小组提供的参考诊断的一致性,对象为 PCP 和 NP。次要分析包括活检确诊病例的诊断准确性、活检和转诊率、审查时间和诊断信心。
40 名认证临床医生,包括 20 名 PCP(14 名女性[70.0%];平均经验 11.3 [范围,2-32]年)和 20 名 NP(18 名女性[90.0%];平均经验 13.1 [范围,2-34]年)回顾了来自服务于 11 个地点的远程皮肤科实践的 1048 例回顾性病例(672 例女性[64.2%];中位年龄 43 [四分位距,30-56]岁;总共 41920 次审查),每个病例提供 0 至 5 种鉴别诊断(平均[标准差],1.6 [0.7])。PCP 分布在 12 个州,NP 在 9 个州没有医生监督的情况下从事初级保健工作。NP 的平均工作经验为 13.1 年(范围,2-34 年),并在 9 个州没有医生监督的情况下从事初级保健工作。人工智能辅助与参考诊断的更高一致性显著相关。对于 PCP,诊断一致性的提高为 10%(95%CI,8%-11%;P<0.001),从 48%增加到 58%;对于 NP,增加了 12%(95%CI,10%-14%;P<0.001),从 46%增加到 58%。在次要分析中,PCP 活检获得的恶性、癌前或良性诊断类别的一致性增加了 3%(95%CI,-1%至 7%),NP 增加了 8%(95%CI,3%-13%)。PCP 和 NP 对活检的需求意愿降低了 1%(95%CI,0-3%)和 2%(95%CI,1%-3%);转诊需求率下降了 3%(95%CI,1%-4%)。对于不需要皮肤科医生转诊的病例,PCP 和 NP 的诊断一致性增加了 10%(95%CI,8%-12%)和 12%(95%CI,10%-14%),PCP 和 NP 每例的平均审查时间略有增加 5 秒(95%CI,0-8 秒)和 7 秒(95%CI,5-10 秒)。
人工智能辅助对 PCP 和 NP 每 8 到 10 例中的 1 例进行了改进诊断,表明有可能改善皮肤科护理质量。