Megerian Jonathan T, Dey Sangeeta, Melmed Raun D, Coury Daniel L, Lerner Marc, Nicholls Christopher J, Sohl Kristin, Rouhbakhsh Rambod, Narasimhan Anandhi, Romain Jonathan, Golla Sailaja, Shareef Safiullah, Ostrovsky Andrey, Shannon Jennifer, Kraft Colleen, Liu-Mayo Stuart, Abbas Halim, Gal-Szabo Diana E, Wall Dennis P, Taraman Sharief
CHOC Children's, Orange, CA, USA.
University of California, Irvine School of Medicine, Department of Pediatrics, Irvine, CA, USA.
NPJ Digit Med. 2022 May 5;5(1):57. doi: 10.1038/s41746-022-00598-6.
Autism spectrum disorder (ASD) can be reliably diagnosed at 18 months, yet significant diagnostic delays persist in the United States. This double-blinded, multi-site, prospective, active comparator cohort study tested the accuracy of an artificial intelligence-based Software as a Medical Device designed to aid primary care healthcare providers (HCPs) in diagnosing ASD. The Device combines behavioral features from three distinct inputs (a caregiver questionnaire, analysis of two short home videos, and an HCP questionnaire) in a gradient boosted decision tree machine learning algorithm to produce either an ASD positive, ASD negative, or indeterminate output. This study compared Device outputs to diagnostic agreement by two or more independent specialists in a cohort of 18-72-month-olds with developmental delay concerns (425 study completers, 36% female, 29% ASD prevalence). Device output PPV for all study completers was 80.8% (95% confidence intervals (CI), 70.3%-88.8%) and NPV was 98.3% (90.6%-100%). For the 31.8% of participants who received a determinate output (ASD positive or negative) Device sensitivity was 98.4% (91.6%-100%) and specificity was 78.9% (67.6%-87.7%). The Device's indeterminate output acts as a risk control measure when inputs are insufficiently granular to make a determinate recommendation with confidence. If this risk control measure were removed, the sensitivity for all study completers would fall to 51.6% (63/122) (95% CI 42.4%, 60.8%), and specificity would fall to 18.5% (56/303) (95% CI 14.3%, 23.3%). Among participants for whom the Device abstained from providing a result, specialists identified that 91% had one or more complex neurodevelopmental disorders. No significant differences in Device performance were found across participants' sex, race/ethnicity, income, or education level. For nearly a third of this primary care sample, the Device enabled timely diagnostic evaluation with a high degree of accuracy. The Device shows promise to significantly increase the number of children able to be diagnosed with ASD in a primary care setting, potentially facilitating earlier intervention and more efficient use of specialist resources.
自闭症谱系障碍(ASD)在18个月大时即可得到可靠诊断,但在美国,显著的诊断延迟现象依然存在。这项双盲、多中心、前瞻性、活性对照队列研究测试了一款基于人工智能的作为医疗设备的软件在辅助初级保健医疗服务提供者(HCP)诊断ASD方面的准确性。该设备在梯度提升决策树机器学习算法中结合了来自三种不同输入(一份照顾者问卷、两段简短家庭视频的分析以及一份HCP问卷)的行为特征,以产生ASD阳性、ASD阴性或不确定的输出结果。本研究将该设备的输出结果与一组年龄在18至72个月、有发育迟缓问题的儿童(425名研究完成者,36%为女性,ASD患病率为29%)中两名或更多独立专家的诊断一致性进行了比较。所有研究完成者的设备输出阳性预测值为80.8%(95%置信区间(CI),70.3% - 88.8%),阴性预测值为98.3%(90.6% - 100%)。对于31.8%收到确定性输出结果(ASD阳性或阴性)的参与者,设备的灵敏度为98.4%(91.6% - 100%),特异性为78.9%(67.6% - 87.7%)。当输入不够细致以至于无法自信地做出确定性推荐时,该设备的不确定输出起到了风险控制措施的作用。如果去除这一风险控制措施,所有研究完成者的灵敏度将降至51.6%(63/122)(95% CI 42.4%,60.8%),特异性将降至18.5%(56/303)(95% CI 14.3%,23.3%)。在该设备未给出结果的参与者中,专家们发现91%的人有一种或多种复杂的神经发育障碍。在参与者的性别、种族/族裔、收入或教育水平方面,未发现设备性能存在显著差异。对于近三分之一的初级保健样本,该设备能够以高度准确性进行及时的诊断评估。该设备有望显著增加在初级保健环境中能够被诊断为ASD的儿童数量, potentially facilitating earlier intervention and more efficient use of specialist resources.(此处英文原文有误,正确翻译为“这可能有助于更早地进行干预,并更有效地利用专家资源”)