Duda M, Haber N, Daniels J, Wall D P
Division of Systems Medicine, Department of Pediatrics, Stanford University, Stanford, CA, USA.
Department of Biomedical Data Science, Stanford University, Stanford, CA, USA.
Transl Psychiatry. 2017 May 16;7(5):e1133. doi: 10.1038/tp.2017.86.
Autism spectrum disorder (ASD) and attention deficit hyperactivity disorder (ADHD) together affect >10% of the children in the United States, but considerable behavioral overlaps between the two disorders can often complicate differential diagnosis. Currently, there is no screening test designed to differentiate between the two disorders, and with waiting times from initial suspicion to diagnosis upwards of a year, methods to quickly and accurately assess risk for these and other developmental disorders are desperately needed. In a previous study, we found that four machine-learning algorithms were able to accurately (area under the curve (AUC)>0.96) distinguish ASD from ADHD using only a small subset of items from the Social Responsiveness Scale (SRS). Here, we expand upon our prior work by including a novel crowdsourced data set of responses to our predefined top 15 SRS-derived questions from parents of children with ASD (n=248) or ADHD (n=174) to improve our model's capability to generalize to new, 'real-world' data. By mixing these novel survey data with our initial archival sample (n=3417) and performing repeated cross-validation with subsampling, we created a classification algorithm that performs with AUC=0.89±0.01 using only 15 questions.
自闭症谱系障碍(ASD)和注意力缺陷多动障碍(ADHD)在美国共同影响着超过10%的儿童,但这两种疾病之间存在大量行为重叠,这常常使鉴别诊断变得复杂。目前,尚无专门用于区分这两种疾病的筛查测试,且从最初怀疑到诊断的等待时间长达一年以上,因此迫切需要能够快速、准确评估这些及其他发育障碍风险的方法。在之前的一项研究中,我们发现四种机器学习算法能够仅使用社交反应量表(SRS)中的一小部分项目,准确地(曲线下面积(AUC)>0.96)将ASD与ADHD区分开来。在此,我们扩展了之前的工作,纳入了一个新的众包数据集,该数据集来自ASD儿童(n = 248)或ADHD儿童(n = 174)的父母对我们预先定义的15个源自SRS的问题的回答,以提高我们模型对新的“现实世界”数据的泛化能力。通过将这些新的调查数据与我们最初的存档样本(n = 3417)混合,并使用子采样进行重复交叉验证,我们创建了一种分类算法,该算法仅使用15个问题,AUC = 0.89±0.01。