Yuan Jianbo, Holtz Chester, Smith Tristram, Luo Jiebo
0000 0004 1936 9174grid.16416.34Department of Computer Science, University of Rochester, Rochester, 14627 NY USA.
0000 0004 1936 9166grid.412750.5School of Medicine and Dentistry, University of Rochester Medical Center, Rochester, 14642 NY USA.
EURASIP J Bioinform Syst Biol. 2017 Feb 1;2017:3. doi: 10.1186/s13637-017-0057-1. eCollection 2017 Dec.
Autism spectrum disorder (ASD) is a developmental disorder that significantly impairs patients' ability to perform normal social interaction and communication. Moreover, the diagnosis procedure of ASD is highly time-consuming, labor-intensive, and requires extensive expertise. Although there exists no known cure for ASD, there is consensus among clinicians regarding the importance of early intervention for the recovery of ASD patients. Therefore, to benefit autism patients by enhancing their access to treatments such as early intervention, we aim to develop a robust machine learning-based system for autism detection by using Natural Language Processing techniques based on information extracted from medical forms of potential ASD patients. Our detecting framework involves converting semi-structured and unstructured medical forms into digital format, preprocessing, learning document representation, and finally, classification. Testing results are evaluated against the ground truth set by expert clinicians and the proposed system achieve a 83.4% accuracy and 91.1% recall, which is very promising. The proposed ASD detection framework could significantly simplify and shorten the procedure of ASD diagnosis.
自闭症谱系障碍(ASD)是一种发育障碍,会严重损害患者进行正常社交互动和沟通的能力。此外,ASD的诊断过程非常耗时、费力,且需要广泛的专业知识。虽然目前尚无已知的ASD治愈方法,但临床医生对于早期干预对ASD患者康复的重要性已达成共识。因此,为了通过增加自闭症患者获得早期干预等治疗的机会使其受益,我们旨在利用自然语言处理技术,基于从潜在ASD患者的医疗表格中提取的信息,开发一个强大的基于机器学习的自闭症检测系统。我们的检测框架包括将半结构化和非结构化医疗表格转换为数字格式、预处理、学习文档表示,最后进行分类。测试结果根据专家临床医生设定的真实情况进行评估,所提出的系统实现了83.4%的准确率和91.1%的召回率,这非常有前景。所提出的ASD检测框架可以显著简化和缩短ASD诊断程序。