Pan Ning, Chen Lifeng, Wu Bocheng, Chen Fangfang, Chen Jin, Huang Saijun, Guo Cuihua, Wu Jinqing, Wang Yujie, Chen Xian, Yang Shirui, Jing Jin, Weng Xuchu, Lin Lizi, Liang Jiuxing, Wang Xin
Key Laboratory of Brain, Cognition and Education Sciences, Ministry of Education, Institute for Brain Research and Rehabilitation, Guangdong Key Laboratory of Mental Health and Cognitive Science, South China Normal University, Guangzhou 510630, China.
Department of General Education, Guangzhou Huali College, Guangzhou 511325, China.
Psychiatry Res. 2025 Feb;344:116353. doi: 10.1016/j.psychres.2025.116353. Epub 2025 Jan 3.
Early screening for autism spectrum disorder (ASD) is crucial, yet current assessment tools in Chinese primary child care are limited in efficacy.
This study aims to employ machine learning algorithms to identify key indicators from the 20-item Modified Checklist for Autism in Toddlers, revised (M-CHAT-R) combining with ASD-related sociodemographic and environmental factors, to distinguish ASD from typically developing children.
Data from our prior validation study of the Chinese M-CHAT-R (August 2016-March 2017, n = 6,049 toddlers) were reviewed. We extracted the 20-item M-CHAT-R data and integrated 17 sociodemographic and environmental risk factors associated with ASD development to strengthen M-CHAT-R's machine learning screening. Five feature selection methods were used to extract subsets from the original set. Six machine learning algorithms were applied to identify the optimal subset distinguishing clinically diagnosed ASD toddlers from typically developing toddlers.
Nine features were grouped into three subsets: subset 1 contained unanimously recommended items (A1 [Follows point], A3 [Pretend play], A9 [Brings objects to show], A10 [Response to name] and A16 [Gazing following]). Subset 2 added two items (A17 [Gaining parent's attention] and A18 [Understands what is said]), and subset 3 included two more items (A8 [Interest in other children] and child's age). The top-performing algorithm resulted in a seven-item classifier of subset 2 with 92.5 % sensitivity, 90.1 % specificity, and 10.0 % positive predictive value.
Machine learning classifiers effectively differentiate ASD toddlers from typically developing toddlers using a reduced M-CHAT-R item set.
This highlights the clinical significance of machine learning-optimized models for ASD screening in primary health care centers and broader applications.
自闭症谱系障碍(ASD)的早期筛查至关重要,但中国基层儿童保健机构目前的评估工具效果有限。
本研究旨在运用机器学习算法,从20项修订版幼儿自闭症修正检查表(M-CHAT-R)中识别关键指标,并结合与ASD相关的社会人口学和环境因素,以区分ASD儿童与发育正常的儿童。
回顾了我们之前对中文版M-CHAT-R进行验证研究(2016年8月至2017年3月,n = 6049名幼儿)的数据。我们提取了20项M-CHAT-R数据,并整合了17项与ASD发展相关的社会人口学和环境风险因素,以加强M-CHAT-R的机器学习筛查。使用五种特征选择方法从原始数据集中提取子集。应用六种机器学习算法来识别区分临床诊断的ASD幼儿与发育正常幼儿的最佳子集。
九个特征被分为三个子集:子集1包含一致推荐的项目(A1[听从指令]、A3[假装游戏]、A9[拿东西展示]、A10[对名字的反应]和A16[目光追随])。子集2增加了两个项目(A17[引起家长注意]和A18[理解所说内容]),子集3又增加了两个项目(A8[对其他孩子的兴趣]和孩子的年龄)。表现最佳的算法产生了一个子集2的七项分类器,灵敏度为92.5%,特异度为90.1%,阳性预测值为10.0%。
机器学习分类器使用精简的M-CHAT-R项目集有效地将ASD幼儿与发育正常的幼儿区分开来。
这凸显了机器学习优化模型在基层医疗保健中心进行ASD筛查及更广泛应用的临床意义。