Yuwattana Wasana, Saeliw Thanit, van Erp Marlieke Lisanne, Poolcharoen Chayanit, Kanlayaprasit Songphon, Trairatvorakul Pon, Chonchaiya Weerasak, Hu Valerie W, Sarachana Tewarit
The Ph.D. Program in Clinical Biochemistry and Molecular Medicine, Department of Clinical Chemistry, Faculty of Allied Health Sciences, Chulalongkorn University, Bangkok, 10330, Thailand.
Chulalongkorn Autism Research and Innovation Center of Excellence (ChulaACE), Department of Clinical Chemistry, Faculty of Allied Health Sciences, Chulalongkorn University, Bangkok, 10330, Thailand.
Sci Rep. 2025 Apr 5;15(1):11712. doi: 10.1038/s41598-025-95291-5.
Autism spectrum disorder (ASD) presents significant challenges in diagnosis and intervention due to its diverse clinical manifestations and underlying biological complexity. This study explored machine learning approaches to enhance ASD screening accuracy and identify meaningful subtypes using clinical assessments from AGRE database integrated with molecular data from GSE15402. Analysis of ADI-R scores from a large cohort of 2794 individuals demonstrated that deep learning models could achieve exceptional screening accuracy of 95.23% (CI 94.32-95.99%). Notably, comparable performance was maintained using a streamlined set of just 27 ADI-R sub-items, suggesting potential for more efficient diagnostic tools. Clustering analyses revealed three distinct subgroups identifiable through both clinical symptoms and gene expression patterns. When ASD were grouped based on clinical features, stronger associations emerged between symptoms and underlying molecular profiles compared to grouping based on gene expression alone. These findings suggest that starting with detailed clinical observations may be more effective for identifying biologically meaningful ASD subtypes than beginning with molecular data. This integrated approach combining clinical and molecular data through machine learning offers promising directions for developing more precise screening methods and personalized intervention strategies for individuals with ASD.
自闭症谱系障碍(ASD)因其多样的临床表现和潜在的生物学复杂性,在诊断和干预方面面临重大挑战。本研究探索了机器学习方法,以提高ASD筛查的准确性,并使用来自AGRE数据库的临床评估数据与来自GSE15402的分子数据相结合,来识别有意义的亚型。对2794名个体的大型队列的ADI-R评分分析表明,深度学习模型可以实现95.23%(置信区间94.32 - 95.99%)的卓越筛查准确率。值得注意的是,仅使用精简后的27个ADI-R子项目也能保持相当的性能,这表明更高效的诊断工具具有潜力。聚类分析揭示了通过临床症状和基因表达模式可识别的三个不同亚组。当根据临床特征对ASD进行分组时,与仅基于基因表达分组相比,症状与潜在分子特征之间出现了更强的关联。这些发现表明,从详细的临床观察开始可能比从分子数据开始更有效地识别具有生物学意义的ASD亚型。这种通过机器学习将临床和分子数据相结合的综合方法,为开发更精确的筛查方法和针对ASD个体的个性化干预策略提供了有前景的方向。