Shazman Shula, Carmel Julie, Itkin Maxim, Malitsky Sergey, Shalan Monia, Soreq Eyal, Elliott Evan, Lebow Maya, Kuperman Yael
Department of Mathematics and Computer Science, The Open University of Israel, Raanana 4353701, Israel.
Azrieli Faculty of Medicine, Bar Ilan University, Safed 1311502, Israel.
Metabolites. 2025 May 16;15(5):332. doi: 10.3390/metabo15050332.
Autism spectrum disorder (ASD) diagnosis traditionally relies on behavioral assessments, which can be subjective and often lead to delayed identification. Recent advances in metabolomics and machine learning offer promising alternatives for more objective and precise diagnostic approaches.
First-morning urine samples were collected from 52 children (32 with ASD and 20 neurotypical controls), aged 5.04 ± 1.87 and 5.50 ± 1.74 years, respectively. Using liquid chromatography-mass spectrometry (LC-MS), 293 metabolites were identified and categorized into 189 endogenous and 104 exogenous metabolites. Various machine learning classifiers (random forest, logistic regression, random tree, and naïve Bayes) were applied to differentiate ASD and control groups through 10-fold cross-validation.
The random forest classifier achieved 85% accuracy and an area under the curve (AUC) of 0.9 using all 293 metabolites. Classification based solely on endogenous metabolites yielded 85% accuracy and an AUC of 0.86, whereas using exogenous metabolites alone resulted in lower performance (71% accuracy and an AUC of 0.72).
This study demonstrates the potential of urine metabolomic profiling, particularly endogenous metabolites, as a complementary diagnostic tool for ASD. The high classification accuracy highlights the feasibility of developing assistive diagnostic methods based on metabolite profiles, although further research is needed to link these profiles to specific behavioral characteristics and ASD subtypes.
自闭症谱系障碍(ASD)的诊断传统上依赖于行为评估,这种评估可能具有主观性,且常常导致诊断延迟。代谢组学和机器学习的最新进展为更客观、精确的诊断方法提供了有前景的替代方案。
收集了52名儿童的晨尿样本,其中32名患有ASD,20名是神经典型对照儿童,年龄分别为5.04±1.87岁和5.50±1.74岁。使用液相色谱 - 质谱联用(LC-MS)技术,鉴定出293种代谢物,并将其分为189种内源性代谢物和104种外源性代谢物。应用多种机器学习分类器(随机森林、逻辑回归、随机树和朴素贝叶斯)通过10折交叉验证来区分ASD组和对照组。
使用全部293种代谢物时,随机森林分类器的准确率达到85%,曲线下面积(AUC)为0.9。仅基于内源性代谢物进行分类时,准确率为85%,AUC为0.86,而仅使用外源性代谢物时性能较低(准确率71%,AUC为0.72)。
本研究证明了尿液代谢组学分析,特别是内源性代谢物,作为ASD补充诊断工具的潜力。高分类准确率突出了基于代谢物谱开发辅助诊断方法的可行性,尽管需要进一步研究将这些谱与特定行为特征和ASD亚型联系起来。