• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习识别沙特阿拉伯语儿童语音障碍的语音生物标志物

Machine Learning-Based Identification of Phonological Biomarkers for Speech Sound Disorders in Saudi Arabic-Speaking Children.

作者信息

Turki Deema F, Turki Ahmad F

机构信息

Speech and Hearing Pathology Department, Faculty of Medical Rehabilitation Sciences, King Abdulaziz University, Jeddah 21589, Saudi Arabia.

Electrical and Computer Engineering Department, Faculty of Engineering, King Abdulaziz University, Jeddah 21589, Saudi Arabia.

出版信息

Diagnostics (Basel). 2025 May 31;15(11):1401. doi: 10.3390/diagnostics15111401.

DOI:10.3390/diagnostics15111401
PMID:40506973
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12154118/
Abstract

This study investigates the application of machine learning (ML) techniques in diagnosing speech sound disorders (SSDs) in Saudi Arabic-speaking children, with a specific focus on phonological biomarkers, particularly Infrequent Variance (InfrVar), to improve diagnostic accuracy. SSDs are a significant concern in pediatric speech pathology, affecting an estimated 10-15% of preschool-aged children worldwide. However, accurate diagnosis remains challenging, especially in linguistically diverse populations. Traditional diagnostic tools, such as the Percentage of Consonants Correct (PCC), often fail to capture subtle phonological variations. This study explores the potential of machine learning models to enhance diagnostic accuracy by incorporating culturally relevant phonological biomarkers like InfrVar, aiming to develop a more effective diagnostic approach for SSDs in Saudi Arabic-speaking children. Data from 235 Saudi Arabic-speaking children aged 2;6 to 5;11 years were analyzed using several machine learning models: Random Forest, Support Vector Machine (SVM), XGBoost, Logistic Regression, K-Nearest Neighbors, and Naïve Bayes. The dataset was used to classify speech patterns into four categories: Atypical, Typical Development (TD), Articulation, and Delay. Phonological features such as Phonological Variance (PhonVar), InfrVar, and Percentage of Consonants Correct (PCC) were used as key variables. SHapley Additive exPlanations (SHAP) analysis was employed to interpret the contributions of individual features to model predictions. The XGBoost and Random Forest models demonstrated the highest performance, with an accuracy of 91.49% and an AUC of 99.14%. SHAP analysis revealed that articulation patterns and phonological patterns were the most influential features for distinguishing between Atypical and TD categories. The K-Means clustering approach identified four distinct subgroups based on speech development patterns: TD (46.61%), Articulation (25.42%), Atypical (18.64%), and Delay (9.32%). Machine learning models, particularly XGBoost and Random Forest, effectively classified speech development categories in Saudi Arabic-speaking children. This study highlights the importance of incorporating culturally specific phonological biomarkers like InfrVar and PhonVar to improve diagnostic precision for SSDs. These findings lay the groundwork for the development of AI-assisted diagnostic tools tailored to diverse linguistic contexts, enhancing early intervention strategies in pediatric speech pathology.

摘要

本研究调查了机器学习(ML)技术在诊断沙特阿拉伯语儿童语音障碍(SSD)中的应用,特别关注语音生物标志物,尤其是罕见变异(InfrVar),以提高诊断准确性。SSD是儿童言语病理学中的一个重要问题,估计全球有10 - 15%的学龄前儿童受其影响。然而,准确诊断仍然具有挑战性,尤其是在语言多样化的人群中。传统的诊断工具,如辅音正确率(PCC),往往无法捕捉到细微的语音变异。本研究探索了机器学习模型通过纳入如InfrVar等与文化相关的语音生物标志物来提高诊断准确性的潜力,旨在为沙特阿拉伯语儿童的SSD开发一种更有效的诊断方法。使用几种机器学习模型对235名年龄在2;6至5;11岁的沙特阿拉伯语儿童的数据进行了分析:随机森林、支持向量机(SVM)、XGBoost、逻辑回归、K近邻和朴素贝叶斯。该数据集用于将语音模式分为四类:非典型、典型发育(TD)、发音和延迟。语音变异(PhonVar)、InfrVar和辅音正确率(PCC)等语音特征被用作关键变量。采用SHapley加法解释(SHAP)分析来解释各个特征对模型预测的贡献。XGBoost和随机森林模型表现出最高的性能,准确率为91.49%,曲线下面积(AUC)为99.14%。SHAP分析表明,发音模式和语音模式是区分非典型和TD类别的最具影响力的特征。K均值聚类方法根据语音发展模式确定了四个不同的亚组:TD(46.61%)、发音(25.42%)、非典型(18.64%)和延迟(9.32%)。机器学习模型,特别是XGBoost和随机森林,有效地对沙特阿拉伯语儿童的语音发展类别进行了分类。本研究强调了纳入如InfrVar和PhonVar等特定文化的语音生物标志物以提高SSD诊断精度的重要性。这些发现为开发针对不同语言背景的人工智能辅助诊断工具奠定了基础,增强了儿童言语病理学中的早期干预策略。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e50/12154118/eaa0405366e0/diagnostics-15-01401-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e50/12154118/c7ec3826f1dd/diagnostics-15-01401-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e50/12154118/4452b51584f4/diagnostics-15-01401-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e50/12154118/3fae6bcec9d1/diagnostics-15-01401-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e50/12154118/eb7b7dcfbf5d/diagnostics-15-01401-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e50/12154118/eaa0405366e0/diagnostics-15-01401-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e50/12154118/c7ec3826f1dd/diagnostics-15-01401-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e50/12154118/4452b51584f4/diagnostics-15-01401-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e50/12154118/3fae6bcec9d1/diagnostics-15-01401-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e50/12154118/eb7b7dcfbf5d/diagnostics-15-01401-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e50/12154118/eaa0405366e0/diagnostics-15-01401-g005.jpg

相似文献

1
Machine Learning-Based Identification of Phonological Biomarkers for Speech Sound Disorders in Saudi Arabic-Speaking Children.基于机器学习识别沙特阿拉伯语儿童语音障碍的语音生物标志物
Diagnostics (Basel). 2025 May 31;15(11):1401. doi: 10.3390/diagnostics15111401.
2
Speech sound development in typically developing 2-7-year-old Dutch-speaking children: A normative cross-sectional study.荷兰语母语 2-7 岁儿童言语声音发展的正常横断面研究。
Int J Lang Commun Disord. 2020 Nov;55(6):971-987. doi: 10.1111/1460-6984.12575. Epub 2020 Oct 27.
3
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
4
The agreement of phonetic transcriptions between paediatric speech and language therapists transcribing a disordered speech sample.儿科言语和语言治疗师转写语音样本的音标转录的一致性。
Int J Lang Commun Disord. 2024 Sep-Oct;59(5):1981-1995. doi: 10.1111/1460-6984.13043. Epub 2024 Jun 8.
5
Phonological development in school-aged Kuwaiti Arabic children with Down syndrome: A pilot study.患有唐氏综合征的科威特阿拉伯学龄儿童的语音发展:一项初步研究。
J Commun Disord. 2021 Sep-Oct;93:106128. doi: 10.1016/j.jcomdis.2021.106128. Epub 2021 Jun 2.
6
A survey of interventions used by speech-language pathologists for children with speech sound disorders in the Middle East.中东地区言语语言病理学家用于治疗儿童言语语音障碍的干预措施调查。
Logoped Phoniatr Vocol. 2023 Jul;48(2):67-74. doi: 10.1080/14015439.2021.1991469. Epub 2021 Dec 9.
7
Phonological and articulation treatment approaches in Portuguese children with speech and language impairments: a randomized controlled intervention study.葡萄牙语儿童言语语言障碍的语音和发音治疗方法:一项随机对照干预研究。
Int J Lang Commun Disord. 2013 Mar-Apr;48(2):172-87. doi: 10.1111/j.1460-6984.2012.00191.x. Epub 2012 Dec 4.
8
Phonological variations in typically-developing Italian-speaking children aged 3;0-4;11.3 岁零 0 个月至 4 岁 11 个月的意大利语母语儿童的语音变化。
Clin Linguist Phon. 2022 Mar 4;36(2-3):241-259. doi: 10.1080/02699206.2021.1945686. Epub 2021 Jul 22.
9
Identifying determinants of malnutrition in under-five children in Bangladesh: insights from the BDHS-2022 cross-sectional study.确定孟加拉国五岁以下儿童营养不良的决定因素:来自2022年孟加拉国人口与健康调查横断面研究的见解
Sci Rep. 2025 Apr 24;15(1):14336. doi: 10.1038/s41598-025-99288-y.
10
Diagnostic validity, accuracy and inter-rater reliability of a phonological assessment for Danish-speaking children.丹麦语儿童语音评估的诊断有效性、准确性和评分者间可靠性。
J Commun Disord. 2022 Jan-Feb;95:106168. doi: 10.1016/j.jcomdis.2021.106168. Epub 2021 Nov 16.

本文引用的文献

1
Machine learning in point-of-care testing: innovations, challenges, and opportunities.即时检验中的机器学习:创新、挑战与机遇
Nat Commun. 2025 Apr 2;16(1):3165. doi: 10.1038/s41467-025-58527-6.
2
Evaluating Heart Rate Variability as a Biomarker for Autonomic Function in Parkinson's Disease Rehabilitation: A Clustering-Based Analysis of Exercise-Induced Changes.评估心率变异性作为帕金森病康复中自主神经功能生物标志物:基于聚类的运动诱导变化分析
Medicina (Kaunas). 2025 Mar 17;61(3):527. doi: 10.3390/medicina61030527.
3
Image classification-driven speech disorder detection using deep learning technique.
使用深度学习技术的图像分类驱动的言语障碍检测
SLAS Technol. 2025 Jun;32:100261. doi: 10.1016/j.slast.2025.100261. Epub 2025 Mar 6.
4
Artificial Intelligence-Assisted Speech Therapy for /ɹ/: A Single-Case Experimental Study.人工智能辅助的/ɹ/音言语治疗:一项单病例实验研究。
Am J Speech Lang Pathol. 2024 Sep 18;33(5):2461-2486. doi: 10.1044/2024_AJSLP-23-00448. Epub 2024 Aug 22.
5
The development of a novel, standardized, norm-referenced Arabic Discourse Assessment Tool (ADAT), including an examination of psychometric properties of discourse measures in aphasia.开发一种新型、标准化、基于常模的阿拉伯语语篇评估工具(ADAT),包括评估失语症患者语篇测量的心理测量特性。
Int J Lang Commun Disord. 2024 Sep-Oct;59(5):2103-2117. doi: 10.1111/1460-6984.13083. Epub 2024 Jun 18.
6
Exploring the Role of Machine Learning in Diagnosing and Treating Speech Disorders: A Systematic Literature Review.探索机器学习在言语障碍诊断与治疗中的作用:一项系统文献综述。
Psychol Res Behav Manag. 2024 May 31;17:2205-2232. doi: 10.2147/PRBM.S460283. eCollection 2024.
7
Enhancing Pediatric Adnexal Torsion Diagnosis: Prediction Method Utilizing Machine Learning Techniques.增强小儿附件扭转诊断:利用机器学习技术的预测方法
Children (Basel). 2023 Sep 27;10(10):1612. doi: 10.3390/children10101612.
8
Feature engineering and machine learning for computer-assisted screening of children with speech disorders.用于计算机辅助筛查言语障碍儿童的特征工程与机器学习
PLOS Digit Health. 2022 May 26;1(5):e0000041. doi: 10.1371/journal.pdig.0000041. eCollection 2022 May.
9
Intelligent speech technologies for transcription, disease diagnosis, and medical equipment interactive control in smart hospitals: A review.智能医院中用于转录、疾病诊断和医疗设备交互控制的智能语音技术:综述。
Comput Biol Med. 2023 Feb;153:106517. doi: 10.1016/j.compbiomed.2022.106517. Epub 2023 Jan 5.
10
Deep-Learning-Based Automated Classification of Chinese Speech Sound Disorders.基于深度学习的汉语语音障碍自动分类
Children (Basel). 2022 Jul 1;9(7):996. doi: 10.3390/children9070996.