• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用深度学习技术的自动口吃检测

Automated Stuttering Detection Using Deep Learning Techniques.

作者信息

Alhakbani Noura, Alnashwan Raghad, Al-Nafjan Abeer, Almudhi Abdulaziz

机构信息

Information Technology Department, College of Computer and Information Sciences, King Saud University, Riyadh 11543, Saudi Arabia.

Computer Science Department, College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University, Riyadh 11432, Saudi Arabia.

出版信息

J Clin Med. 2025 May 19;14(10):3552. doi: 10.3390/jcm14103552.

DOI:10.3390/jcm14103552
PMID:40429548
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12111818/
Abstract

Disfluencies such as repetitions, prolongations, interjections, and blocks in sounds, syllables, or words can sometimes hinder communication. Currently, disfluencies are manually measured, which has inherent limitations, such as being time-consuming and subjective, which can lead to inconsistencies in measurement. To address these challenges, this study presents an innovative automated system for detecting disfluencies utilizing advanced artificial intelligence technologies; specifically, deep learning models such as convolutional neural networks (CNN) and convolutional long short-term memory (ConvLSTM). The system was evaluated using two benchmark datasets: FluencyBank and SEP-28K. Our proposed system demonstrates remarkable performance, achieving detection accuracies of 0.97 and 0.96, respectively, for CNNs and ConvLSTM models. These results not only exceed those of prior studies but also highlight the effectiveness of our approach in enhancing stuttering evaluation. : By providing a reliable and efficient tool for professionals in therapeutic settings, our system represents a significant advancement in the field, offering improved outcomes for individuals affected by stuttering.

摘要

诸如重复、延长、插入语以及声音、音节或单词中的停顿等言语不流畅有时会妨碍交流。目前,言语不流畅是通过人工测量的,这存在固有的局限性,比如耗时且主观,可能导致测量结果不一致。为应对这些挑战,本研究提出了一种利用先进人工智能技术检测言语不流畅的创新自动化系统;具体而言,是利用卷积神经网络(CNN)和卷积长短期记忆网络(ConvLSTM)等深度学习模型。该系统使用两个基准数据集进行了评估:FluencyBank和SEP - 28K。我们提出的系统表现出色,CNN模型和ConvLSTM模型的检测准确率分别达到了0.97和0.96。这些结果不仅超过了先前研究的结果,还突出了我们的方法在加强口吃评估方面的有效性。通过为治疗环境中的专业人员提供可靠且高效的工具,我们的系统代表了该领域的重大进步,为受口吃影响的个体带来了更好的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04c6/12111818/421e95436ad1/jcm-14-03552-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04c6/12111818/b622e3a4180f/jcm-14-03552-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04c6/12111818/c26b7ba613e3/jcm-14-03552-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04c6/12111818/d875b75ea7e1/jcm-14-03552-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04c6/12111818/a2c19c2c2dc4/jcm-14-03552-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04c6/12111818/47bd26712863/jcm-14-03552-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04c6/12111818/421e95436ad1/jcm-14-03552-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04c6/12111818/b622e3a4180f/jcm-14-03552-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04c6/12111818/c26b7ba613e3/jcm-14-03552-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04c6/12111818/d875b75ea7e1/jcm-14-03552-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04c6/12111818/a2c19c2c2dc4/jcm-14-03552-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04c6/12111818/47bd26712863/jcm-14-03552-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04c6/12111818/421e95436ad1/jcm-14-03552-g006.jpg

相似文献

1
Automated Stuttering Detection Using Deep Learning Techniques.使用深度学习技术的自动口吃检测
J Clin Med. 2025 May 19;14(10):3552. doi: 10.3390/jcm14103552.
2
Stuttering (Stammering)口吃
3
TranStutter: A Convolution-Free Transformer-Based Deep Learning Method to Classify Stuttered Speech Using 2D Mel-Spectrogram Visualization and Attention-Based Feature Representation.TransStutter:一种基于卷积的深度学习方法,使用 2D Mel 频谱图可视化和基于注意力的特征表示来分类口吃语音。
Sensors (Basel). 2023 Sep 22;23(19):8033. doi: 10.3390/s23198033.
4
Reduced duration of stuttering-like disfluencies and consistent anticipatory slowing during an adaptation task.在适应任务中,口吃样不流畅的持续时间减少,并且存在一致的预期减速。
Int J Lang Commun Disord. 2023 Nov-Dec;58(6):2162-2177. doi: 10.1111/1460-6984.12930. Epub 2023 Jul 13.
5
Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.使用卷积神经网络和VGG16在磁共振成像(MRI)中进行脑肿瘤分割与检测
Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.
6
FluencyBank Timestamped: An Updated Data Set for Disfluency Detection and Automatic Intended Speech Recognition.流畅度银行时间戳数据集:用于不流畅检测和自动意图语音识别的更新数据集。
J Speech Lang Hear Res. 2024 Nov 7;67(11):4203-4215. doi: 10.1044/2024_JSLHR-24-00070. Epub 2024 Oct 8.
7
A Comprehensive Analysis of Speech Disfluencies in Autistic Young Adults and Control Young Adults: Group Differences in Typical, Stuttering-Like, and Atypical Disfluencies.自闭症青年与正常青年言语不流畅的综合分析:典型、类口吃和非典型不流畅的组间差异。
J Speech Lang Hear Res. 2023 Mar 7;66(3):832-848. doi: 10.1044/2022_JSLHR-22-00265. Epub 2023 Feb 10.
8
Effect of delayed auditory feedback on stuttering-like disfluencies.延迟听觉反馈对类口吃不流畅性的影响。
Int J Lang Commun Disord. 2025 Mar-Apr;60(2):e70007. doi: 10.1111/1460-6984.70007.
9
A case study of bilingual neurogenic stuttering: Measures of fluency, emotion, and articulation rate.双语神经性口吃的案例研究:流畅性、情绪及发音速率的测量
J Commun Disord. 2024 Jan-Feb;107:106385. doi: 10.1016/j.jcomdis.2023.106385. Epub 2023 Dec 8.
10
Unique Patterns of Bilingual Speech: Factors Affecting Disfluency Rates in Russian-Hebrew Bilingual Children.双语言语的独特模式:影响俄以双语儿童不流畅率的因素。
J Speech Lang Hear Res. 2023 Dec 11;66(12):4896-4912. doi: 10.1044/2023_JSLHR-23-00048. Epub 2023 Nov 6.

引用本文的文献

1
An Advanced CNN-Based Framework for the Automated Detection of Uncovered PET in Recycling Streams.一种基于先进卷积神经网络的框架,用于自动检测回收流中未覆盖的正电子发射断层扫描(PET)。
Polymers (Basel). 2025 Jun 22;17(13):1736. doi: 10.3390/polym17131736.

本文引用的文献

1
TranStutter: A Convolution-Free Transformer-Based Deep Learning Method to Classify Stuttered Speech Using 2D Mel-Spectrogram Visualization and Attention-Based Feature Representation.TransStutter:一种基于卷积的深度学习方法,使用 2D Mel 频谱图可视化和基于注意力的特征表示来分类口吃语音。
Sensors (Basel). 2023 Sep 22;23(19):8033. doi: 10.3390/s23198033.
2
Advancing Stuttering Detection via Data Augmentation, Class-Balanced Loss and Multi-Contextual Deep Learning.通过数据增强、类别平衡损失和多上下文深度学习推进口吃检测。
IEEE J Biomed Health Inform. 2023 May;27(5):2553-2564. doi: 10.1109/JBHI.2023.3248281. Epub 2023 May 4.
3
An effective up-sampling approach for breast cancer prediction with imbalanced data: A machine learning model-based comparative analysis.
基于机器学习模型的不平衡数据乳腺癌预测的有效上采样方法:比较分析。
PLoS One. 2022 May 27;17(5):e0269135. doi: 10.1371/journal.pone.0269135. eCollection 2022.
4
Stuttering: Clinical and research update.口吃:临床与研究进展
Can Fam Physician. 2016 Jun;62(6):479-84.