• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

开发一种人工智能工具,以预测初级保健环境中的声带病变。

Developing an Artificial Intelligence Tool to Predict Vocal Cord Pathology in Primary Care Settings.

机构信息

Section of Otolaryngology-Head and Neck Surgery, Department of Surgery, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.

Department of Data Science and Analytics, Faculty of Science, University of Calgary, Calgary, Alberta, Canada.

出版信息

Laryngoscope. 2023 Aug;133(8):1952-1960. doi: 10.1002/lary.30432. Epub 2022 Oct 13.

DOI:10.1002/lary.30432
PMID:36226791
Abstract

OBJECTIVES

Diagnostic tools for voice disorders are lacking for primary care physicians. Artificial intelligence (AI) tools may add to the armamentarium for physicians, decreasing the time to diagnosis and limiting the burden of dysphonia.

METHODS

Voice recordings of patients were collected from 2019 to 2021 using smartphones. The Saarbruecken dataset was included for comparison. Audio files were converted to mel-spectrograms using TensorFlow. Diagnostic categories were created to group pathology, including neurological and muscular disorders, inflammatory, mass lesions, and normal. The samples were further separated into sustained/a/and the rainbow passage.

RESULTS

Two hundred three prospective samples and 1131 samples were used from the Saarbruecken database. The AI detected abnormal pathology with an F1-score of 98%. The artificial neural network (ANN) differentiated key pathologies, including unilateral paralysis, laryngitis, adductor spasmodic dysphonia (ADSD), mass lesions, and normal samples with 39%-87% F-1 scores. The Calgary database models had higher F-1 scores in a head-to-head comparison to the Saarbruecken and combined datasets (87% vs. 58% and 50%). The AI outperformed otolaryngologists using a standardized test set of recordings (83% compared to 55% ± 15%).

CONCLUSION

An AI tool was created to differentiate pathology by individual or categorical diagnosis with high evaluation metrics. Prospective data should be collected in a controlled fashion to reduce intrinsic variability between recordings. Multi-center data collaborations are imperative to increase the prediction capability of AI tools for detecting vocal cord pathology. We provide proof-of-concept for an AI tool to assist primary care physicians in managing dysphonic patients.

LEVEL OF EVIDENCE

3 Laryngoscope, 133:1952-1960, 2023.

摘要

目的

初级保健医生缺乏用于诊断嗓音障碍的工具。人工智能(AI)工具可能会为医生提供更多的武器,减少诊断时间,并减轻嗓音障碍的负担。

方法

使用智能手机从 2019 年至 2021 年收集患者的嗓音录音。同时纳入 Saarbruecken 数据集进行比较。使用 TensorFlow 将音频文件转换为梅尔频谱图。创建诊断类别以对病理进行分组,包括神经和肌肉疾病、炎症、肿块病变和正常。进一步将样本分为持续/a/和彩虹通道。

结果

使用 Saarbruecken 数据库中的 203 个前瞻性样本和 1131 个样本。AI 检测异常病理的 F1 得分为 98%。人工神经网络(ANN)可区分关键病理,包括单侧麻痹、喉炎、内收肌痉挛性发音障碍(ADSD)、肿块病变和正常样本,F1 评分在 39%-87%之间。与 Saarbruecken 和合并数据集相比,Calgary 数据库模型在头对头比较中具有更高的 F1 评分(87%对 58%和 50%)。与耳鼻喉科医生使用标准化录音测试集相比,AI 表现更好(83%对 55%±15%)。

结论

创建了一种 AI 工具,可通过个体或分类诊断来区分病理,具有较高的评估指标。应通过受控方式收集前瞻性数据,以减少录音之间的固有变异性。多中心数据合作对于提高 AI 工具检测声带病理的预测能力至关重要。我们提供了一个 AI 工具来协助初级保健医生管理声音障碍患者的概念验证。

证据水平

3 Laryngoscope,133:1952-1960,2023。

相似文献

1
Developing an Artificial Intelligence Tool to Predict Vocal Cord Pathology in Primary Care Settings.开发一种人工智能工具,以预测初级保健环境中的声带病变。
Laryngoscope. 2023 Aug;133(8):1952-1960. doi: 10.1002/lary.30432. Epub 2022 Oct 13.
2
End-to-end deep learning classification of vocal pathology using stacked vowels.使用叠加元音的端到端深度学习进行嗓音病理学分类
Laryngoscope Investig Otolaryngol. 2023 Aug 31;8(5):1312-1318. doi: 10.1002/lio2.1144. eCollection 2023 Oct.
3
Deep Learning Application for Vocal Fold Disease Prediction Through Voice Recognition: Preliminary Development Study.深度学习在声门疾病预测中的应用:通过语音识别——初步开发研究
J Med Internet Res. 2021 Jun 8;23(6):e25247. doi: 10.2196/25247.
4
Deep learning in automatic detection of dysphonia: Comparing acoustic features and developing a generalizable framework.深度学习在嗓音障碍自动检测中的应用:比较声学特征并开发一个可推广的框架。
Int J Lang Commun Disord. 2023 Mar;58(2):279-294. doi: 10.1111/1460-6984.12783. Epub 2022 Sep 18.
5
Sulcus vocalis in spasmodic dysphonia-A retrospective study.痉挛性发音障碍中的声带沟:一项回顾性研究。
Am J Otolaryngol. 2021 May-Jun;42(3):102940. doi: 10.1016/j.amjoto.2021.102940. Epub 2021 Jan 28.
6
Vibratory Onset of Adductor Spasmodic Dysphonia and Muscle Tension Dysphonia: A High-Speed Video Study✰.《Adductor 痉挛性发声障碍和肌肉紧张性发声障碍的振动起始:高速视频研究✰》。
J Voice. 2020 Jul;34(4):598-603. doi: 10.1016/j.jvoice.2018.12.010. Epub 2018 Dec 28.
7
The accuracy of an Online Sequential Extreme Learning Machine in detecting voice pathology using the Malaysian Voice Pathology Database.使用马来西亚语音病理学数据库检测语音病理学的在线序贯极限学习机的准确性。
J Otolaryngol Head Neck Surg. 2023 Sep 20;52(1):62. doi: 10.1186/s40463-023-00661-6.
8
A Measure of the Auditory-perceptual Quality of Strain from Electroglottographic Analysis of Continuous Dysphonic Speech: Application to Adductor Spasmodic Dysphonia.通过对持续性发声障碍语音进行电子声门图分析来测量嗓音紧张度的听觉感知质量:应用于内收型痉挛性发声障碍。
J Voice. 2016 Nov;30(6):770.e9-770.e21. doi: 10.1016/j.jvoice.2015.11.005. Epub 2015 Dec 28.
9
Decoding phonation with artificial intelligence (DeP AI): Proof of concept.利用人工智能解读发声(DeP AI):概念验证
Laryngoscope Investig Otolaryngol. 2019 Mar 25;4(3):328-334. doi: 10.1002/lio2.259. eCollection 2019 Jun.
10
Quantifying and Improving the Performance of Speech Recognition Systems on Dysphonic Speech.量化并提高语音识别系统对嗓音障碍语音的性能。
Otolaryngol Head Neck Surg. 2023 May;168(5):1130-1138. doi: 10.1002/ohn.170. Epub 2023 Jan 24.

引用本文的文献

1
Does ChatGPT update itself? Accuracy of ChatGPT in tympanostomy tube guidance: A comparative analysis with current literature.ChatGPT会自我更新吗?ChatGPT在鼓膜置管指导方面的准确性:与当前文献的比较分析。
Eur Arch Otorhinolaryngol. 2025 Aug 23. doi: 10.1007/s00405-025-09630-3.
2
AI and Primary Care: Scoping Review.人工智能与初级保健:范围综述
J Med Internet Res. 2025 Aug 15;27:e65950. doi: 10.2196/65950.
3
Artificial Intelligence in the Diagnosis and Treatment of Speech Disorders: Bridging Neurology and Otorhinolaryngology.
人工智能在言语障碍诊断与治疗中的应用:连接神经病学与耳鼻咽喉科学
Int Arch Otorhinolaryngol. 2025 May 29;29(2):1-2. doi: 10.1055/s-0045-1809334. eCollection 2025 Apr.
4
Application of artificial intelligence in laryngeal lesions: a systematic review and meta-analysis.人工智能在喉部病变中的应用:一项系统评价和荟萃分析。
Eur Arch Otorhinolaryngol. 2025 Mar;282(3):1543-1555. doi: 10.1007/s00405-024-09075-0. Epub 2024 Nov 22.
5
Responsible development of clinical speech AI: Bridging the gap between clinical research and technology.临床语音人工智能的负责任开发:弥合临床研究与技术之间的差距。
NPJ Digit Med. 2024 Aug 9;7(1):208. doi: 10.1038/s41746-024-01199-1.
6
Examining Diagnostic Errors in the Field of Otorhinolaryngology within the Challenging Landscape of Limited-Resource Healthcare.在资源有限的医疗保健这一充满挑战的环境中审视耳鼻喉科领域的诊断错误。
Indian J Otolaryngol Head Neck Surg. 2024 Jun;76(3):2714-2721. doi: 10.1007/s12070-024-04490-5. Epub 2024 Feb 13.
7
An introduction to machine learning and generative artificial intelligence for otolaryngologists-head and neck surgeons: a narrative review.耳鼻喉科-头颈外科医师的机器学习和生成式人工智能入门:叙述性综述。
Eur Arch Otorhinolaryngol. 2024 May;281(5):2723-2731. doi: 10.1007/s00405-024-08512-4. Epub 2024 Feb 23.
8
Design and evaluation of an intelligent physical examination system in improving the satisfaction of patients with chronic disease.智能体格检查系统在提高慢性病患者满意度方面的设计与评估
Heliyon. 2023 Dec 18;10(1):e23906. doi: 10.1016/j.heliyon.2023.e23906. eCollection 2024 Jan 15.