基于移动医疗系统的嗓音障碍检测：Vox4Health 临床评估研究的设计与结果

Voice Disorder Detection via an m-Health System: Design and Results of a Clinical Study to Evaluate Vox4Health.

机构信息

Department of Otorhinolaryngology, University Hospital (Policlinico) Federico II of Naples, Via S. Pansini 5, Naples, Italy.

Institute of High Performance Computing and Networking (ICAR-CNR), Via Pietro Castellino 111, Naples, Italy.

出版信息

Biomed Res Int. 2018 Aug 8;2018:8193694. doi: 10.1155/2018/8193694. eCollection 2018.

DOI:10.1155/2018/8193694

PMID:30175144

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6106917/

Abstract

OBJECTIVES

The current study presents a clinical evaluation of Vox4Health, an m-health system able to estimate the possible presence of a voice disorder by calculating and analyzing the main acoustic measures required for the acoustic analysis, namely, the Fundamental Frequency, jitter, shimmer, and Harmonic to Noise Ratio. The acoustic analysis is an objective, effective, and noninvasive tool used in clinical practice to perform a quantitative evaluation of voice quality.

MATERIALS AND METHODS

A clinical study was carried out in collaboration with medical staff of the University of Naples Federico II. 208 volunteers were recruited (mean age, 44.2 ± 13.9 years), 58 healthy subjects (mean age, 36.7 ± 13.3 years) and 150 pathological ones (mean age, 47 ± 13.1 years). The evaluation of Vox4Health was made in terms of classification performance, i.e., sensitivity, specificity, and accuracy, by using a rule-based algorithm that considers the most characteristic acoustic parameters to classify if the voice is healthy or pathological. The performance has been compared with that achieved by using Praat, one of the most commonly used tools in clinical practice.

RESULTS

Using a rule-based algorithm, the best accuracy in the detection of voice disorders, 72.6%, was obtained by using the jitter or shimmer value. Moreover, the best sensitivity is about 96% and it was always obtained by using jitter. Finally, the best specificity was achieved by using the Fundamental Frequency and it is equal to 56.9%. Additionally, in order to improve the classification accuracy of the next version of the Vox4Health app, an evaluation by using machine learning techniques was conducted. We performed some preliminary tests adopting different machine learning techniques able to classify the voice as healthy or pathological. The best accuracy (77.4%) was obtained by the Logistic Model Tree algorithm, while the best sensitivity (99.3%) was achieved using the Support Vector Machine. Finally, Instance-based Learning performed the best specificity (36.2%).

CONCLUSIONS

Considering the achieved accuracy, Vox4Health has been considered by the medical experts as a "good screening tool" for the detection of voice disorders in its current version. However, this accuracy is improved when machine learning classifiers are considered rather than the rule-based algorithm.

摘要

目的

本研究介绍了一种名为 Vox4Health 的移动医疗系统的临床评估，该系统能够通过计算和分析声学分析所需的主要声学测量值（即基频、抖动、颤抖和谐噪比）来估计声音障碍的可能存在。声学分析是一种客观、有效、非侵入性的工具，用于在临床实践中对语音质量进行定量评估。

材料与方法

与那不勒斯费德里克二世大学的医务人员合作进行了一项临床研究。共招募了 208 名志愿者（平均年龄 44.2 ± 13.9 岁），其中 58 名健康受试者（平均年龄 36.7 ± 13.3 岁）和 150 名病理受试者（平均年龄 47 ± 13.1 岁）。通过使用基于规则的算法评估 Vox4Health 的分类性能，即敏感性、特异性和准确性，该算法考虑了最具特征性的声学参数来判断声音是否健康或病理。并将性能与在临床实践中最常用的工具之一 Praat 的性能进行了比较。

结果

使用基于规则的算法，通过使用抖动或颤抖值，在检测声音障碍方面获得了最佳的准确性（72.6%）。此外，灵敏度最高约为 96%，始终通过使用抖动获得。最后，通过使用基频获得了最佳的特异性（56.9%）。此外，为了提高 Vox4Health 应用程序的下一个版本的分类准确性，进行了使用机器学习技术的评估。我们采用了不同的机器学习技术进行了一些初步测试，这些技术能够将声音分类为健康或病理。通过逻辑模型树算法获得了最佳的准确性（77.4%），而通过支持向量机获得了最佳的灵敏度（99.3%）。最后，基于实例的学习获得了最佳的特异性（36.2%）。

结论

考虑到所达到的准确性，Vox4Health 在当前版本中被医学专家认为是一种用于检测声音障碍的“良好筛查工具”。然而，当使用基于机器学习的分类器而不是基于规则的算法时，准确性会得到提高。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4e1/6106917/2c77dc51a773/BMRI2018-8193694.001.jpg

相似文献

Voice Disorder Detection via an m-Health System: Design and Results of a Clinical Study to Evaluate Vox4Health.

Biomed Res Int. 2018 Aug 8;2018:8193694. doi: 10.1155/2018/8193694. eCollection 2018.

Accuracy of Acoustic Analysis Measurements in the Evaluation of Patients With Different Laryngeal Diagnoses.

J Voice. 2017 May;31(3):382.e15-382.e26. doi: 10.1016/j.jvoice.2016.08.015. Epub 2016 Oct 11.

Combined Use of Standard and Throat Microphones for Measurement of Acoustic Voice Parameters and Voice Categorization.

J Voice. 2015 Sep;29(5):552-9. doi: 10.1016/j.jvoice.2014.10.008. Epub 2015 Mar 17.

Survey of Voice Acoustic Parameters in Iranian Female Teachers.

J Voice. 2016 Jul;30(4):507.e1-5. doi: 10.1016/j.jvoice.2015.05.020. Epub 2015 Aug 12.

Clinical relevance of speaking voice intensity effects on acoustic jitter and shimmer in children between 5;0 and 9;11 years.

Int J Pediatr Otorhinolaryngol. 2014 Dec;78(12):2121-6. doi: 10.1016/j.ijporl.2014.09.020. Epub 2014 Sep 28.

Relationship Between Acoustic Measurements and Self-evaluation in Patients With Voice Disorders.

J Voice. 2017 Jan;31(1):119.e1-119.e10. doi: 10.1016/j.jvoice.2016.02.021. Epub 2016 Apr 1.

Correlation Between Acoustic Measurements and Self-Reported Voice Disorders Among Female Teachers.

J Voice. 2016 Jul;30(4):460-5. doi: 10.1016/j.jvoice.2015.05.013. Epub 2015 Jun 19.

[Quantitative analysis of pathological voice and identification with artificial neural network].

Lin Chuang Er Bi Yan Hou Tou Jing Wai Ke Za Zhi. 2016 Jan 20;31(2):100-102. doi: 10.13201/j.issn.1001-1781.2017.02.005.

Acoustic analysis of voice in bulbar amyotrophic lateral sclerosis: a systematic review and meta-analysis of studies.

Logoped Phoniatr Vocol. 2020 Dec;45(4):151-163. doi: 10.1080/14015439.2019.1687748. Epub 2019 Nov 25.

Acoustic Voice Analysis and Maximum Phonation Time in Relation to Voice Handicap Index Score and Larynx Disease.

J Voice. 2020 Jan;34(1):161.e27-161.e35. doi: 10.1016/j.jvoice.2018.07.002. Epub 2018 Aug 6.

引用本文的文献

Voice signals database of ALS patients with different dysarthria severity and healthy controls.

Sci Data. 2024 Jul 19;11(1):800. doi: 10.1038/s41597-024-03597-2.

Voice disorder recognition using machine learning: a scoping review protocol.

BMJ Open. 2024 Feb 24;14(2):e076998. doi: 10.1136/bmjopen-2023-076998.

A novel hybrid model integrating MFCC and acoustic parameters for voice disorder detection.

Sci Rep. 2023 Dec 20;13(1):22719. doi: 10.1038/s41598-023-49869-6.

Voice disorder classification using convolutional neural network based on deep transfer learning.

Sci Rep. 2023 May 4;13(1):7264. doi: 10.1038/s41598-023-34461-9.

Exploring the Use of Artificial Intelligence Techniques to Detect the Presence of Coronavirus Covid-19 Through Speech and Voice Analysis.

IEEE Access. 2021 Apr 26;9:65750-65757. doi: 10.1109/ACCESS.2021.3075571. eCollection 2021.

本文引用的文献

Accuracy of Acoustic Analysis Measurements in the Evaluation of Patients With Different Laryngeal Diagnoses.

J Voice. 2017 May;31(3):382.e15-382.e26. doi: 10.1016/j.jvoice.2016.08.015. Epub 2016 Oct 11.

An iOS-based Cepstral Peak Prominence Application: Feasibility for Patient Practice of Resonant Voice.

J Voice. 2017 Jan;31(1):131.e9-131.e16. doi: 10.1016/j.jvoice.2015.11.022. Epub 2016 Feb 2.

Voice Disorders: Etiology and Diagnosis.

J Voice. 2016 Nov;30(6):761.e1-761.e9. doi: 10.1016/j.jvoice.2015.09.017. Epub 2015 Nov 4.

Assessment of voice quality: Current state-of-the-art.

Auris Nasus Larynx. 2015 Jun;42(3):183-8. doi: 10.1016/j.anl.2014.11.001. Epub 2014 Nov 28.

Reliability of OperaVOX against Multidimensional Voice Program (MDVP).

Clin Otolaryngol. 2015 Feb;40(1):22-8. doi: 10.1111/coa.12313.

SPIRIT 2013 explanation and elaboration: guidance for protocols of clinical trials.

BMJ. 2013 Jan 8;346:e7586. doi: 10.1136/bmj.e7586.

Diagnostic accuracy of history, laryngoscopy, and stroboscopy.

Laryngoscope. 2013 Jan;123(1):215-9. doi: 10.1002/lary.23630. Epub 2012 Oct 15.

The prevalence, diagnosis, and management of voice disorders in a National Ambulatory Medical Care Survey (NAMCS) cohort.

Laryngoscope. 2011 Jan;121(1):150-7. doi: 10.1002/lary.21169.

Spectral- and cepstral-based measures during continuous speech: capacity to distinguish dysphonia and consistency within a speaker.

J Voice. 2011 Sep;25(5):e223-32. doi: 10.1016/j.jvoice.2010.06.007. Epub 2010 Oct 25.

Machine learning for detection and diagnosis of disease.

Annu Rev Biomed Eng. 2006;8:537-65. doi: 10.1146/annurev.bioeng.8.061505.095802.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于移动医疗系统的嗓音障碍检测：Vox4Health 临床评估研究的设计与结果

Voice Disorder Detection via an m-Health System: Design and Results of a Clinical Study to Evaluate Vox4Health.

机构信息

出版信息

OBJECTIVES

MATERIALS AND METHODS

RESULTS

CONCLUSIONS

目的

材料与方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献