探索智能手机麦克风用于测量声学语音参数和语音病理学筛查的可行性。

Exploring the feasibility of smart phone microphone for measurement of acoustic voice parameters and voice pathology screening.

作者信息

Uloza Virgilijus, Padervinskis Evaldas, Vegiene Aurelija, Pribuisiene Ruta, Saferis Viktoras, Vaiciukynas Evaldas, Gelzinis Adas, Verikas Antanas

机构信息

Department of Otolaryngology, Lithuanian University of Health Sciences, Eiveniu 2, 50009, Kaunas, Lithuania.

Department of Physics, Mathematics and Biophysics, Lithuanian University of Health Sciences, Kaunas, Lithuania.

出版信息

Eur Arch Otorhinolaryngol. 2015 Nov;272(11):3391-9. doi: 10.1007/s00405-015-3708-4. Epub 2015 Jul 11.

DOI:10.1007/s00405-015-3708-4

PMID:26162450

Abstract

The objective of this study is to evaluate the reliability of acoustic voice parameters obtained using smart phone (SP) microphones and investigate the utility of use of SP voice recordings for voice screening. Voice samples of sustained vowel/a/obtained from 118 subjects (34 normal and 84 pathological voices) were recorded simultaneously through two microphones: oral AKG Perception 220 microphone and SP Samsung Galaxy Note3 microphone. Acoustic voice signal data were measured for fundamental frequency, jitter and shimmer, normalized noise energy (NNE), signal to noise ratio and harmonic to noise ratio using Dr. Speech software. Discriminant analysis-based Correct Classification Rate (CCR) and Random Forest Classifier (RFC) based Equal Error Rate (EER) were used to evaluate the feasibility of acoustic voice parameters classifying normal and pathological voice classes. Lithuanian version of Glottal Function Index (LT_GFI) questionnaire was utilized for self-assessment of the severity of voice disorder. The correlations of acoustic voice parameters obtained with two types of microphones were statistically significant and strong (r = 0.73-1.0) for the entire measurements. When classifying into normal/pathological voice classes, the Oral-NNE revealed the CCR of 73.7% and the pair of SP-NNE and SP-shimmer parameters revealed CCR of 79.5%. However, fusion of the results obtained from SP voice recordings and GFI data provided the CCR of 84.60% and RFC revealed the EER of 7.9%, respectively. In conclusion, measurements of acoustic voice parameters using SP microphone were shown to be reliable in clinical settings demonstrating high CCR and low EER when distinguishing normal and pathological voice classes, and validated the suitability of the SP microphone signal for the task of automatic voice analysis and screening.

摘要

本研究的目的是评估使用智能手机（SP）麦克风获得的声学语音参数的可靠性，并研究SP语音记录在语音筛查中的实用性。通过两个麦克风同时记录了118名受试者（34名正常嗓音和84名病理嗓音）发出的持续元音/a/的语音样本：口腔AKG Perception 220麦克风和SP三星Galaxy Note3麦克风。使用Dr. Speech软件测量声学语音信号数据的基频、抖动和闪烁、归一化噪声能量（NNE）、信噪比和谐波噪声比。基于判别分析的正确分类率（CCR）和基于随机森林分类器（RFC）的等错误率（EER）用于评估声学语音参数对正常和病理嗓音类别进行分类的可行性。使用立陶宛语版的声门功能指数（LT_GFI）问卷对嗓音障碍的严重程度进行自我评估。两种类型麦克风获得的声学语音参数之间的相关性在整个测量中具有统计学意义且很强（r = 0.73 - 1.0）。在将嗓音分为正常/病理类别时，口腔-NNE的CCR为73.7%，而SP-NNE和SP-闪烁参数对的CCR为79.5%。然而，将SP语音记录和GFI数据的结果融合后，CCR为84.60%，RFC的EER分别为7.9%。总之，在临床环境中，使用SP麦克风测量声学语音参数在区分正常和病理嗓音类别时显示出高CCR和低EER，是可靠的，并验证了SP麦克风信号适用于自动语音分析和筛查任务。

相似文献

Exploring the feasibility of smart phone microphone for measurement of acoustic voice parameters and voice pathology screening.

Eur Arch Otorhinolaryngol. 2015 Nov;272(11):3391-9. doi: 10.1007/s00405-015-3708-4. Epub 2015 Jul 11.

Combined Use of Standard and Throat Microphones for Measurement of Acoustic Voice Parameters and Voice Categorization.

J Voice. 2015 Sep;29(5):552-9. doi: 10.1016/j.jvoice.2014.10.008. Epub 2015 Mar 17.

Exploring the feasibility of the combination of acoustic voice quality index and glottal function index for voice pathology screening.

Eur Arch Otorhinolaryngol. 2019 Jun;276(6):1737-1745. doi: 10.1007/s00405-019-05433-5. Epub 2019 Apr 23.

Glottal function index questionnaire for screening of pediatric dysphonia.

Int J Pediatr Otorhinolaryngol. 2019 Aug;123:97-101. doi: 10.1016/j.ijporl.2019.04.045. Epub 2019 May 6.

An Innovative Voice Analyzer "VA" Smart Phone Program for Quantitative Analysis of Voice Quality.

J Voice. 2019 Sep;33(5):642-648. doi: 10.1016/j.jvoice.2018.01.026. Epub 2018 May 22.

Accuracy of Acoustic Voice Quality Index Captured With a Smartphone - Measurements With Added Ambient Noise.

J Voice. 2023 May;37(3):465.e19-465.e26. doi: 10.1016/j.jvoice.2021.01.025. Epub 2021 Mar 4.

[Study on the concordance of objective multi-parameters analysis and perceptual evaluation].

Zhonghua Er Bi Yan Hou Tou Jing Wai Ke Za Zhi. 2012 Oct;47(10):817-22.

Assessing voice health using smartphones: bias and random error of acoustic voice parameters captured by different smartphone types.

Int J Lang Commun Disord. 2019 Mar;54(2):292-305. doi: 10.1111/1460-6984.12457. Epub 2019 Feb 19.

Evaluation of Acoustic Analyses of Voice in Nonoptimized Conditions.

J Speech Lang Hear Res. 2020 Dec 14;63(12):3991-3999. doi: 10.1044/2020_JSLHR-20-00212. Epub 2020 Nov 13.

Are smartphones and low-cost external microphones comparable for measuring time-domain acoustic parameters?

Eur Arch Otorhinolaryngol. 2023 Dec;280(12):5433-5444. doi: 10.1007/s00405-023-08179-3. Epub 2023 Aug 16.

引用本文的文献

Pre-trained convolutional neural networks identify Parkinson's disease from spectrogram images of voice samples.

Sci Rep. 2025 Mar 1;15(1):7337. doi: 10.1038/s41598-025-92105-6.

Pre-trained Convolutional Neural Networks Identify Parkinson's Disease from Spectrogram Images of Voice Samples.

Res Sq. 2024 Dec 18:rs.3.rs-5348708. doi: 10.21203/rs.3.rs-5348708/v1.

Cross-device and test-retest reliability of speech acoustic measurements derived from consumer-grade mobile recording devices.

Behav Res Methods. 2024 Dec 30;57(1):35. doi: 10.3758/s13428-024-02584-0.

Validity of Acoustic Measures Obtained Using Various Recording Methods Including Smartphones With and Without Headset Microphones.

J Speech Lang Hear Res. 2024 Jun 6;67(6):1712-1730. doi: 10.1044/2024_JSLHR-23-00759. Epub 2024 May 15.

Machine Learning-Assisted Speech Analysis for Early Detection of Parkinson's Disease: A Study on Speaker Diarization and Classification Techniques.

Sensors (Basel). 2024 Feb 26;24(5):1499. doi: 10.3390/s24051499.

A machine learning method to process voice samples for identification of Parkinson's disease.

Sci Rep. 2023 Nov 23;13(1):20615. doi: 10.1038/s41598-023-47568-w.

Are smartphones and low-cost external microphones comparable for measuring time-domain acoustic parameters?

Eur Arch Otorhinolaryngol. 2023 Dec;280(12):5433-5444. doi: 10.1007/s00405-023-08179-3. Epub 2023 Aug 16.

Atypical vocal quality in women with the FMR1 premutation: an indicator of impaired sensorimotor control.

Exp Brain Res. 2023 Aug;241(8):1975-1987. doi: 10.1007/s00221-023-06653-2. Epub 2023 Jun 22.

Voice analytics in the wild: Validity and predictive accuracy of common audio-recording devices.

Behav Res Methods. 2024 Mar;56(3):2114-2134. doi: 10.3758/s13428-023-02139-9. Epub 2023 May 30.

Profiles and predictors of onset based differences in vocal characteristics of adults with auditory neuropathy spectrum disorder (ANSD).

J Otol. 2022 Oct;17(4):218-225. doi: 10.1016/j.joto.2022.08.001. Epub 2022 Aug 14.

本文引用的文献

Delayed otolaryngology referral for voice disorders increases health care costs.

Am J Med. 2015 Apr;128(4):426.e11-8. doi: 10.1016/j.amjmed.2014.10.040. Epub 2014 Nov 18.

Reliability of OperaVOX against Multidimensional Voice Program (MDVP).

Clin Otolaryngol. 2015 Feb;40(1):22-8. doi: 10.1111/coa.12313.

The prevalence of voice problems among adults in the United States.

Laryngoscope. 2014 Oct;124(10):2359-62. doi: 10.1002/lary.24740. Epub 2014 May 27.

Pathological speech signal analysis and classification using empirical mode decomposition.

Med Biol Eng Comput. 2013 Jul;51(7):811-21. doi: 10.1007/s11517-013-1051-8. Epub 2013 Mar 5.

Signal-to-noise ratio adaptive post-filtering method for intelligibility enhancement of telephone speech.

J Acoust Soc Am. 2012 Dec;132(6):3990-4001. doi: 10.1121/1.4765074.

Multidirectional regression (MDR)-based features for automatic voice disorder detection.

J Voice. 2012 Nov;26(6):817.e19-27. doi: 10.1016/j.jvoice.2012.05.002.

Evaluating iPhone recordings for acoustic voice assessment.

Folia Phoniatr Logop. 2012;64(3):122-30. doi: 10.1159/000335874. Epub 2012 May 15.

Materials of acoustic analysis: sustained vowel versus sentence.

J Voice. 2012 Sep;26(5):563-5. doi: 10.1016/j.jvoice.2011.09.007. Epub 2012 Apr 18.

Telephone-quality pathological speech classification using empirical mode decomposition.

Annu Int Conf IEEE Eng Med Biol Soc. 2011;2011:7095-8. doi: 10.1109/IEMBS.2011.6091793.

Validation of the Lithuanian version of the Glottal Function Index.

J Voice. 2012 Mar;26(2):e73-8. doi: 10.1016/j.jvoice.2011.01.012. Epub 2011 May 31.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

探索智能手机麦克风用于测量声学语音参数和语音病理学筛查的可行性。

Exploring the feasibility of smart phone microphone for measurement of acoustic voice parameters and voice pathology screening.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献