Quatieri Thomas F, Talkar Tanya, Palmer Jeffrey S
MIT Lincoln Laboratory Lexington MA 02421 USA.
Harvard-MIT Speech and Hearing Bioscience and Technology ProgramHarvard Medical Sciences Boston MA 02115 USA.
IEEE Open J Eng Med Biol. 2020 May 29;1:203-206. doi: 10.1109/OJEMB.2020.2998051. eCollection 2020.
We propose a speech modeling and signal-processing framework to detect and track COVID-19 through asymptomatic and symptomatic stages. The approach is based on complexity of neuromotor coordination across speech subsystems involved in respiration, phonation and articulation, motivated by the distinct nature of COVID-19 involving lower (i.e., bronchial, diaphragm, lower tracheal) versus upper (i.e., laryngeal, pharyngeal, oral and nasal) respiratory tract inflammation, as well as by the growing evidence of the virus' neurological manifestations. An exploratory study with audio interviews of five subjects provides Cohen's d effect sizes between pre-COVID-19 (pre-exposure) and post-COVID-19 (after positive diagnosis but presumed asymptomatic) using: coordination of respiration (as measured through acoustic waveform amplitude) and laryngeal motion (fundamental frequency and cepstral peak prominence), and coordination of laryngeal and articulatory (formant center frequencies) motion. While there is a strong subject-dependence, the group-level morphology of effect sizes indicates a reduced complexity of subsystem coordination. Validation is needed with larger more controlled datasets and to address confounding influences such as different recording conditions, unbalanced data quantities, and changes in underlying vocal status from pre-to-post time recordings.
我们提出了一个语音建模和信号处理框架,用于在无症状和有症状阶段检测和追踪新冠病毒。该方法基于参与呼吸、发声和发音的语音子系统中神经运动协调的复杂性,其动机源于新冠病毒涉及下呼吸道(即支气管、膈肌、下气管)与上呼吸道(即喉、咽、口腔和鼻腔)炎症的独特性质,以及该病毒神经学表现的证据不断增加。一项对五名受试者进行音频访谈的探索性研究,使用呼吸协调(通过声学波形幅度测量)、喉部运动(基频和谐波峰值突出度)以及喉部与发音(共振峰中心频率)运动的协调,给出了新冠病毒感染前(暴露前)和感染后(阳性诊断但假定无症状)之间的科恩d效应量。虽然存在很强的个体依赖性,但效应量的组级形态表明子系统协调的复杂性降低。需要使用更大、更受控的数据集进行验证,并解决诸如不同记录条件、数据量不平衡以及记录前后潜在嗓音状态变化等混杂影响。