Department of Cardiovascular Diseases, Mayo Clinic, Rochester, MN.
Department of Otorhinolaryngology, Mayo Clinic, Rochester, MN.
Mayo Clin Proc. 2018 Jul;93(7):840-847. doi: 10.1016/j.mayocp.2017.12.025. Epub 2018 Apr 12.
Voice signal analysis is an emerging noninvasive diagnostic tool. The current study tested the hypothesis that patient voice signal characteristics are associated with the presence of coronary artery disease (CAD).
The study population included 138 patients who were enrolled between January 1, 2015, and February 28, 2017: 37 control subjects and 101 subjects who underwent planned coronary angiogram. All subjects had their voice signal recorded to their smartphone 3 times: reading a text, describing a positive emotional experience, and describing a negative emotional experience. The Mel Frequency Cepstral Coefficients were used to extract prespecified voice features from all 3 recordings. Voice was recorded before the angiogram and analysis was blinded with respect to patient data.
Final study cohort included 101 patients, of whom 71 (71%) had CAD. Compared with subjects without CAD, patients with CAD were older (median, 63 years; interquartile range [IQR], 55-68 years vs median, 53 years; IQR, 42-66 years; P=.003) and had a higher 10-year atherosclerotic cardiovascular disease (ASCVD) risk score (9.4%; IQR, 5.0-18.7 vs 2.7%; IQR, 1.6-11.8; P=.005). Univariate binary logistic regression analysis identified 5 voice features that were associated with CAD (P<.05 for all). Multivariate binary logistic regression with adjustment for ASCVD risk score identified 2 voice features that were independently associated with CAD (odds ratio [OR], 0.37; 95% CI, 0.18-0.79; and 4.01; 95% CI, 1.25-12.84; P=.009 and P=.02, respectively). Both features were more strongly associated with CAD when patients were asked to describe an emotionally significant experience.
This study suggests a potential relationship between voice characteristics and CAD, with clinical implications for telemedicine-when clinical health care is provided at a distance.
语音信号分析是一种新兴的非侵入性诊断工具。本研究旨在验证患者的语音信号特征是否与冠状动脉疾病(CAD)的存在有关。
研究人群包括 2015 年 1 月 1 日至 2017 年 2 月 28 日期间入组的 138 名患者:37 名对照组和 101 名接受计划冠状动脉造影的患者。所有患者均使用智能手机录制 3 次语音信号:朗读文本、描述积极的情绪体验和描述消极的情绪体验。使用梅尔频率倒谱系数从所有 3 次录音中提取预设的语音特征。在进行血管造影之前录制语音,分析结果对患者数据进行盲法处理。
最终研究队列纳入 101 名患者,其中 71 名(71%)患有 CAD。与无 CAD 的患者相比,患有 CAD 的患者年龄更大(中位数 63 岁,四分位距[IQR] 55-68 岁;中位数 53 岁,IQR 42-66 岁;P=.003),10 年动脉粥样硬化性心血管疾病(ASCVD)风险评分更高(9.4%,IQR 5.0-18.7;2.7%,IQR 1.6-11.8;P=.005)。单变量二元逻辑回归分析确定了 5 个与 CAD 相关的语音特征(所有特征 P<.05)。经 ASCVD 风险评分校正的多变量二元逻辑回归分析确定了 2 个与 CAD 独立相关的语音特征(优势比[OR] 0.37,95%CI 0.18-0.79;和 4.01,95%CI 1.25-12.84;P=.009 和 P=.02)。当患者被要求描述情绪体验时,这两个特征与 CAD 的相关性更强。
本研究提示了语音特征与 CAD 之间存在潜在关系,为远程医疗(当临床医疗服务在远程进行时)提供了临床意义。