Department of Emergency Medicine, Stanford University, Palo Alto, California.
Department of Electrical & Computer Engineering, University of Toronto, Toronto, Ontario, Canada.
J Stud Alcohol Drugs. 2023 Nov;84(6):808-813. doi: 10.15288/jsad.22-00375. Epub 2023 Jun 9.
Devices such as mobile phones and smart speakers could be useful to remotely identify voice alterations associated with alcohol intoxication that could be used to deliver just-in-time interventions, but data to support such approaches for the English language are lacking. In this controlled laboratory study, we compare how well English spectrographic voice features identify alcohol intoxication.
A total of 18 participants (72% male, ages 21-62 years) read a randomly assigned tongue twister before drinking and each hour for up to 7 hours after drinking a weight-based dose of alcohol. Vocal segments were cleaned and split into 1-second windows. We built support vector machine models for detecting alcohol intoxication, defined as breath alcohol concentration > .08%, comparing the baseline voice spectrographic signature to each subsequent timepoint and examined accuracy with 95% confidence intervals (CIs).
Alcohol intoxication was predicted with an accuracy of 98% (95% CI [97.1, 98.6]); mean sensitivity = .98; specificity = .97; positive predictive value = .97; and negative predictive value = .98.
In this small, controlled laboratory study, voice spectrographic signatures collected from brief recorded English segments were useful in identifying alcohol intoxication. Larger studies using varied voice samples are needed to validate and expand models.
移动电话和智能音箱等设备可用于远程识别与酒精中毒相关的语音改变,从而可以实施及时干预,但缺乏支持英语此类方法的数据。本对照实验室研究旨在比较英语频谱语音特征识别酒精中毒的效果。
共 18 名参与者(72%为男性,年龄 21-62 岁)在饮酒前和饮酒后最多 7 小时内每小时随机阅读一段绕口令。对语音段进行清理并分为 1 秒的窗口。我们为检测酒精中毒(定义为呼气酒精浓度>0.08%)构建支持向量机模型,将基线语音频谱特征与每个后续时间点进行比较,并使用 95%置信区间(CI)检查准确性。
酒精中毒的预测准确率为 98%(95%CI [97.1, 98.6]);平均灵敏度为.98;特异性为.97;阳性预测值为.97;阴性预测值为.98。
在这项小型对照实验室研究中,从简短的英语记录片段中收集的语音频谱特征可用于识别酒精中毒。需要使用不同的语音样本进行更大规模的研究,以验证和扩展模型。