Assadi Atousa, Oreskovic Jessica, Kaufman Jaycee, Fossat Yan
Klick Applied Sciences, 175 Bloor St East, North Tower, 3rd floor, Toronto, ON, M4W3R8, Canada, 1 6472068717.
JMIR Biomed Eng. 2025 Jun 26;10:e64357. doi: 10.2196/64357.
The use of acoustic biomarkers derived from speech signals is a promising non-invasive technique for diagnosing type 2 diabetes mellitus (T2DM). Despite its potential, there remains a critical gap in knowledge regarding the optimal number of voice recordings and recording schedule necessary to achieve effective diagnostic accuracy.
This study aimed to determine the optimal number of voice samples and the ideal recording schedule (frequency and timing), required to maintain the T2DM diagnostic efficacy while reducing patient burden.
We analyzed voice recordings from 78 adults (22 women), including 39 individuals diagnosed with T2DM. Participants had a mean (SD) age of 45.26 (10.63) years and mean (SD) BMI of 28.07 (4.59) kg/m². In total, 5035 voice recordings were collected, with a mean (SD) of 4.91 (1.45) recordings per day; higher adherence was observed among women (5.13 [1.38] vs 4.82 [1.46] in men). We evaluated the diagnostic accuracy of a previously developed voice-based model under different recording conditions. Segmented linear regression analysis was used to assess model accuracy across varying numbers of voice recordings, and the Kendall tau correlation was used to measure the relationship between recording settings and accuracy. A significance threshold of P<.05 was applied.
Our results showed that including up to 6 voice recordings notably improved the model accuracy for T2DM compared to using only one recording, with accuracy increasing from 59.61 to 65.02 for men and from 65.55 to 69.43 for women. Additionally, the day on which voice recordings were collected did not significantly affect model accuracy (P>.05). However, adhering to recording within a single day demonstrated higher accuracy, with accuracy of 73.95% for women and 85.48% for men when all recordings were from the first and second days.
This study underscores the optimal voice recording settings to reduce patient burden while maintaining diagnostic efficacy.
利用源自语音信号的声学生物标志物是一种很有前景的用于诊断2型糖尿病(T2DM)的非侵入性技术。尽管其具有潜力,但在实现有效诊断准确性所需的最佳语音记录数量和记录时间表方面,仍存在关键的知识空白。
本研究旨在确定在减轻患者负担的同时维持T2DM诊断效力所需的最佳语音样本数量和理想记录时间表(频率和时间)。
我们分析了78名成年人(22名女性)的语音记录,其中包括39名被诊断为T2DM的个体。参与者的平均(标准差)年龄为45.26(10.63)岁,平均(标准差)体重指数为28.07(4.59)kg/m²。总共收集了5035份语音记录,平均(标准差)每天4.91(1.45)份记录;女性的依从性更高(女性为5.13 [1.38],男性为4.82 [1.46])。我们评估了先前开发的基于语音的模型在不同记录条件下的诊断准确性。分段线性回归分析用于评估不同数量语音记录时模型的准确性,肯德尔tau相关性用于衡量记录设置与准确性之间的关系。采用P<0.05的显著性阈值。
我们的结果表明,与仅使用一份记录相比,纳入多达6份语音记录显著提高了T2DM模型的准确性,男性的准确性从59.61提高到65.02,女性从65.55提高到69.43。此外,收集语音记录的日期对模型准确性没有显著影响(P>0.05)。然而,在同一天内坚持记录显示出更高的准确性,当所有记录都来自第一天和第二天时,女性的准确性为73.95%,男性为85.48%。
本研究强调了在维持诊断效力的同时减轻患者负担的最佳语音记录设置。