Sri Sivasubramaniya Nadar College of Engineering, Tamil Nadu, India.
Int J Lang Commun Disord. 2024 May-Jun;59(3):1110-1127. doi: 10.1111/1460-6984.12973. Epub 2023 Nov 16.
Dementia is a cognitive decline that leads to the progressive deterioration of an individual's ability to perform daily activities independently. As a result, a considerable amount of time and resources are spent on caretaking. Early detection of dementia can significantly reduce the effort and resources needed for caretaking.
This research proposes an approach for assessing cognitive decline by analysing speech data, specifically focusing on speech relevance as a crucial indicator for memory recall.
METHODS & PROCEDURES: This is a cross-sectional, online, self-administered. The proposed method used deep learning architecture based on transformers, with BERT (Bidirectional Encoder Representations from Transformers) and Sentence-Transformer to derive encoded representations of speech transcripts. These representations provide contextually descriptive information that is used to analyse the relevance of sentences in their respective contexts. The encoded information is then compared using cosine similarity metrics to measure the relevance of uttered sequences of sentences. The study uses the Pitt Corpus Dementia dataset for experimentation, which consists of speech data from individuals with and without dementia. The accuracy of the proposed multi-QA-MPNet (Multi-Query Maximum Inner Product Search Pretraining) model is compared with other pretrained transformer models of Sentence-Transformer.
OUTCOMES & RESULTS: The results show that the proposed approach outperforms the other models in capturing context level information, particularly semantic memory. Additionally, the study explores the suitability of different similarity measures to evaluate the relevance of uttered sequences of sentences. The experimentation reveals that cosine similarity is the most appropriate measure for this task.
CONCLUSIONS & IMPLICATIONS: This finding has significant implications for the early warning signs of dementia, as it suggests that cosine similarity metrics can effectively capture the semantic relevance of spoken language. The persistent cognitive decline over time acts as one of the indicators for prevalence of dementia. Additionally early dementia could be recognised by analysis on other modalities like speech and brain images.
What is already known on this subject It is already known that speech- and language-based detection methods can be useful for dementia diagnosis, as language difficulties are often early signs of the disease. Additionally, deep learning algorithms have shown promise in detecting and diagnosing dementia through analysing large datasets, particularly in speech- and language-based detection methods. However, further research is needed to validate the performance of these algorithms on larger and more diverse datasets and to address potential biases and limitations. What this paper adds to existing knowledge This study presents a unique and effective approach for cognitive decline assessment through analysing speech data. The study provides valuable insights into the importance of context and semantic memory in accurately detecting the potential in dementia and demonstrates the applicability of deep learning models for this purpose. The findings of this study have important clinical implications and can inform future research and development in the field of dementia detection and care. What are the potential or actual clinical implications of this work? The proposed approach for cognitive decline assessment using speech data and deep learning models has significant clinical implications. It has the potential to improve the accuracy and efficiency of dementia diagnosis, leading to earlier detection and more effective treatments, which can improve patient outcomes and quality of life.
痴呆症是一种认知能力下降,导致个体独立进行日常活动的能力逐渐恶化。因此,需要大量的时间和资源来进行护理。早期发现痴呆症可以显著减少护理所需的努力和资源。
本研究提出了一种通过分析语音数据评估认知能力下降的方法,特别关注语音相关性作为记忆回忆的关键指标。
这是一项横断面、在线、自我管理的研究。所提出的方法使用基于转换器的深度学习架构,使用 BERT(来自转换器的双向编码器表示)和 Sentence-Transformer 从语音转录本中得出编码表示。这些表示提供上下文描述性信息,用于分析句子在各自上下文中的相关性。然后使用余弦相似性度量来比较编码信息,以衡量说出的句子序列的相关性。该研究使用 Pitt 语料库痴呆症数据集进行实验,该数据集包含来自有和没有痴呆症的个体的语音数据。与 Sentence-Transformer 的其他预训练转换器模型相比,评估了所提出的多问答-MPNet(多查询最大内积搜索预训练)模型的准确性。
结果表明,与其他模型相比,所提出的方法在捕获上下文级别信息方面表现更好,特别是语义记忆。此外,该研究还探讨了不同相似性度量标准评估说出的句子序列相关性的适用性。实验表明余弦相似性是最适合这项任务的度量标准。
这一发现对痴呆症的早期预警信号具有重要意义,因为它表明余弦相似性度量可以有效地捕捉口语的语义相关性。随着时间的推移,持续的认知能力下降是痴呆症流行的一个指标。此外,还可以通过分析其他模态(如语音和脑图像)来识别早期痴呆症。
已经知道,基于语音和语言的检测方法对于痴呆症诊断可能很有用,因为语言困难通常是疾病的早期迹象。此外,深度学习算法在通过分析大型数据集检测和诊断痴呆症方面显示出了前景,特别是在基于语音和语言的检测方法中。然而,需要进一步的研究来验证这些算法在更大和更多样化的数据集上的性能,并解决潜在的偏差和局限性。
本研究提出了一种通过分析语音数据评估认知能力下降的独特而有效的方法。该研究提供了关于上下文和语义记忆在准确检测痴呆症潜力方面的重要见解,并展示了深度学习模型在这方面的适用性。这项研究的结果具有重要的临床意义,并为痴呆症检测和护理领域的未来研究和发展提供了信息。
这项工作有什么潜在或实际的临床意义?使用语音数据和深度学习模型评估认知能力下降的方法具有重要的临床意义。它有可能提高痴呆症诊断的准确性和效率,从而更早地发现并更有效地治疗,从而改善患者的预后和生活质量。