跨越“曲奇盗窃”语料库的鸿沟：将BERT从外部数据中学到的知识应用于ADReSS挑战痴呆症检测任务

Crossing the 'Cookie Theft' Corpus Chasm: Applying what BERT Learns from Outside Data to the ADReSS Challenge Dementia Detection Task.

作者信息

Guo Yue, Li Changye, Roan Carol, Pakhomov Serguei, Cohen Trevor

机构信息

Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, Washington, USA.

Pharmaceutical Care and Health Systems, University of Minnesota, Minneapolis, Minnesota, USA.

出版信息

Front Comput Sci. 2021 Apr;3. doi: 10.3389/fcomp.2021.642517. Epub 2021 Apr 16.

DOI:10.3389/fcomp.2021.642517

PMID:40535703

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12176378/

Abstract

Large amounts of labeled data are a prerequisite to training accurate and reliable machine learning models. However, in the medical domain in particular, this is also a stumbling block as accurately labeled data are hard to obtain. DementiaBank, a publicly available corpus of spontaneous speech samples from a picture description task widely used to study Alzheimer's disease (AD) patients' language characteristics and for training classification models to distinguish patients with AD from healthy controls, is relatively small - a limitation that is further exacerbated when restricting to the balanced subset used in the Alzheimer's Dementia Recognition through Spontaneous Speech (ADReSS) challenge. We build on previous work showing that the performance of traditional machine learning models on DementiaBank can be improved by the addition of normative data from other sources, evaluating the utility of such extrinsic data to further improve the performance of state-of-the-art deep learning based methods on the ADReSS challenge dementia detection task. To this end, we developed a new corpus of professionally transcribed recordings from the Wisconsin Longitudinal Study (WLS), resulting in 1366 additional Cookie Theft Task transcripts, increasing the available training data by an order of magnitude. Using these data in conjunction with DementiaBank is challenging because the WLS metadata corresponding to these transcripts do not contain dementia diagnoses. However, cognitive status of WLS participants can be inferred from results of several cognitive tests including semantic verbal fluency available in WLS data. In this work, we evaluate the utility of using the entire WLS corpus as normative data as well as selecting normative data based on the inferred cognitive status for training deep learning models to discriminate between language produced by patients with dementia and healthy controls. We find that incorporating WLS data during training a BERT model on ADReSS data improves its performance on the ADReSS dementia detection task, supporting the hypotheses that incorporating WLS data adds value in this context. We also demonstrate that weighted cost functions and additional prediction targets may be effective ways to address issues arising from class imbalance and confounding effects due to data provenance.

摘要

大量的标注数据是训练准确可靠的机器学习模型的先决条件。然而，特别是在医学领域，这也是一个绊脚石，因为准确标注的数据很难获得。痴呆症语料库（DementiaBank）是一个公开可用的自发语音样本语料库，来自一个广泛用于研究阿尔茨海默病（AD）患者语言特征以及训练分类模型以区分AD患者和健康对照的图片描述任务，它相对较小——当限制在通过自发语音进行阿尔茨海默病痴呆识别（ADReSS）挑战赛中使用的平衡子集时，这一局限性会进一步加剧。我们基于之前的工作展开，之前的工作表明，通过添加来自其他来源的规范数据，可以提高传统机器学习模型在痴呆症语料库上的性能，我们评估这种外部数据在进一步提高基于深度学习的先进方法在ADReSS挑战赛痴呆症检测任务中的性能方面的效用。为此，我们开发了一个来自威斯康星纵向研究（WLS）的专业转录录音的新语料库，产生了1366个额外的《偷饼干任务》转录本，使可用训练数据增加了一个数量级。将这些数据与痴呆症语料库结合使用具有挑战性，因为与这些转录本对应的WLS元数据不包含痴呆症诊断信息。然而，可以从包括WLS数据中可用的语义言语流畅性在内的多项认知测试结果推断WLS参与者的认知状态。在这项工作中，我们评估了将整个WLS语料库用作规范数据以及根据推断的认知状态选择规范数据以训练深度学习模型来区分痴呆症患者和健康对照产生的语言的效用。我们发现，在基于ADReSS数据训练BERT模型时纳入WLS数据可提高其在ADReSS痴呆症检测任务中的性能，支持了在这种情况下纳入WLS数据会增加价值的假设。我们还证明，加权成本函数和额外的预测目标可能是解决因类别不平衡和数据来源导致的混杂效应而产生的问题的有效方法。

相似文献

Crossing the 'Cookie Theft' Corpus Chasm: Applying what BERT Learns from Outside Data to the ADReSS Challenge Dementia Detection Task.跨越“曲奇盗窃”语料库的鸿沟：将BERT从外部数据中学到的知识应用于ADReSS挑战痴呆症检测任务

Front Comput Sci. 2021 Apr;3. doi: 10.3389/fcomp.2021.642517. Epub 2021 Apr 16.

Adapting Safety Plans for Autistic Adults with Involvement from the Autism Community.在自闭症群体的参与下为成年自闭症患者调整安全计划。

Autism Adulthood. 2025 May 28;7(3):293-302. doi: 10.1089/aut.2023.0124. eCollection 2025 Jun.

Stigma Management Strategies of Autistic Social Media Users.自闭症社交媒体用户的污名管理策略

Autism Adulthood. 2025 May 28;7(3):273-282. doi: 10.1089/aut.2023.0095. eCollection 2025 Jun.

"Just Ask What Support We Need": Autistic Adults' Feedback on Social Skills Training.“只需询问我们需要什么支持”：成年自闭症患者对社交技能培训的反馈

Autism Adulthood. 2025 May 28;7(3):283-292. doi: 10.1089/aut.2023.0136. eCollection 2025 Jun.

Understanding and Overcoming Negative Attitudes That Hinder Adoption of Reablement in Dementia Care: An Explorative Qualitative Study.理解并克服阻碍痴呆症护理中采用康复护理的消极态度：一项探索性定性研究

J Multidiscip Healthc. 2025 Jun 12;18:3411-3422. doi: 10.2147/JMDH.S522515. eCollection 2025.

Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.评估慢性阻塞性肺疾病干预措施的比较效果：面向临床医生的网状Meta分析教程

Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x.

Trajectory-Ordered Objectives for Self-Supervised Representation Learning of Temporal Healthcare Data Using Transformers: Model Development and Evaluation Study.使用Transformer进行时间序列医疗数据自监督表示学习的轨迹有序目标：模型开发与评估研究

JMIR Med Inform. 2025 Jun 4;13:e68138. doi: 10.2196/68138.

Community views on mass drug administration for soil-transmitted helminths: a qualitative evidence synthesis.社区对土壤传播蠕虫群体药物给药的看法：定性证据综合分析

Cochrane Database Syst Rev. 2025 Jun 20;6:CD015794. doi: 10.1002/14651858.CD015794.pub2.

Leveraging Cognitive and Speech Ecological Momentary Assessment in Individuals With Phenylketonuria: Development and Usability Study of Cognitive Fluctuations in a Rare Disease Population.利用认知和言语生态瞬时评估法研究苯丙酮尿症患者：罕见病群体认知波动的开发与可用性研究

JMIR Form Res. 2025 Jun 3;9:e63644. doi: 10.2196/63644.

Molecular feature-based classification of retroperitoneal liposarcoma: a prospective cohort study.基于分子特征的腹膜后脂肪肉瘤分类：一项前瞻性队列研究。

Elife. 2025 May 23;14:RP100887. doi: 10.7554/eLife.100887.

引用本文的文献

Audio and linguistic prediction of objective and subjective cognition in older adults: what is the role of different prompts?老年人客观和主观认知的听觉及语言预测：不同提示的作用是什么？

Front Psychiatry. 2025 Jul 1;16:1596132. doi: 10.3389/fpsyt.2025.1596132. eCollection 2025.

Tailoring task arithmetic to address bias in models trained on multi-institutional datasets.调整任务算法以解决在多机构数据集上训练的模型中的偏差问题。

J Biomed Inform. 2025 Aug;168:104858. doi: 10.1016/j.jbi.2025.104858. Epub 2025 Jun 8.

本文引用的文献

Verbal fluency in a national sample: Telephone administration methods.在全国样本中进行言语流畅性测试：电话管理方法。

Int J Geriatr Psychiatry. 2019 Apr;34(4):578-587. doi: 10.1002/gps.5054. Epub 2019 Jan 18.

Deep language space neural network for classifying mild cognitive impairment and Alzheimer-type dementia.用于分类轻度认知障碍和阿尔茨海默病型痴呆的深度语言空间神经网络。

PLoS One. 2018 Nov 7;13(11):e0205636. doi: 10.1371/journal.pone.0205636. eCollection 2018.

Predicting probable Alzheimer's disease using linguistic deficits and biomarkers.利用语言缺陷和生物标志物预测可能的阿尔茨海默病。

BMC Bioinformatics. 2017 Jan 14;18(1):34. doi: 10.1186/s12859-016-1456-0.

Cognitive Decline in a Colombian Kindred With Autosomal Dominant Alzheimer Disease: A Retrospective Cohort Study.哥伦比亚常染色体显性阿尔茨海默病家族中的认知衰退：一项回顾性队列研究。

JAMA Neurol. 2016 Apr;73(4):431-8. doi: 10.1001/jamaneurol.2015.4851.

Linguistic Features Identify Alzheimer's Disease in Narrative Speech.语言特征可在叙述性言语中识别阿尔茨海默病。

J Alzheimers Dis. 2016;49(2):407-22. doi: 10.3233/JAD-150520.

Cognitive impairment 18 years before clinical diagnosis of Alzheimer disease dementia.在阿尔茨海默病痴呆临床诊断前18年出现认知障碍。

Neurology. 2015 Sep 8;85(10):898-904. doi: 10.1212/WNL.0000000000001774. Epub 2015 Jun 24.

The dementia diagnosis: a literature review of information, understanding, and attributions.痴呆症诊断：关于信息、理解与归因的文献综述

Psychogeriatrics. 2015 Sep;15(3):218-25. doi: 10.1111/psyg.12095. Epub 2014 Dec 16.

Cohort profile: Wisconsin longitudinal study (WLS).队列简介：威斯康星纵向研究（WLS）。

Int J Epidemiol. 2014 Feb;43(1):34-41. doi: 10.1093/ije/dys194.

Missed and delayed diagnosis of dementia in primary care: prevalence and contributing factors.初级保健中痴呆的漏诊和延迟诊断：患病率和相关因素。

Alzheimer Dis Assoc Disord. 2009 Oct-Dec;23(4):306-14. doi: 10.1097/WAD.0b013e3181a6bebc.

Inequalities in dementia care across Europe: key findings of the Facing Dementia Survey.欧洲痴呆症护理方面的不平等：直面痴呆症调查的主要发现

Int J Clin Pract Suppl. 2005 Mar(146):8-14. doi: 10.1111/j.1368-504x.2005.00480.x.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。