• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用大语言模型进行认知诊断构建人类数字孪生(HDTwin):算法开发与验证

Building a Human Digital Twin (HDTwin) Using Large Language Models for Cognitive Diagnosis: Algorithm Development and Validation.

作者信息

Sprint Gina, Schmitter-Edgecombe Maureen, Cook Diane

机构信息

Department of Computer Science, Gonzaga University, Spokane, WA, United States.

School of Electrical Engineering and Computer Science, Washington State University, Pullman, WA, United States.

出版信息

JMIR Form Res. 2024 Dec 23;8:e63866. doi: 10.2196/63866.

DOI:10.2196/63866
PMID:39715540
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11704625/
Abstract

BACKGROUND

Human digital twins have the potential to change the practice of personalizing cognitive health diagnosis because these systems can integrate multiple sources of health information and influence into a unified model. Cognitive health is multifaceted, yet researchers and clinical professionals struggle to align diverse sources of information into a single model.

OBJECTIVE

This study aims to introduce a method called HDTwin, for unifying heterogeneous data using large language models. HDTwin is designed to predict cognitive diagnoses and offer explanations for its inferences.

METHODS

HDTwin integrates cognitive health data from multiple sources, including demographic, behavioral, ecological momentary assessment, n-back test, speech, and baseline experimenter testing session markers. Data are converted into text prompts for a large language model. The system then combines these inputs with relevant external knowledge from scientific literature to construct a predictive model. The model's performance is validated using data from 3 studies involving 124 participants, comparing its diagnostic accuracy with baseline machine learning classifiers.

RESULTS

HDTwin achieves a peak accuracy of 0.81 based on the automated selection of markers, significantly outperforming baseline classifiers. On average, HDTwin yielded accuracy=0.77, precision=0.88, recall=0.63, and Matthews correlation coefficient=0.57. In comparison, the baseline classifiers yielded average accuracy=0.65, precision=0.86, recall=0.35, and Matthews correlation coefficient=0.36. The experiments also reveal that HDTwin yields superior predictive accuracy when information sources are fused compared to single sources. HDTwin's chatbot interface provides interactive dialogues, aiding in diagnosis interpretation and allowing further exploration of patient data.

CONCLUSIONS

HDTwin integrates diverse cognitive health data, enhancing the accuracy and explainability of cognitive diagnoses. This approach outperforms traditional models and provides an interface for navigating patient information. The approach shows promise for improving early detection and intervention strategies in cognitive health.

摘要

背景

人类数字替身有潜力改变个性化认知健康诊断的实践,因为这些系统能够将多种健康信息源及影响因素整合到一个统一模型中。认知健康是多方面的,但研究人员和临床专业人员难以将各种不同的信息源整合到一个单一模型中。

目的

本研究旨在介绍一种名为HDTwin的方法,用于使用大语言模型统一异构数据。HDTwin旨在预测认知诊断并为其推理提供解释。

方法

HDTwin整合来自多个来源的认知健康数据,包括人口统计学、行为、生态瞬时评估、n-back测试、语音以及基线实验者测试会话标记。数据被转换为针对大语言模型的文本提示。然后,该系统将这些输入与来自科学文献的相关外部知识相结合,以构建一个预测模型。使用来自3项涉及124名参与者的研究的数据验证模型的性能,将其诊断准确性与基线机器学习分类器进行比较。

结果

基于标记的自动选择,HDTwin达到了0.81的峰值准确率,显著优于基线分类器。平均而言,HDTwin的准确率为0.77,精确率为0.88,召回率为0.63,马修斯相关系数为0.57。相比之下,基线分类器的平均准确率为0.65,精确率为0.86,召回率为0.35,马修斯相关系数为0.36。实验还表明,与单一信息源相比,当信息源融合时,HDTwin具有更高的预测准确率。HDTwin的聊天机器人界面提供交互式对话,有助于诊断解释并允许进一步探索患者数据。

结论

HDTwin整合了多样的认知健康数据,提高了认知诊断的准确性和可解释性。这种方法优于传统模型,并提供了一个浏览患者信息的界面。该方法在改善认知健康的早期检测和干预策略方面显示出前景。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6ff/11704625/3db2dab2c259/formative_v8i1e63866_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6ff/11704625/48579d510ae9/formative_v8i1e63866_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6ff/11704625/85c4cb7d7d92/formative_v8i1e63866_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6ff/11704625/012a3730d57e/formative_v8i1e63866_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6ff/11704625/3db2dab2c259/formative_v8i1e63866_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6ff/11704625/48579d510ae9/formative_v8i1e63866_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6ff/11704625/85c4cb7d7d92/formative_v8i1e63866_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6ff/11704625/012a3730d57e/formative_v8i1e63866_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6ff/11704625/3db2dab2c259/formative_v8i1e63866_fig4.jpg

相似文献

1
Building a Human Digital Twin (HDTwin) Using Large Language Models for Cognitive Diagnosis: Algorithm Development and Validation.使用大语言模型进行认知诊断构建人类数字孪生(HDTwin):算法开发与验证
JMIR Form Res. 2024 Dec 23;8:e63866. doi: 10.2196/63866.
2
A Machine Learning Approach with Human-AI Collaboration for Automated Classification of Patient Safety Event Reports: Algorithm Development and Validation Study.一种人机协作的机器学习方法用于患者安全事件报告的自动分类:算法开发与验证研究
JMIR Hum Factors. 2024 Jan 25;11:e53378. doi: 10.2196/53378.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
MMAgentRec, a personalized multi-modal recommendation agent with large language model.MMAgentRec,一个带有大语言模型的个性化多模态推荐代理。
Sci Rep. 2025 Apr 8;15(1):12062. doi: 10.1038/s41598-025-96458-w.
5
Noninvasive Oral Hyperspectral Imaging-Driven Digital Diagnosis of Heart Failure With Preserved Ejection Fraction: Model Development and Validation Study.非侵入性口腔高光谱成像驱动的射血分数保留的心力衰竭数字诊断:模型开发与验证研究。
J Med Internet Res. 2025 Jan 7;27:e67256. doi: 10.2196/67256.
6
Usability Comparison Among Healthy Participants of an Anthropomorphic Digital Human and a Text-Based Chatbot as a Responder to Questions on Mental Health: Randomized Controlled Trial.健康参与者对拟人化数字人和基于文本的聊天机器人作为心理健康问题回答者的可用性比较:随机对照试验。
JMIR Hum Factors. 2024 Apr 29;11:e54581. doi: 10.2196/54581.
7
Use of Retrieval-Augmented Large Language Model for COVID-19 Fact-Checking: Development and Usability Study.使用检索增强大语言模型进行COVID-19事实核查:开发与可用性研究。
J Med Internet Res. 2025 Apr 30;27:e66098. doi: 10.2196/66098.
8
Leveraging Medical Knowledge Graphs Into Large Language Models for Diagnosis Prediction: Design and Application Study.将医学知识图谱融入大语言模型进行诊断预测:设计与应用研究
JMIR AI. 2025 Feb 24;4:e58670. doi: 10.2196/58670.
9
Extraction of Substance Use Information From Clinical Notes: Generative Pretrained Transformer-Based Investigation.从临床记录中提取物质使用信息:基于生成式预训练变换器的研究
JMIR Med Inform. 2024 Aug 19;12:e56243. doi: 10.2196/56243.
10
An Artificial Intelligence Approach for Test-Free Identification of Sarcopenia.一种用于无测试识别肌肉减少症的人工智能方法。
J Cachexia Sarcopenia Muscle. 2024 Dec;15(6):2765-2780. doi: 10.1002/jcsm.13627. Epub 2024 Nov 8.

引用本文的文献

1
Digital Twins for Personalized Medicine Require Epidemiological Data and Mathematical Modeling: Viewpoint.个性化医疗的数字孪生需要流行病学数据和数学建模:观点
J Med Internet Res. 2025 Aug 5;27:e72411. doi: 10.2196/72411.

本文引用的文献

1
Capturing cognitive capacity in the everyday environment across a continuum of cognitive decline using a smartwatch n-back task and ecological momentary assessment.使用智能手表n-back任务和生态瞬时评估,在日常环境中捕捉认知能力在认知衰退连续体上的变化。
Neuropsychology. 2025 Jan;39(1):28-43. doi: 10.1037/neu0000984. Epub 2024 Nov 18.
2
Characterizing Longitudinal Patterns in Cognition, Mood, And Activity in Depression With 6-Week High-Frequency Wearable Assessment: Observational Study.用 6 周高频可穿戴评估来描述抑郁症患者认知、情绪和活动的纵向变化模式:观察性研究。
JMIR Ment Health. 2024 May 31;11:e46895. doi: 10.2196/46895.
3
The Association Between Physical Activity and Cognition in a Racially/Ethnically Diverse Cohort of Older Adults: Results From the Kaiser Healthy Aging and Diverse Life Experiences Study.
身体活动与种族/民族多样化老年人群认知之间的关联:凯撒健康老龄化与多元生活经历研究的结果
Alzheimer Dis Assoc Disord. 2024;38(2):120-127. doi: 10.1097/WAD.0000000000000612. Epub 2024 Mar 27.
4
The Breakthrough of Large Language Models Release for Medical Applications: 1-Year Timeline and Perspectives.大语言模型在医疗应用方面的突破:1 年时间线与展望。
J Med Syst. 2024 Feb 17;48(1):22. doi: 10.1007/s10916-024-02045-3.
5
LLM Multimodal Traffic Accident Forecasting.大语言模型多模态交通事故预测
Sensors (Basel). 2023 Nov 16;23(22):9225. doi: 10.3390/s23229225.
6
Evaluating large language models on medical evidence summarization.基于医学证据总结对大语言模型进行评估。
NPJ Digit Med. 2023 Aug 24;6(1):158. doi: 10.1038/s41746-023-00896-7.
7
AD-BERT: Using pre-trained language model to predict the progression from mild cognitive impairment to Alzheimer's disease.AD-BERT:利用预训练语言模型预测从轻度认知障碍到阿尔茨海默病的进展。
J Biomed Inform. 2023 Aug;144:104442. doi: 10.1016/j.jbi.2023.104442. Epub 2023 Jul 8.
8
Health system-scale language models are all-purpose prediction engines.健康系统规模的语言模型是通用的预测引擎。
Nature. 2023 Jul;619(7969):357-362. doi: 10.1038/s41586-023-06160-y. Epub 2023 Jun 7.
9
Association between physical activity and mild cognitive impairment in community-dwelling older adults: Depression as a mediator.社区居住的老年人身体活动与轻度认知障碍之间的关联:以抑郁作为中介因素
Front Aging Neurosci. 2022 Sep 8;14:964886. doi: 10.3389/fnagi.2022.964886. eCollection 2022.
10
BioGPT: generative pre-trained transformer for biomedical text generation and mining.BioGPT:用于生物医学文本生成和挖掘的生成式预训练转换器。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac409.