• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
EntroLLM: Leveraging Entropy and Large Language Model Embeddings for Enhanced Risk Prediction with Wearable Device Data.EntroLLM:利用熵和大语言模型嵌入技术,借助可穿戴设备数据增强风险预测。
AMIA Jt Summits Transl Sci Proc. 2025 Jun 10;2025:225-234. eCollection 2025.
2
Algorithmic Classification of Psychiatric Disorder-Related Spontaneous Communication Using Large Language Model Embeddings: Algorithm Development and Validation.使用大语言模型嵌入对精神障碍相关自发交流进行算法分类:算法开发与验证
JMIR AI. 2025 May 30;4:e67369. doi: 10.2196/67369.
3
Psychometric Evaluation of Large Language Model Embeddings for Personality Trait Prediction.用于人格特质预测的大语言模型嵌入的心理测量评估
J Med Internet Res. 2025 Jul 8;27:e75347. doi: 10.2196/75347.
4
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
5
A dataset and benchmark for hospital course summarization with adapted large language models.一个用于医院病程总结的数据集和基准测试,采用了适配的大语言模型。
J Am Med Inform Assoc. 2025 Mar 1;32(3):470-479. doi: 10.1093/jamia/ocae312.
6
Feasibility of cardiac-based seizure detection and prediction: A systematic review of non-invasive wearable sensor-based studies.基于心脏的癫痫检测和预测的可行性:基于非侵入性可穿戴传感器的研究的系统评价。
Epilepsia Open. 2024 Feb;9(1):41-59. doi: 10.1002/epi4.12854. Epub 2023 Nov 28.
7
Enhancing Pulmonary Disease Prediction Using Large Language Models With Feature Summarization and Hybrid Retrieval-Augmented Generation: Multicenter Methodological Study Based on Radiology Report.使用具有特征总结和混合检索增强生成功能的大语言模型增强肺部疾病预测:基于放射学报告的多中心方法学研究
J Med Internet Res. 2025 Jun 11;27:e72638. doi: 10.2196/72638.
8
Generalizable machine learning for stress monitoring from wearable devices: A systematic literature review.用于可穿戴设备压力监测的通用机器学习:系统文献综述
Int J Med Inform. 2023 May;173:105026. doi: 10.1016/j.ijmedinf.2023.105026. Epub 2023 Feb 28.
9
Using Large Language Models to Detect Depression From User-Generated Diary Text Data as a Novel Approach in Digital Mental Health Screening: Instrument Validation Study.利用大型语言模型从用户生成的日记文本数据中检测抑郁,作为数字心理健康筛查的新方法:仪器验证研究。
J Med Internet Res. 2024 Sep 18;26:e54617. doi: 10.2196/54617.
10
Fusing Domain Knowledge with a Fine-Tuned Large Language Model for Enhanced Molecular Property Prediction.将领域知识与微调的大语言模型融合以增强分子性质预测
J Chem Theory Comput. 2025 Jul 22;21(14):6743-6758. doi: 10.1021/acs.jctc.5c00605. Epub 2025 Jul 9.

本文引用的文献

1
Detecting hallucinations in large language models using semantic entropy.使用语义熵检测大型语言模型中的幻觉。
Nature. 2024 Jun;630(8017):625-630. doi: 10.1038/s41586-024-07421-0. Epub 2024 Jun 19.
2
Time series analysis in environmental epidemiology: challenges and considerations.环境流行病学中的时间序列分析:挑战与考虑因素。
Int J Occup Med Environ Health. 2023 Dec 15;36(6):704-716. doi: 10.13075/ijomeh.1896.02237. Epub 2023 Oct 2.
3
Time series big data: a survey on data stream frameworks, analysis and algorithms.时间序列大数据:关于数据流框架、分析与算法的综述
J Big Data. 2023;10(1):83. doi: 10.1186/s40537-023-00760-1. Epub 2023 May 28.
4
Generic medical concept embedding and time decay for diverse patient outcome prediction tasks.用于多种患者预后预测任务的通用医学概念嵌入和时间衰减
iScience. 2022 Aug 4;25(9):104880. doi: 10.1016/j.isci.2022.104880. eCollection 2022 Sep 16.
5
Does Physical Activity Predict Obesity-A Machine Learning and Statistical Method-Based Analysis.体力活动与肥胖的关系:基于机器学习和统计方法的分析。
Int J Environ Res Public Health. 2021 Apr 9;18(8):3966. doi: 10.3390/ijerph18083966.
6
A Predictive Model for Medical Events Based on Contextual Embedding of Temporal Sequences.一种基于时间序列上下文嵌入的医疗事件预测模型。
JMIR Med Inform. 2016 Nov 25;4(4):e39. doi: 10.2196/medinform.5977.
7
Associations between self-reported and objectively measured physical activity, sedentary behavior and overweight/obesity in NHANES 2003-2006.2003 - 2006年美国国家健康与营养检查调查(NHANES)中自我报告的与客观测量的身体活动、久坐行为及超重/肥胖之间的关联。
Int J Obes (Lond). 2017 Jan;41(1):186-193. doi: 10.1038/ijo.2016.168. Epub 2016 Sep 28.
8
National health and nutrition examination survey: analytic guidelines, 1999-2010.国家健康与营养检查调查:分析指南,1999 - 2010年
Vital Health Stat 2. 2013 Sep(161):1-24.
9
Amount of time spent in sedentary behaviors in the United States, 2003-2004.2003 - 2004年美国久坐行为的时长
Am J Epidemiol. 2008 Apr 1;167(7):875-81. doi: 10.1093/aje/kwm390. Epub 2008 Feb 25.

EntroLLM:利用熵和大语言模型嵌入技术,借助可穿戴设备数据增强风险预测。

EntroLLM: Leveraging Entropy and Large Language Model Embeddings for Enhanced Risk Prediction with Wearable Device Data.

作者信息

Huang Xueqing, Gu Tian

机构信息

Department of Biostatistics, Columbia University Mailman School of Public Health, New York, NY.

出版信息

AMIA Jt Summits Transl Sci Proc. 2025 Jun 10;2025:225-234. eCollection 2025.

PMID:40502232
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12150754/
Abstract

Wearable devices collect complex structured data with high-dimensional and time-series features that are challenging for traditional models to handle efficiently. We propose EntroLLM, a new method that combines entropy measures and the low-dimensional representation (embedding) generated from large language models (LLMs) to enhance risk prediction using wearable device data. In EntroLLM, the entropy quantifies the variability of a subject's physical activity patterns, while the LLM embedding approximates the latent temporal structure. We evaluate the feasibility and performance of EntroLLM using NHANES data to predict overweight status using demographics and physical activity collected from wearable devices. Results show that combining entropy with GPT-based embedding improves model performance compared to baseline models and other embedding techniques, leading to an average increase in AUC from 0.56 to 0.64. EntroLLM showcases the potential of combining entropy and LLM-based embedding and offers a promising approach to wearable device data analysis for predicting health outcomes.

摘要

可穿戴设备收集具有高维和时间序列特征的复杂结构化数据,传统模型难以有效处理这些数据。我们提出了EntroLLM,这是一种将熵度量与大语言模型(LLMs)生成的低维表示(嵌入)相结合的新方法,以增强使用可穿戴设备数据进行的风险预测。在EntroLLM中,熵量化了个体身体活动模式的变异性,而LLM嵌入则近似潜在的时间结构。我们使用美国国家健康与营养检查调查(NHANES)数据评估EntroLLM的可行性和性能,以利用从可穿戴设备收集的人口统计学和身体活动数据预测超重状态。结果表明,与基线模型和其他嵌入技术相比,将熵与基于GPT的嵌入相结合可提高模型性能,导致曲线下面积(AUC)平均从0.56提高到0.64。EntroLLM展示了结合熵和基于LLM的嵌入的潜力,并为可穿戴设备数据分析以预测健康结果提供了一种有前景的方法。