• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data Augmentation.MedDiffusion:通过基于扩散的数据增强提升健康风险预测
Proc SIAM Int Conf Data Min. 2024;2024:499-507. doi: 10.1137/1.9781611978032.58.
2
AutoMed: Automated Medical Risk Predictive Modeling on Electronic Health Records.自动医疗:基于电子健康记录的自动化医疗风险预测建模
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2022 Dec;2022:948-953. doi: 10.1109/bibm55620.2022.9995209.
3
InsightSleepNet: the interpretable and uncertainty-aware deep learning network for sleep staging using continuous Photoplethysmography.InsightSleepNet:一种可解释且具有不确定性感知能力的深度学习网络,用于使用连续光体积描记法进行睡眠分期。
BMC Med Inform Decis Mak. 2024 Feb 14;24(1):50. doi: 10.1186/s12911-024-02437-y.
4
Temporal self-attention for risk prediction from electronic health records using non-stationary kernel approximation.使用非平稳核逼近的电子健康记录风险预测的时间自注意力。
Artif Intell Med. 2024 Mar;149:102802. doi: 10.1016/j.artmed.2024.102802. Epub 2024 Feb 10.
5
TDASD: Generating medically significant fine-grained lung adenocarcinoma nodule CT images based on stable diffusion models with limited sample size.TDASD:基于有限样本量的稳定扩散模型生成具有医学意义的肺腺癌结节 CT 图像。
Comput Methods Programs Biomed. 2024 May;248:108103. doi: 10.1016/j.cmpb.2024.108103. Epub 2024 Mar 5.
6
A medical image classification method based on self-regularized adversarial learning.基于自正则化对抗学习的医学图像分类方法。
Med Phys. 2024 Nov;51(11):8232-8246. doi: 10.1002/mp.17320. Epub 2024 Jul 30.
7
PolypMixNet: Enhancing semi-supervised polyp segmentation with polyp-aware augmentation.PolypMixNet:利用息肉感知增强进行半监督息肉分割。
Comput Biol Med. 2024 Mar;170:108006. doi: 10.1016/j.compbiomed.2024.108006. Epub 2024 Jan 15.
8
Continuous patient state attention model for addressing irregularity in electronic health records.用于解决电子健康记录中不规则性的连续患者状态注意模型。
BMC Med Inform Decis Mak. 2024 May 3;24(1):117. doi: 10.1186/s12911-024-02514-2.
9
Utilization of Synthetic Near-Infrared Spectra via Generative Adversarial Network to Improve Wood Stiffness Prediction.利用生成对抗网络的合成近红外光谱提高木材硬度预测
Sensors (Basel). 2024 Mar 21;24(6):1992. doi: 10.3390/s24061992.
10
al-BERT: a semi-supervised denoising technique for disease prediction.al-BERT:一种用于疾病预测的半监督去噪技术。
BMC Med Inform Decis Mak. 2024 May 16;24(1):127. doi: 10.1186/s12911-024-02528-w.

引用本文的文献

1
Synthesizing Multimodal Electronic Health Records via Predictive Diffusion Models.通过预测扩散模型合成多模态电子健康记录
KDD. 2024 Aug;2024:4607-4618. doi: 10.1145/3637528.3671836. Epub 2024 Aug 24.
2
Recent Advances in Predictive Modeling with Electronic Health Records.电子健康记录预测建模的最新进展
IJCAI (U S). 2024 Aug;2024:8272-8280. doi: 10.24963/ijcai.2024/914.

本文引用的文献

1
How we treat primary immune thrombocytopenia in adults.成人原发免疫性血小板减少症的治疗方法。
J Hematol Oncol. 2023 Jan 19;16(1):4. doi: 10.1186/s13045-023-01401-z.
2
Image Super-Resolution via Iterative Refinement.通过迭代细化实现图像超分辨率
IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):4713-4726. doi: 10.1109/TPAMI.2022.3204461. Epub 2023 Mar 7.
3
Synthetic minority oversampling of vital statistics data with generative adversarial networks.基于生成对抗网络的生命统计数据合成少数族裔过采样。
J Am Med Inform Assoc. 2020 Nov 1;27(11):1667-1674. doi: 10.1093/jamia/ocaa127.
4
Interpretable Representation Learning for Healthcare via Capturing Disease Progression through Time.通过捕捉疾病随时间的进展实现医疗保健领域的可解释表示学习
KDD. 2018 Aug;2018:43-51. doi: 10.1145/3219819.3219904.
5
RetainVis: Visual Analytics with Interpretable and Interactive Recurrent Neural Networks on Electronic Medical Records.RetainVis:基于电子病历的可解释交互式循环神经网络的可视化分析
IEEE Trans Vis Comput Graph. 2018 Aug 20. doi: 10.1109/TVCG.2018.2865027.
6
MIMIC-III, a freely accessible critical care database.MIMIC-III,一个免费获取的重症监护数据库。
Sci Data. 2016 May 24;3:160035. doi: 10.1038/sdata.2016.35.
7
Vascular dementia in patients with immune thrombocytopenic purpura.免疫性血小板减少性紫癜患者的血管性痴呆
Thromb Res. 2002 Sep 15;107(6):337-44. doi: 10.1016/s0049-3848(02)00337-7.
8
Long short-term memory.长短期记忆
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.

MedDiffusion:通过基于扩散的数据增强提升健康风险预测

MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data Augmentation.

作者信息

Zhong Yuan, Cui Suhan, Wang Jiaqi, Wang Xiaochen, Yin Ziyi, Wang Yaqing, Xiao Houping, Huai Mengdi, Wang Ting, Ma Fenglong

机构信息

The Pennsylvania State University.

Purdue University.

出版信息

Proc SIAM Int Conf Data Min. 2024;2024:499-507. doi: 10.1137/1.9781611978032.58.

DOI:10.1137/1.9781611978032.58
PMID:39399239
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11469648/
Abstract

Health risk prediction aims to forecast the potential health risks that patients may face using their historical Electronic Health Records (EHR). Although several effective models have developed, data insufficiency is a key issue undermining their effectiveness. Various data generation and augmentation methods have been introduced to mitigate this issue by expanding the size of the training data set through learning underlying data distributions. However, the performance of these methods is often limited due to their task-unrelated design. To address these shortcomings, this paper introduces a novel, end-to-end diffusion-based risk prediction model, named MedDiffusion. It enhances risk prediction performance by creating synthetic patient data during training to enlarge sample space. Furthermore, MedDiffusion discerns hidden relationships between patient visits using a step-wise attention mechanism, enabling the model to automatically retain the most vital information for generating high-quality data. Experimental evaluation on four real-world medical datasets demonstrates that MedDiffusion outperforms 14 cutting-edge baselines in terms of PR-AUC, F1, and Cohen's Kappa. We also conduct ablation studies and benchmark our model against GAN-based alternatives to further validate the rationality and adaptability of our model design. Additionally, we analyze generated data to offer fresh insights into the model's interpretability. The source code is available via https://shorturl.at/aerT0.

摘要

健康风险预测旨在利用患者的历史电子健康记录(EHR)来预测患者可能面临的潜在健康风险。尽管已经开发了几种有效的模型,但数据不足是削弱其有效性的关键问题。已经引入了各种数据生成和增强方法,通过学习潜在数据分布来扩大训练数据集的大小,以缓解这个问题。然而,由于这些方法与任务无关的设计,它们的性能往往受到限制。为了解决这些缺点,本文介绍了一种新颖的、基于端到端扩散的风险预测模型,名为MedDiffusion。它通过在训练期间创建合成患者数据来扩大样本空间,从而提高风险预测性能。此外,MedDiffusion使用逐步注意力机制识别患者就诊之间的隐藏关系,使模型能够自动保留最重要的信息以生成高质量数据。对四个真实世界医疗数据集的实验评估表明,MedDiffusion在PR-AUC、F1和科恩卡帕方面优于14个前沿基线。我们还进行了消融研究,并将我们的模型与基于GAN的替代方案进行基准测试,以进一步验证我们模型设计的合理性和适应性。此外,我们分析生成的数据,以提供对模型可解释性的新见解。源代码可通过https://shorturl.at/aerT0获得。