• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于大规模稀疏数据的特征选择模型:利用初级保健电子健康记录预测痴呆症患者的住院情况。

Modeling Large Sparse Data for Feature Selection: Hospital Admission Predictions of the Dementia Patients Using Primary Care Electronic Health Records.

机构信息

Department of Computer ScienceSwansea UniversitySwanseaSA1 8ENU.K.

Institute of Life Science, Swansea UniversitySwanseaSA1 8ENU.K.

出版信息

IEEE J Transl Eng Health Med. 2020 Nov 24;9:3000113. doi: 10.1109/JTEHM.2020.3040236. eCollection 2021.

DOI:10.1109/JTEHM.2020.3040236
PMID:33354439
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7737850/
Abstract

A growing elderly population suffering from incurable, chronic conditions such as dementia present a continual strain on medical services due to mental impairment paired with high comorbidity resulting in increased hospitalization risk. The identification of at risk individuals allows for preventative measures to alleviate said strain. Electronic health records provide opportunity for big data analysis to address such applications. Such data however, provides a challenging problem space for traditional statistics and machine learning due to high dimensionality and sparse data elements. This article proposes a novel machine learning methodology: entropy regularization with ensemble deep neural networks (ECNN), which simultaneously provides high predictive performance of hospitalization of patients with dementia whilst enabling an interpretable heuristic analysis of the model architecture, able to identify individual features of importance within a large feature domain space. Experimental results on health records containing 54,647 features were able to identify 10 event indicators within a patient timeline: a collection of diagnostic events, medication prescriptions and procedural events, the highest ranked being essential hypertension. The resulting subset was still able to provide a highly competitive hospitalization prediction (Accuracy: 0.759) as compared to the full feature domain (Accuracy: 0.755) or traditional feature selection techniques (Accuracy: 0.737), a significant reduction in feature size. The discovery and heuristic evidence of correlation provide evidence for further clinical study of said medical events as potential novel indicators. There also remains great potential for adaption of ECNN within other medical big data domains as a data mining tool for novel risk factor identification.

摘要

不断增长的老年人口患有无法治愈的慢性疾病,如痴呆症,由于精神障碍与高合并症并存,导致住院风险增加,这给医疗服务带来了持续的压力。识别高危人群可以采取预防措施来减轻这种压力。电子健康记录为大数据分析提供了机会,以解决此类应用问题。然而,由于高维性和稀疏的数据元素,这种数据为传统的统计和机器学习提供了具有挑战性的问题空间。本文提出了一种新的机器学习方法:基于集成深度神经网络的熵正则化(ECNN),该方法同时提供了痴呆症患者住院的高预测性能,同时能够对模型结构进行可解释的启发式分析,从而能够识别出大特征域空间中的重要单个特征。在包含 54647 个特征的健康记录上进行的实验结果能够在患者时间线上识别出 10 个事件指标:一组诊断事件、药物处方和程序事件,排名最高的是原发性高血压。与完整特征域(Accuracy: 0.755)或传统特征选择技术(Accuracy: 0.737)相比,所得到的子集仍然能够提供高度竞争的住院预测(Accuracy: 0.759),这大大减少了特征的大小。相关性的发现和启发式证据为进一步研究这些医疗事件作为潜在的新指标提供了证据。ECNN 在其他医疗大数据领域作为一种新的风险因素识别数据挖掘工具,也具有很大的适应性潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc4/7737850/f31e20f6844c/xie6ab-3040236.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc4/7737850/0791d8d6eca3/xie1-3040236.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc4/7737850/58a081caba32/xie2-3040236.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc4/7737850/9ced76ec2837/xie3-3040236.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc4/7737850/4f803afff766/xie4-3040236.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc4/7737850/b067ea8b3db0/xie5-3040236.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc4/7737850/f31e20f6844c/xie6ab-3040236.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc4/7737850/0791d8d6eca3/xie1-3040236.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc4/7737850/58a081caba32/xie2-3040236.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc4/7737850/9ced76ec2837/xie3-3040236.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc4/7737850/4f803afff766/xie4-3040236.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc4/7737850/b067ea8b3db0/xie5-3040236.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc4/7737850/f31e20f6844c/xie6ab-3040236.jpg

相似文献

1
Modeling Large Sparse Data for Feature Selection: Hospital Admission Predictions of the Dementia Patients Using Primary Care Electronic Health Records.基于大规模稀疏数据的特征选择模型:利用初级保健电子健康记录预测痴呆症患者的住院情况。
IEEE J Transl Eng Health Med. 2020 Nov 24;9:3000113. doi: 10.1109/JTEHM.2020.3040236. eCollection 2021.
2
Machine Learning Model for Risk Prediction of Community-Acquired Acute Kidney Injury Hospitalization From Electronic Health Records: Development and Validation Study.基于电子健康记录的社区获得性急性肾损伤住院风险预测的机器学习模型:开发和验证研究。
J Med Internet Res. 2020 Aug 4;22(8):e16903. doi: 10.2196/16903.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records.动态可解释机器学习预测 ICU 患者死亡率:电子患者记录中高频数据的回顾性研究。
Lancet Digit Health. 2020 Apr;2(4):e179-e191. doi: 10.1016/S2589-7500(20)30018-2. Epub 2020 Mar 12.
5
Identifying the presence and severity of dementia by applying interpretable machine learning techniques on structured clinical records.通过在结构化临床记录上应用可解释的机器学习技术来识别痴呆的存在和严重程度。
BMC Med Inform Decis Mak. 2022 Oct 17;22(1):271. doi: 10.1186/s12911-022-02004-3.
6
Accurate Prediction of Coronary Heart Disease for Patients With Hypertension From Electronic Health Records With Big Data and Machine-Learning Methods: Model Development and Performance Evaluation.利用大数据和机器学习方法从电子健康记录中准确预测高血压患者的冠心病:模型开发与性能评估
JMIR Med Inform. 2020 Jul 6;8(7):e17257. doi: 10.2196/17257.
7
Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage.将机器学习中的手工特征与潜在变量相结合,以预测放射性肺损伤。
Med Phys. 2019 May;46(5):2497-2511. doi: 10.1002/mp.13497. Epub 2019 Apr 8.
8
Identification of clinical factors related to prediction of alcohol use disorder from electronic health records using feature selection methods.利用特征选择方法从电子健康记录中识别与预测酒精使用障碍相关的临床因素。
BMC Med Inform Decis Mak. 2022 Nov 23;22(1):304. doi: 10.1186/s12911-022-02051-w.
9
Federated learning of predictive models from federated Electronic Health Records.从联邦电子健康记录中联合学习预测模型。
Int J Med Inform. 2018 Apr;112:59-67. doi: 10.1016/j.ijmedinf.2018.01.007. Epub 2018 Jan 12.
10
Prediction of myopia development among Chinese school-aged children using refraction data from electronic medical records: A retrospective, multicentre machine learning study.基于电子病历中的屈光数据预测中国学龄儿童近视进展:一项回顾性、多中心机器学习研究。
PLoS Med. 2018 Nov 6;15(11):e1002674. doi: 10.1371/journal.pmed.1002674. eCollection 2018 Nov.

引用本文的文献

1
Medical history predicts phenome-wide disease onset and enables the rapid response to emerging health threats.病史可预测全表型疾病的发病情况,并能对新出现的健康威胁做出快速反应。
Nat Commun. 2025 Jan 10;16(1):585. doi: 10.1038/s41467-025-55879-x.
2
Predicting unplanned admissions to hospital in older adults using routinely recorded general practice data: development and validation of a prediction model.利用常规记录的全科医疗数据预测老年人的非计划性住院:预测模型的建立和验证。
Br J Gen Pract. 2024 Aug 29;74(746):e628-e636. doi: 10.3399/BJGP.2023.0350. Print 2024 Sep.
3
Application of machine learning in measurement of ageing and geriatric diseases: a systematic review.

本文引用的文献

1
Harnessing the Power of Machine Learning in Dementia Informatics Research: Issues, Opportunities, and Challenges.利用机器学习在痴呆症信息学研究中的力量:问题、机遇和挑战。
IEEE Rev Biomed Eng. 2020;13:113-129. doi: 10.1109/RBME.2019.2904488. Epub 2019 Mar 12.
2
Cox-nnet: An artificial neural network method for prognosis prediction of high-throughput omics data.Cox-nnet:一种用于高通量组学数据预后预测的人工神经网络方法。
PLoS Comput Biol. 2018 Apr 10;14(4):e1006076. doi: 10.1371/journal.pcbi.1006076. eCollection 2018 Apr.
3
Sequential regulatory activity prediction across chromosomes with convolutional neural networks.
机器学习在衰老和老年疾病测量中的应用:系统评价。
BMC Geriatr. 2023 Dec 12;23(1):841. doi: 10.1186/s12877-023-04477-x.
4
A systematic review of clinical health conditions predicted by machine learning diagnostic and prognostic models trained or validated using real-world primary health care data.基于真实初级医疗保健数据进行训练或验证的机器学习诊断和预后模型预测的临床健康状况的系统评价。
PLoS One. 2023 Sep 8;18(9):e0274276. doi: 10.1371/journal.pone.0274276. eCollection 2023.
5
Quercetin inhibits angiotensin II-induced vascular smooth muscle cell proliferation and activation of JAK2/STAT3 pathway: A target based networking pharmacology approach.槲皮素抑制血管紧张素II诱导的血管平滑肌细胞增殖及JAK2/STAT3信号通路激活:基于靶点的网络药理学方法
Front Pharmacol. 2022 Oct 17;13:1002363. doi: 10.3389/fphar.2022.1002363. eCollection 2022.
6
Appositeness of Optimized and Reliable Machine Learning for Healthcare: A Survey.优化且可靠的机器学习在医疗保健中的适用性:一项综述。
Arch Comput Methods Eng. 2022;29(6):3981-4003. doi: 10.1007/s11831-022-09733-8. Epub 2022 Mar 22.
7
Use of Machine Learning and Artificial Intelligence Methods in Geriatric Mental Health Research Involving Electronic Health Record or Administrative Claims Data: A Systematic Review.机器学习和人工智能方法在涉及电子健康记录或行政索赔数据的老年心理健康研究中的应用:一项系统综述。
Front Psychiatry. 2021 Sep 20;12:738466. doi: 10.3389/fpsyt.2021.738466. eCollection 2021.
基于卷积神经网络的跨染色体顺序调控活性预测
Genome Res. 2018 May;28(5):739-750. doi: 10.1101/gr.227819.117. Epub 2018 Mar 27.
4
Identifying incipient dementia individuals using machine learning and amyloid imaging.使用机器学习和淀粉样蛋白成像识别早期痴呆症患者。
Neurobiol Aging. 2017 Nov;59:80-90. doi: 10.1016/j.neurobiolaging.2017.06.027. Epub 2017 Jul 11.
5
Associations of lower vitamin D concentrations with cognitive decline and long-term risk of dementia and Alzheimer's disease in older adults.老年人维生素 D 浓度较低与认知能力下降及痴呆和阿尔茨海默病长期风险的关联。
Alzheimers Dement. 2017 Nov;13(11):1207-1216. doi: 10.1016/j.jalz.2017.03.003. Epub 2017 May 16.
6
The worldwide costs of dementia 2015 and comparisons with 2010.2015年全球痴呆症成本及与2010年的比较。
Alzheimers Dement. 2017 Jan;13(1):1-7. doi: 10.1016/j.jalz.2016.07.150. Epub 2016 Aug 29.
7
Socioeconomic Disparities and Mortality After a Diagnosis of Dementia: Results From a Nationwide Registry Linkage Study.痴呆症诊断后的社会经济差异与死亡率:一项全国性登记关联研究的结果
Am J Epidemiol. 2016 Aug 1;184(3):219-26. doi: 10.1093/aje/kwv319. Epub 2016 Jul 5.
8
Multiple anticholinergic medication use and risk of hospital admission for confusion or dementia.多种抗胆碱能药物的使用与因意识模糊或痴呆而住院的风险
J Am Geriatr Soc. 2014 Oct;62(10):1916-22. doi: 10.1111/jgs.13054. Epub 2014 Oct 3.
9
Machine learning approaches to diagnosis and laterality effects in semantic dementia discourse.语义性痴呆话语中诊断和偏侧性效应的机器学习方法。
Cortex. 2014 Jun;55:122-9. doi: 10.1016/j.cortex.2013.05.008. Epub 2013 Jun 14.
10
Causes of hospital admission for people with dementia: a systematic review and meta-analysis.痴呆患者住院的原因:系统评价和荟萃分析。
J Am Med Dir Assoc. 2013 Jul;14(7):463-70. doi: 10.1016/j.jamda.2013.01.011. Epub 2013 Mar 17.