• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用人口统计学和用药史的大规模纵向数据预测阳性检测结果。

Predicting positive test results using large-scale longitudinal data of demographics and medication history.

作者信息

Pham Anh, El-Kareh Robert, Myers Frank, Ohno-Machado Lucila, Kuo Tsung-Ting

机构信息

Division of Biomedical Informatics, Department of Medicine, University of California San Diego, La Jolla, CA, USA.

UCSD Health System, San Diego, CA, USA.

出版信息

Heliyon. 2024 Dec 18;11(1):e41350. doi: 10.1016/j.heliyon.2024.e41350. eCollection 2025 Jan 15.

DOI:10.1016/j.heliyon.2024.e41350
PMID:39958729
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11825254/
Abstract

BACKGROUND

infection is a major health threat. Healthcare institutions have strong medical and financial incentives to keep infections under control. Blanket testing at admission is in general not recommended, and current predictive models either used moderate sample sizes, over-inflated the number of covariates, or chose non-interpretable algorithms. We aim to develop models using patient data to predict positive test results with discrimination performance, interpretable results, and a reasonable number of covariates that reflect health over a long-time span.

MATERIALS AND METHODS

We processed records from 157,493 University of California San Diego Health patients seen between January 01, 2016-July 03, 2019 with at least 6 months of medication history, excluding pregnant women, patients under 18, and prisoners. Three models (Logistic Regression, Random Forest, and Ensemble) were constructed using hyper-parameters selected through 10-fold cross-validation. Model performance was measured by the Area Under the Receiver Operating Characteristic Curve (AUROC). The model coefficients' odds ratios and p-values were calculated for the Logistic Regression model, as were Gini indices for Random Forest. Decision boundary analysis was conducted using pair-wise false positive and false negative cases each model would predict at a specific threshold.

RESULTS

Logistic Regression, Random Forest, and Ensemble models yielded test AUROCs of 0.839, 0.851, and 0.866, respectively. Significant covariates that may affect risk include age, immuno-compromised treatments, past antibiotic uses, and some medications for the gastrointestinal tract.

CONCLUSIONS

The models achieve high discrimination performance (AUROC >0.83). There is a general consensus among different analysis approaches regarding predictors that impact patients' chances of having a positive test, which may influence risk, including features clinically proven to increase susceptibility. These human-interpretable models can help distinguish significant predictors that affect a patient's chance of testing positive, which may influence their risk.

摘要

背景

感染是一项重大的健康威胁。医疗机构在控制感染方面有着强大的医学和经济动机。一般不建议在入院时进行全面检测,而且当前的预测模型要么使用的样本量适中,要么夸大了协变量的数量,要么选择了不可解释的算法。我们旨在利用患者数据开发模型,以预测检测呈阳性的结果,同时具备区分性能、可解释的结果以及反映长期健康状况的合理数量的协变量。

材料与方法

我们处理了2016年1月1日至2019年7月3日期间在加州大学圣地亚哥分校医疗中心就诊的157493名患者的记录,这些患者至少有6个月的用药史,排除了孕妇、18岁以下患者和囚犯。使用通过10折交叉验证选择的超参数构建了三种模型(逻辑回归、随机森林和集成模型)。模型性能通过受试者操作特征曲线下面积(AUROC)来衡量。计算了逻辑回归模型的模型系数的比值比和p值,以及随机森林的基尼指数。使用每个模型在特定阈值下预测的成对假阳性和假阴性病例进行决策边界分析。

结果

逻辑回归、随机森林和集成模型的测试AUROC分别为0.839、0.851和0.866。可能影响风险的显著协变量包括年龄、免疫抑制治疗、过去使用抗生素的情况以及一些胃肠道用药。

结论

这些模型具有较高的区分性能(AUROC>0.83)。不同分析方法在影响患者检测呈阳性几率的预测因素方面达成了普遍共识,这些因素可能会影响风险,包括经临床证实会增加易感性的特征。这些可人工解释的模型有助于区分影响患者检测呈阳性几率的重要预测因素,而这可能会影响他们的风险。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4f9/11825254/ec0f89834223/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4f9/11825254/b8ffa562eb01/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4f9/11825254/d675cad1e3d4/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4f9/11825254/6fdeac95facd/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4f9/11825254/ec0f89834223/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4f9/11825254/b8ffa562eb01/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4f9/11825254/d675cad1e3d4/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4f9/11825254/6fdeac95facd/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4f9/11825254/ec0f89834223/gr4.jpg

相似文献

1
Predicting positive test results using large-scale longitudinal data of demographics and medication history.利用人口统计学和用药史的大规模纵向数据预测阳性检测结果。
Heliyon. 2024 Dec 18;11(1):e41350. doi: 10.1016/j.heliyon.2024.e41350. eCollection 2025 Jan 15.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Validation of Clinical Risk Models for Clostridioides difficileAttributable Outcomes.临床风险模型对艰难梭菌感染相关结局的验证。
Antimicrob Agents Chemother. 2022 Jul 19;66(7):e0067622. doi: 10.1128/aac.00676-22. Epub 2022 Jun 21.
4
Systemic Inflammatory Mediators Are Effective Biomarkers for Predicting Adverse Outcomes in Clostridioides difficile Infection.系统炎症介质是预测艰难梭菌感染不良结局的有效生物标志物。
mBio. 2020 May 5;11(3):e00180-20. doi: 10.1128/mBio.00180-20.
5
Generalizability of predictive models for Clostridioides difficile infection, severity and recurrence at an urban safety-net hospital.预测模型在城市安全网医院中艰难梭菌感染、严重程度和复发的泛化能力。
J Hosp Infect. 2024 Apr;146:10-20. doi: 10.1016/j.jhin.2024.01.001. Epub 2024 Jan 12.
6
Improving Risk Prediction of Methicillin-Resistant Staphylococcus aureus Using Machine Learning Methods With Network Features: Retrospective Development Study.使用具有网络特征的机器学习方法改善耐甲氧西林金黄色葡萄球菌的风险预测:回顾性开发研究
JMIR AI. 2024 May 16;3:e48067. doi: 10.2196/48067.
7
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
8
A comparative analysis of machine learning approaches to predict C. difficile infection in hospitalized patients.机器学习方法预测住院患者艰难梭菌感染的比较分析。
Am J Infect Control. 2022 Mar;50(3):250-257. doi: 10.1016/j.ajic.2021.11.012. Epub 2022 Jan 20.
9
External Validation and Comparison of Clostridioides difficile Severity Scoring Systems.艰难梭菌严重程度评分系统的外部验证和比较。
Clin Infect Dis. 2022 Jun 10;74(11):2028-2035. doi: 10.1093/cid/ciab737.
10
Prediction of 28-Day All-Cause Mortality in Heart Failure Patients with Clostridioides difficile Infection Using Machine Learning Models: Evidence from the MIMIC-IV Database.使用机器学习模型预测艰难梭菌感染心力衰竭患者的28天全因死亡率:来自MIMIC-IV数据库的证据
Cardiology. 2025;150(2):133-144. doi: 10.1159/000540994. Epub 2024 Aug 17.

本文引用的文献

1
Predicting COVID-19 county-level case number trend by combining demographic characteristics and social distancing policies.通过结合人口特征和社会 distancing 政策预测县级 COVID-19 病例数趋势。 注:原文中“social distancing”直译为“社会距离”,在疫情语境下常指社交疏离等防控措施,这里可能是表述不完整,准确意思可能是“社交疏离政策” 。
JAMIA Open. 2022 Jun 25;5(3):ooac056. doi: 10.1093/jamiaopen/ooac056. eCollection 2022 Oct.
2
Generalizable prediction of COVID-19 mortality on worldwide patient data.基于全球患者数据对新冠肺炎死亡率进行可推广预测。
JAMIA Open. 2022 May 25;5(2):ooac036. doi: 10.1093/jamiaopen/ooac036. eCollection 2022 Jul.
3
Assessment of Federal Value-Based Incentive Programs and In-Hospital Clostridioides difficile Infection Rates.
评估联邦基于价值的激励计划与医院内艰难梭菌感染率
JAMA Netw Open. 2021 Oct 1;4(10):e2132114. doi: 10.1001/jamanetworkopen.2021.32114.
4
Bioaccumulation of therapeutic drugs by human gut bacteria.人类肠道细菌对治疗药物的生物累积。
Nature. 2021 Sep;597(7877):533-538. doi: 10.1038/s41586-021-03891-8. Epub 2021 Sep 8.
5
Clostridium difficile: Diagnosis and the Consequence of Over Diagnosis.艰难梭菌:诊断与过度诊断的后果
Infect Dis Ther. 2021 Jun;10(2):687-697. doi: 10.1007/s40121-021-00417-7. Epub 2021 Mar 26.
6
Privacy-preserving model learning on a blockchain network-of-networks.在区块链网络的网络上进行隐私保护模型学习。
J Am Med Inform Assoc. 2020 Mar 1;27(3):343-354. doi: 10.1093/jamia/ocz214.
7
Machine Learning-Based Predictive Modeling of Surgical Intervention in Glaucoma Using Systemic Data From Electronic Health Records.基于机器学习的电子健康记录系统数据预测青光眼手术干预
Am J Ophthalmol. 2019 Dec;208:30-40. doi: 10.1016/j.ajo.2019.07.005. Epub 2019 Jul 16.
8
Biocide Resistance and Transmission of Clostridium difficile Spores Spiked onto Clinical Surfaces from an American Health Care Facility.临床表面污染艰难梭菌孢子的抗药性和传播:来自美国医疗机构的研究。
Appl Environ Microbiol. 2019 Aug 14;85(17). doi: 10.1128/AEM.01090-19. Print 2019 Sep 1.
9
The revival of the Gini importance?基尼重要性的复兴?
Bioinformatics. 2018 Nov 1;34(21):3711-3718. doi: 10.1093/bioinformatics/bty373.
10
A Generalizable, Data-Driven Approach to Predict Daily Risk of Clostridium difficile Infection at Two Large Academic Health Centers.一种可推广的数据驱动方法,用于预测两个大型学术医疗中心的艰难梭菌感染的每日风险。
Infect Control Hosp Epidemiol. 2018 Apr;39(4):425-433. doi: 10.1017/ice.2018.16.