• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于信息不平衡的最大信息量特征选择:在 COVID-19 严重程度预测中的应用。

Maximally informative feature selection using Information Imbalance: Application to COVID-19 severity prediction.

机构信息

International School for Advanced Studies (SISSA), Via Bonomea 265, 34136, Trieste, Italy.

Infectious Disease Unit, Azienda Sanitaria Universitaria Friuli Centrale (ASU FC), Via Pozzuolo 330, 33100, Udine, Italy.

出版信息

Sci Rep. 2024 May 10;14(1):10744. doi: 10.1038/s41598-024-61334-6.

DOI:10.1038/s41598-024-61334-6
PMID:38730063
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11087653/
Abstract

Clinical databases typically include, for each patient, many heterogeneous features, for example blood exams, the clinical history before the onset of the disease, the evolution of the symptoms, the results of imaging exams, and many others. We here propose to exploit a recently developed statistical approach, the Information Imbalance, to compare different subsets of patient features and automatically select the set of features that is maximally informative for a given clinical purpose, especially in minority classes. We adapt the Information Imbalance approach to work in a clinical framework, where patient features are often categorical and are generally available only for a fraction of the patients. We apply this algorithm to a data set of 1300 patients treated for COVID-19 in Udine hospital before October 2021. Using this approach, we find combinations of features which, if used in combination, are maximally informative of the clinical fate and of the severity of the disease. The optimal number of features, which is determined automatically, turns out to be between 10 and 15. These features can be measured at admission. The approach can be used also if the features are available only for a fraction of the patients, does not require imputation and, importantly, is able to automatically select features with small inter-feature correlation. Clinical insights deriving from this study are also discussed.

摘要

临床数据库通常包含每个患者的许多异质特征,例如血液检查、疾病发作前的临床病史、症状的演变、影像学检查结果等。在这里,我们建议利用最近开发的统计方法——信息不平衡,来比较患者特征的不同子集,并自动选择对特定临床目的最具信息量的特征集,尤其是在少数类中。我们将信息不平衡方法应用于一个临床框架中,其中患者特征通常是分类的,并且仅在患者的一部分中可用。我们将该算法应用于 2021 年 10 月之前在乌迪内医院治疗的 1300 名 COVID-19 患者的数据集中。通过使用这种方法,我们找到了特征的组合,如果组合使用,这些组合可以最大程度地反映临床结果和疾病的严重程度。最佳特征数量是自动确定的,结果介于 10 到 15 个之间。这些特征可以在入院时测量。如果特征仅在患者的一部分中可用,该方法也可以使用,不需要插补,并且重要的是,能够自动选择特征之间相关性较小的特征。还讨论了从这项研究中得出的临床见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/11087653/6bc4e63d5039/41598_2024_61334_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/11087653/13ffca713090/41598_2024_61334_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/11087653/ffa04885b3ce/41598_2024_61334_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/11087653/65bc7c148608/41598_2024_61334_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/11087653/6bc4e63d5039/41598_2024_61334_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/11087653/13ffca713090/41598_2024_61334_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/11087653/ffa04885b3ce/41598_2024_61334_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/11087653/65bc7c148608/41598_2024_61334_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/11087653/6bc4e63d5039/41598_2024_61334_Fig4_HTML.jpg

相似文献

1
Maximally informative feature selection using Information Imbalance: Application to COVID-19 severity prediction.基于信息不平衡的最大信息量特征选择:在 COVID-19 严重程度预测中的应用。
Sci Rep. 2024 May 10;14(1):10744. doi: 10.1038/s41598-024-61334-6.
2
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗或医院门诊环境中,出现症状和体征来确定患者是否患有 COVID-19。
Cochrane Database Syst Rev. 2021 Feb 23;2(2):CD013665. doi: 10.1002/14651858.CD013665.pub2.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
The selection of indicators from initial blood routine test results to improve the accuracy of early prediction of COVID-19 severity.从初始血常规检测结果中选择指标,提高 COVID-19 严重程度早期预测的准确性。
PLoS One. 2021 Jun 15;16(6):e0253329. doi: 10.1371/journal.pone.0253329. eCollection 2021.
5
Prediction Models for the Clinical Severity of Patients With COVID-19 in Korea: Retrospective Multicenter Cohort Study.韩国 COVID-19 患者临床严重程度预测模型:回顾性多中心队列研究。
J Med Internet Res. 2021 Apr 16;23(4):e25852. doi: 10.2196/25852.
6
Machine learning predictive model for severe COVID-19.机器学习预测严重 COVID-19 的模型。
Infect Genet Evol. 2021 Jun;90:104737. doi: 10.1016/j.meegid.2021.104737. Epub 2021 Jan 28.
7
Clinical Characterization and Prediction of Clinical Severity of SARS-CoV-2 Infection Among US Adults Using Data From the US National COVID Cohort Collaborative.利用美国国家 COVID 队列协作的数据,对美国成年人中 SARS-CoV-2 感染的临床特征和临床严重程度进行临床描述和预测。
JAMA Netw Open. 2021 Jul 1;4(7):e2116901. doi: 10.1001/jamanetworkopen.2021.16901.
8
Modelling of hypothetical SARS-CoV-2 point-of-care tests on admission to hospital from A&E: rapid cost-effectiveness analysis.急诊科入院时假设的 SARS-CoV-2 即时检测模型:快速成本效益分析。
Health Technol Assess. 2021 Mar;25(21):1-68. doi: 10.3310/hta25210.
9
Effectiveness and cost-effectiveness of four different strategies for SARS-CoV-2 surveillance in the general population (CoV-Surv Study): a structured summary of a study protocol for a cluster-randomised, two-factorial controlled trial.在普通人群中进行 SARS-CoV-2 监测的四种不同策略的有效性和成本效益(CoV-Surv 研究):一项关于集群随机、双因素对照试验的研究方案的结构化总结。
Trials. 2021 Jan 8;22(1):39. doi: 10.1186/s13063-020-04982-z.
10
Prediction and Feature Importance Analysis for Severity of COVID-19 in South Korea Using Artificial Intelligence: Model Development and Validation.利用人工智能预测和分析韩国 COVID-19 严重程度及特征重要性:模型建立与验证。
J Med Internet Res. 2021 Apr 19;23(4):e27060. doi: 10.2196/27060.

引用本文的文献

1
Investigating the relationship between the immune response and the severity of COVID-19: a large-cohort retrospective study.探究免疫反应与新型冠状病毒肺炎严重程度之间的关系:一项大型队列回顾性研究
Front Immunol. 2025 Jan 8;15:1452638. doi: 10.3389/fimmu.2024.1452638. eCollection 2024.
2
Automatic feature selection and weighting in molecular systems using Differentiable Information Imbalance.使用可微信息不平衡在分子系统中进行自动特征选择和加权。
Nat Commun. 2025 Jan 2;16(1):270. doi: 10.1038/s41467-024-55449-7.

本文引用的文献

1
Ranking the information content of distance measures.对距离度量的信息内容进行排序。
PNAS Nexus. 2022 Apr 14;1(2):pgac039. doi: 10.1093/pnasnexus/pgac039. eCollection 2022 May.
2
Cytokines from Bench to Bedside: A Retrospective Study Identifies a Definite Panel of Biomarkers to Early Assess the Risk of Negative Outcome in COVID-19 Patients.细胞因子从实验室到临床:一项回顾性研究确定了一组明确的生物标志物,可早期评估 COVID-19 患者不良结局的风险。
Int J Mol Sci. 2022 Apr 27;23(9):4830. doi: 10.3390/ijms23094830.
3
Antithrombin and Its Role in Host Defense and Inflammation.
抗凝血酶及其在宿主防御和炎症中的作用。
Int J Mol Sci. 2021 Apr 20;22(8):4283. doi: 10.3390/ijms22084283.
4
COVID-19 and liver disease: mechanistic and clinical perspectives.COVID-19 与肝病:机制与临床视角。
Nat Rev Gastroenterol Hepatol. 2021 May;18(5):348-364. doi: 10.1038/s41575-021-00426-4. Epub 2021 Mar 10.
5
Increased inflammatory markers correlate with liver damage and predict severe COVID-19: a systematic review and meta-analysis.炎症标志物升高与肝损伤相关,并可预测重症新型冠状病毒肺炎:一项系统评价和荟萃分析。
Gastroenterol Hepatol Bed Bench. 2020 Fall;13(4):282-291.
6
COVID-19 and the liver.COVID-19 与肝脏。
J Hepatol. 2020 Nov;73(5):1231-1240. doi: 10.1016/j.jhep.2020.06.006. Epub 2020 Jun 15.
7
Class prediction for high-dimensional class-imbalanced data.高维类别不平衡数据的类别预测。
BMC Bioinformatics. 2010 Oct 20;11:523. doi: 10.1186/1471-2105-11-523.
8
Covariance-regularized regression and classification for high-dimensional problems.用于高维问题的协方差正则化回归与分类
J R Stat Soc Series B Stat Methodol. 2009 Feb 20;71(3):615-636. doi: 10.1111/j.1467-9868.2009.00699.x.
9
Review: a gentle introduction to imputation of missing values.综述:缺失值插补的简要介绍
J Clin Epidemiol. 2006 Oct;59(10):1087-91. doi: 10.1016/j.jclinepi.2006.01.014. Epub 2006 Jul 11.
10
Estimating mutual information.估计互信息。
Phys Rev E Stat Nonlin Soft Matter Phys. 2004 Jun;69(6 Pt 2):066138. doi: 10.1103/PhysRevE.69.066138. Epub 2004 Jun 23.