• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

生存模型中删失数据对贝叶斯网络学习的影响。

Impact of censoring on learning Bayesian networks in survival modelling.

机构信息

Department of Automation, Electronics and Computing, Faculty of Engineering, University of Rijeka, Vukovarska 58, 51000 Rijeka, Croatia.

出版信息

Artif Intell Med. 2009 Nov;47(3):199-217. doi: 10.1016/j.artmed.2009.08.001. Epub 2009 Oct 14.

DOI:10.1016/j.artmed.2009.08.001
PMID:19833488
Abstract

OBJECTIVE

Bayesian networks are commonly used for presenting uncertainty and covariate interactions in an easily interpretable way. Because of their efficient inference and ability to represent causal relationships, they are an excellent choice for medical decision support systems in diagnosis, treatment, and prognosis. Although good procedures for learning Bayesian networks from data have been defined, their performance in learning from censored survival data has not been widely studied. In this paper, we explore how to use these procedures to learn about possible interactions between prognostic factors and their influence on the variate of interest. We study how censoring affects the probability of learning correct Bayesian network structures. Additionally, we analyse the potential usefulness of the learnt models for predicting the time-independent probability of an event of interest.

METHODS AND MATERIALS

We analysed the influence of censoring with a simulation on synthetic data sampled from randomly generated Bayesian networks. We used two well-known methods for learning Bayesian networks from data: a constraint-based method and a score-based method. We compared the performance of each method under different levels of censoring to those of the naive Bayes classifier and the proportional hazards model. We did additional experiments on several datasets from real-world medical domains. The machine-learning methods treated censored cases in the data as event-free.

RESULTS

We report and compare results for several commonly used model evaluation metrics. On average, the proportional hazards method outperformed other methods in most censoring setups. As part of the simulation study, we also analysed structural similarities of the learnt networks. Heavy censoring, as opposed to no censoring, produces up to a 5% surplus and up to 10% missing total arcs. It also produces up to 50% missing arcs that should originally be connected to the variate of interest.

CONCLUSION

Presented methods for learning Bayesian networks from data can be used to learn from censored survival data in the presence of light censoring (up to 20%) by treating censored cases as event-free. Given intermediate or heavy censoring, the learnt models become tuned to the majority class and would thus require a different approach.

摘要

目的

贝叶斯网络常用于以易于理解的方式呈现不确定性和协变量交互。由于其高效的推断能力和表示因果关系的能力,它们是诊断、治疗和预后中医疗决策支持系统的绝佳选择。尽管已经定义了从数据中学习贝叶斯网络的良好程序,但它们在从有 censored 生存数据中学习的性能尚未得到广泛研究。在本文中,我们探讨了如何使用这些程序来了解预后因素之间的可能相互作用及其对感兴趣变量的影响。我们研究了 censoring 如何影响学习正确贝叶斯网络结构的概率。此外,我们分析了学习模型在预测独立于时间的感兴趣事件的概率方面的潜在有用性。

方法和材料

我们使用从随机生成的贝叶斯网络中采样的合成数据进行模拟,分析 censoring 的影响。我们使用两种从数据中学习贝叶斯网络的知名方法:基于约束的方法和基于评分的方法。我们比较了每种方法在不同 censoring 水平下的性能与朴素贝叶斯分类器和比例风险模型的性能。我们还在来自真实医疗领域的几个数据集上进行了额外的实验。机器学习方法将数据中的 censored 案例视为无事件。

结果

我们报告并比较了几种常用模型评估指标的结果。平均而言,比例风险方法在大多数 censoring 设置中都优于其他方法。作为模拟研究的一部分,我们还分析了学习网络的结构相似性。与无 censoring 相比,重度 censoring 最多会产生 5%的额外和 10%的总缺失弧。它还会产生多达 50%的原本应连接到感兴趣变量的缺失弧。

结论

所提出的从数据中学习贝叶斯网络的方法可以用于在存在轻度 censoring(最多 20%)的情况下从 censored 生存数据中学习,将 censored 案例视为无事件。对于中等或重度 censoring,学习模型会针对多数类进行调整,因此需要采用不同的方法。

相似文献

1
Impact of censoring on learning Bayesian networks in survival modelling.生存模型中删失数据对贝叶斯网络学习的影响。
Artif Intell Med. 2009 Nov;47(3):199-217. doi: 10.1016/j.artmed.2009.08.001. Epub 2009 Oct 14.
2
Learning Bayesian networks from survival data using weighting censored instances.使用加权删失实例从生存数据中学习贝叶斯网络。
J Biomed Inform. 2010 Aug;43(4):613-22. doi: 10.1016/j.jbi.2010.03.005. Epub 2010 Mar 21.
3
Bayesian networks for multivariate data analysis and prognostic modelling in cardiac surgery.用于心脏手术中多变量数据分析和预后建模的贝叶斯网络。
Stud Health Technol Inform. 2007;129(Pt 1):596-600.
4
Prognostic Bayesian networks I: rationale, learning procedure, and clinical use.预后贝叶斯网络I:基本原理、学习过程及临床应用。
J Biomed Inform. 2007 Dec;40(6):609-18. doi: 10.1016/j.jbi.2007.07.003. Epub 2007 Jul 25.
5
Dynamic Bayesian networks as prognostic models for clinical patient management.动态贝叶斯网络作为临床患者管理的预后模型。
J Biomed Inform. 2008 Aug;41(4):515-29. doi: 10.1016/j.jbi.2008.01.006. Epub 2008 Feb 5.
6
A new machine learning classifier for high dimensional healthcare data.一种用于高维医疗数据的新型机器学习分类器。
Stud Health Technol Inform. 2007;129(Pt 1):664-8.
7
Exploiting missing clinical data in Bayesian network modeling for predicting medical problems.在贝叶斯网络建模中利用缺失临床数据预测医疗问题。
J Biomed Inform. 2008 Feb;41(1):1-14. doi: 10.1016/j.jbi.2007.06.001. Epub 2007 Jun 9.
8
A comparison of learning algorithms for Bayesian networks: a case study based on data from an emergency medical service.贝叶斯网络学习算法的比较:基于紧急医疗服务数据的案例研究
Artif Intell Med. 2004 Mar;30(3):215-32. doi: 10.1016/j.artmed.2003.11.002.
9
[Meta-analysis of the Italian studies on short-term effects of air pollution].[意大利关于空气污染短期影响研究的荟萃分析]
Epidemiol Prev. 2001 Mar-Apr;25(2 Suppl):1-71.
10
Bayesian random-effects threshold regression with application to survival data with nonproportional hazards.贝叶斯随机效应阈值回归及其在非比例风险生存数据中的应用。
Biostatistics. 2010 Jan;11(1):111-26. doi: 10.1093/biostatistics/kxp041. Epub 2009 Oct 14.

引用本文的文献

1
Limitations of Binary Classification for Long-Horizon Diagnosis Prediction and Advantages of a Discrete-Time Time-to-Event Approach: Empirical Analysis.长期诊断预测的二元分类局限性及离散时间事件发生时间方法的优势:实证分析
JMIR AI. 2025 Mar 27;4:e62985. doi: 10.2196/62985.
2
Development and Validation of a Bayesian Network-Based Model for Predicting Coronary Heart Disease Risk From Electronic Health Records.基于贝叶斯网络的电子健康记录预测冠心病风险模型的建立与验证。
J Am Heart Assoc. 2024 Jan 2;13(1):e029400. doi: 10.1161/JAHA.123.029400. Epub 2023 Dec 29.
3
Diabetes mellitus early warning and factor analysis using ensemble Bayesian networks with SMOTE-ENN and Boruta.
基于 SMOTE-ENN 和 Boruta 的集成贝叶斯网络对糖尿病进行早期预警和因素分析。
Sci Rep. 2023 Aug 5;13(1):12718. doi: 10.1038/s41598-023-40036-5.
4
A novel dynamic Bayesian network approach for data mining and survival data analysis.一种用于数据挖掘和生存数据分析的新型动态贝叶斯网络方法。
BMC Med Inform Decis Mak. 2022 Sep 22;22(1):251. doi: 10.1186/s12911-022-02000-7.
5
CondiS web app: imputation of censored lifetimes for machine learning-based survival analysis.CondiS 网络应用程序:基于机器学习的生存分析中删失寿命的推断。
Bioinformatics. 2022 Sep 2;38(17):4252-4254. doi: 10.1093/bioinformatics/btac461.
6
CondiS: A conditional survival distribution-based method for censored data imputation overcoming the hurdle in machine learning-based survival analysis.CondiS:一种基于条件生存分布的有删失数据插补方法,克服了基于机器学习的生存分析中的障碍。
J Biomed Inform. 2022 Jul;131:104117. doi: 10.1016/j.jbi.2022.104117. Epub 2022 Jun 9.
7
Application of a novel hybrid algorithm of Bayesian network in the study of hyperlipidemia related factors: a cross-sectional study.贝叶斯网络混合算法在高脂血症相关因素研究中的应用:一项横断面研究。
BMC Public Health. 2021 Jul 12;21(1):1375. doi: 10.1186/s12889-021-11412-5.
8
Learning rule sets from survival data.从生存数据中学习规则集。
BMC Bioinformatics. 2017 May 30;18(1):285. doi: 10.1186/s12859-017-1693-x.
9
Adapting machine learning techniques to censored time-to-event health record data: A general-purpose approach using inverse probability of censoring weighting.使机器学习技术适用于删失的事件发生时间健康记录数据:一种使用删失加权逆概率的通用方法。
J Biomed Inform. 2016 Jun;61:119-31. doi: 10.1016/j.jbi.2016.03.009. Epub 2016 Mar 16.
10
A Naive Bayes machine learning approach to risk prediction using censored, time-to-event data.一种使用删失的事件发生时间数据进行风险预测的朴素贝叶斯机器学习方法。
Stat Med. 2015 Sep 20;34(21):2941-57. doi: 10.1002/sim.6526. Epub 2015 May 18.