Department of Computing, Faculty of Engineering, University of Rijeka, Vukovarska 58, 51000 Rijeka, Croatia.
J Biomed Inform. 2010 Aug;43(4):613-22. doi: 10.1016/j.jbi.2010.03.005. Epub 2010 Mar 21.
Different survival data pre-processing procedures and adaptations of existing machine-learning techniques have been successfully applied to numerous fields in clinical medicine. Zupan et al. (2000) proposed handling censored survival data by assigning distributions of outcomes to shortly observed censored instances. In this paper, we applied their learning technique to two well-known procedures for learning Bayesian networks: a search-and-score hill-climbing algorithm and a constraint-based conditional independence algorithm. The method was thoroughly tested in a simulation study and on the publicly available clinical dataset GBSG2. We compared it to learning Bayesian networks by treating censored instances as event-free and to Cox regression. The results on model performance suggest that the weighting approach performs best when dealing with intermediate censoring. There is no significant difference between the model structures learnt using either the weighting approach or by treating censored instances as event-free, regardless of censoring.
不同的生存数据预处理程序和现有的机器学习技术的改编已成功应用于临床医学的众多领域。Zupan 等人(2000 年)提出通过将结果的分布分配给短时间观察到的删失实例来处理删失生存数据。在本文中,我们将他们的学习技术应用于学习贝叶斯网络的两种著名方法:搜索和评分爬山算法和基于约束的条件独立性算法。该方法在模拟研究和公开的临床数据集 GBSG2 上进行了彻底的测试。我们将其与将删失实例视为无事件的学习贝叶斯网络和 Cox 回归进行了比较。模型性能的结果表明,加权方法在处理中度删失时表现最佳。无论删失情况如何,使用加权方法或将删失实例视为无事件来学习模型结构之间都没有显著差异。