Suppr超能文献

一种采用平衡信息先验的贝叶斯建模方法来分析不均衡数据

A Bayesian Modelling Approach with Balancing Informative Prior for Analysing Imbalanced Data.

作者信息

Klein Kerenaftali, Hennig Stefanie, Paul Sanjoy Ketan

机构信息

Clinical Trials and Biostatistics Unit, QIMR Berghofer Medical Research Institute, 300 Herston Road, Brisbane, Australia.

School of Pharmacy, University of Queensland, 20 Cornwall Street, Brisbane, Australia.

出版信息

PLoS One. 2016 Apr 12;11(4):e0152700. doi: 10.1371/journal.pone.0152700. eCollection 2016.

Abstract

When a dataset is imbalanced, the prediction of the scarcely-sampled subpopulation can be over-influenced by the population contributing to the majority of the data. The aim of this study was to develop a Bayesian modelling approach with balancing informative prior so that the influence of imbalance to the overall prediction could be minimised. The new approach was developed in order to weigh the data in favour of the smaller subset(s). The method was assessed in terms of bias and precision in predicting model parameter estimates of simulated datasets. Moreover, the method was evaluated in predicting optimal dose levels of tobramycin for various age groups in a motivating example. The bias estimates using the balancing informative prior approach were smaller than those generated using the conventional approach which was without the consideration for the imbalance in the datasets. The precision estimates were also superior. The method was further evaluated in a motivating example of optimal dosage prediction of tobramycin. The resulting predictions also agreed well with what had been reported in the literature. The proposed Bayesian balancing informative prior approach has shown a real potential to adequately weigh the data in favour of smaller subset(s) of data to generate robust prediction models.

摘要

当数据集不均衡时,对采样稀少的亚群体的预测可能会受到构成大部分数据的群体的过度影响。本研究的目的是开发一种具有平衡信息先验的贝叶斯建模方法,以便将不均衡对整体预测的影响降至最低。开发新方法是为了权衡数据,使其有利于较小的子集。该方法在预测模拟数据集的模型参数估计时,从偏差和精度方面进行了评估。此外,在一个实际例子中,该方法用于预测不同年龄组妥布霉素的最佳剂量水平。使用平衡信息先验方法得到的偏差估计值小于使用不考虑数据集不均衡的传统方法产生的偏差估计值。精度估计也更优。该方法在妥布霉素最佳剂量预测的实际例子中得到了进一步评估。所得预测结果也与文献报道的结果非常吻合。所提出的贝叶斯平衡信息先验方法已显示出真正的潜力,能够充分权衡数据,有利于较小的数据子集,从而生成稳健的预测模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e677/4829197/20db289bf0e2/pone.0152700.g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验