Page David, Ong Irene M
Department of Biostatistics & Medical Informatics, University of Wisconsin, Madison, WI 53706, USA.
Pac Symp Biocomput. 2006:267-78.
Bayesian networks (BNs) and dynamic Bayesian networks (DBNs) are becoming more widely used as a way to learn various types of networks, including cellular signaling networks, from high-throughput data. Due to the high cost of performing experiments, we are interested in developing an experimental design for time series data generation. Specifically, we are interested in determining properties of time series data that make them more efficient for DBN modeling. We present a theoretical analysis on the ability of DBNs without hidden variables to learn from proteomic time series data. The analysis reveals, among other lessons, that under a reasonable set of assumptions a fixed budget is better spent on collecting many short time series data than on a few long time series data.
贝叶斯网络(BNs)和动态贝叶斯网络(DBNs)作为一种从高通量数据中学习包括细胞信号网络在内的各种类型网络的方法,正得到越来越广泛的应用。由于进行实验的成本高昂,我们有兴趣开发一种用于生成时间序列数据的实验设计。具体而言,我们感兴趣的是确定时间序列数据的属性,这些属性使其在DBN建模中更有效。我们对无隐藏变量的DBN从蛋白质组学时间序列数据中学习的能力进行了理论分析。该分析揭示了诸多要点,其中包括在一组合理的假设下,固定预算用于收集许多短时间序列数据比用于少数长时间序列数据效果更好。