Li Lei M, Lu Henry Horng-Shing
Molecular and Computational Biology Program, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA.
J Comput Biol. 2005 Mar;12(2):170-85. doi: 10.1089/cmb.2005.12.170.
We consider the structure of directed acyclic Boolean (DAB) networks as a tool for exploring biological pathways. In a DAB network, the basic objects are binary elements and their Boolean duals. A DAB is characterized by two kinds of pairwise relations: similarity and prerequisite. The latter is a partial order relation, namely, the on-status of one element is necessary for the on-status of another element. A DAB network is uniquely determined by the state space of its elements. We arrange samples from the state space of a DAB network in a binary array and introduce a random mechanism of measurement error. Our inference strategy consists of two stages. First, we consider each pair of elements and try to identify their most likely relation. In the meantime, we assign a score, s-p-score, to this relation. Second, we rank the s-p-scores obtained from the first stage. We expect that relations with smaller s-p-scores are more likely to be true, and those with larger s-p-scores are more likely to be false. The key idea is the definition of s-scores (referring to similarity), p-scores (referring to prerequisite), and s-p-scores. As with classical statistical tests, control of false negatives and false positives are our primary concerns. We illustrate the method by a simulated example, the classical arginine biosynthetic pathway, and show some exploratory results on a published microarray expression dataset of yeast Saccharomyces cerevisiae obtained from experiments with activation and genetic perturbation of the pheromone response MAPK pathway.
我们将有向无环布尔(DAB)网络的结构视为探索生物途径的一种工具。在一个DAB网络中,基本对象是二元元素及其布尔对偶。一个DAB由两种成对关系来表征:相似性和先决条件。后者是一种偏序关系,即一个元素的开启状态是另一个元素开启状态的必要条件。一个DAB网络由其元素的状态空间唯一确定。我们将来自DAB网络状态空间的样本排列在一个二元数组中,并引入测量误差的随机机制。我们的推理策略包括两个阶段。首先,我们考虑每一对元素,并试图确定它们最可能的关系。同时,我们为这种关系赋予一个分数,即s-p分数。其次,我们对从第一阶段获得的s-p分数进行排序。我们期望具有较小s-p分数的关系更可能是真实的,而具有较大s-p分数的关系更可能是错误的。关键思想是s分数(指相似性)、p分数(指先决条件)和s-p分数的定义。与经典统计检验一样,控制假阴性和假阳性是我们主要关注的问题。我们通过一个模拟示例、经典的精氨酸生物合成途径来说明该方法,并展示了对已发表的酿酒酵母微阵列表达数据集的一些探索性结果,该数据集是通过对信息素反应MAPK途径进行激活和基因扰动的实验获得的。