Department of Computer Science, Virginia Tech, Blacksburg, VA, USA.
Bioinformatics. 2013 Mar 1;29(5):622-9. doi: 10.1093/bioinformatics/btt007. Epub 2013 Jan 12.
Many techniques have been developed to compute the response network of a cell. A recent trend in this area is to compute response networks of small size, with the rationale that only part of a pathway is often changed by disease and that interpreting small subnetworks is easier than interpreting larger ones. However, these methods may not uncover the spectrum of pathways perturbed in a particular experiment or disease.
To avoid these difficulties, we propose to use algorithms that reconcile case-control DNA microarray data with a molecular interaction network by modifying per-gene differential expression P-values such that two genes connected by an interaction show similar changes in their gene expression values. We provide a novel evaluation of four methods from this class of algorithms. We enumerate three desirable properties that this class of algorithms should address. These properties seek to maintain that the returned gene rankings are specific to the condition being studied. Moreover, to ease interpretation, highly ranked genes should participate in coherent network structures and should be functionally enriched with relevant biological pathways. We comprehensively evaluate the extent to which each algorithm addresses these properties on a compendium of gene expression data for 54 diverse human diseases. We show that the reconciled gene rankings can identify novel disease-related functions that are missed by analyzing expression data alone.
C++ software implementing our algorithms is available in the NetworkReconciliation package as part of the Biorithm software suite under the GNU General Public License: http://bioinformatics.cs.vt.edu/∼murali/software/biorithm-docs.
已经开发出许多技术来计算细胞的反应网络。该领域的一个最新趋势是计算小尺寸的反应网络,其基本原理是,疾病通常仅改变途径的一部分,并且解释小的子网比解释更大的子网更容易。但是,这些方法可能无法揭示特定实验或疾病中受扰途径的范围。
为了避免这些困难,我们建议使用通过修改每个基因的差异表达 P 值来使病例对照 DNA 微阵列数据与分子相互作用网络相协调的算法,使得通过相互作用连接的两个基因在其基因表达值上显示出相似的变化。我们对该算法类中的四种方法进行了新颖的评估。我们列举了该算法类应解决的三个理想特性。这些特性旨在保持返回的基因排名特定于正在研究的条件。此外,为了便于解释,排名较高的基因应参与连贯的网络结构,并且应在功能上与相关的生物学途径丰富。我们全面评估了每种算法在包含 54 种不同人类疾病的基因表达数据综合集中解决这些特性的程度。我们表明,通过仅分析表达数据,协调后的基因排名可以识别出被忽略的新的疾病相关功能。
我们的算法的 C++软件可作为 Biorithm 软件套件的 NetworkReconciliation 包的一部分,根据 GNU 通用公共许可证获得:http://bioinformatics.cs.vt.edu/∼murali/software/biorithm-docs。