Koumakis Lefteris, Kanterakis Alexandros, Kartsaki Evgenia, Chatzimina Maria, Zervakis Michalis, Tsiknakis Manolis, Vassou Despoina, Kafetzopoulos Dimitris, Marias Kostas, Moustakis Vassilis, Potamias George
Computational BioMedicine Laboratory (CBML), Institute of Computers Science (ICS), Foundation for Research and Technology-Hellas (FORTH), Heraklion, Crete, Greece.
School of Electrical and Computer Engineering, Technical University of Crete, Greece.
PLoS Comput Biol. 2016 Nov 10;12(11):e1005187. doi: 10.1371/journal.pcbi.1005187. eCollection 2016 Nov.
Pathway analysis methodologies couple traditional gene expression analysis with knowledge encoded in established molecular pathway networks, offering a promising approach towards the biological interpretation of phenotype differentiating genes. Early pathway analysis methodologies, named as gene set analysis (GSA), view pathways just as plain lists of genes without taking into account either the underlying pathway network topology or the involved gene regulatory relations. These approaches, even if they achieve computational efficiency and simplicity, consider pathways that involve the same genes as equivalent in terms of their gene enrichment characteristics. Most recent pathway analysis approaches take into account the underlying gene regulatory relations by examining their consistency with gene expression profiles and computing a score for each profile. Even with this approach, assessing and scoring single-relations limits the ability to reveal key gene regulation mechanisms hidden in longer pathway sub-paths. We introduce MinePath, a pathway analysis methodology that addresses and overcomes the aforementioned problems. MinePath facilitates the decomposition of pathways into their constituent sub-paths. Decomposition leads to the transformation of single-relations to complex regulation sub-paths. Regulation sub-paths are then matched with gene expression sample profiles in order to evaluate their functional status and to assess phenotype differential power. Assessment of differential power supports the identification of the most discriminant profiles. In addition, MinePath assess the significance of the pathways as a whole, ranking them by their p-values. Comparison results with state-of-the-art pathway analysis systems are indicative for the soundness and reliability of the MinePath approach. In contrast with many pathway analysis tools, MinePath is a web-based system (www.minepath.org) offering dynamic and rich pathway visualization functionality, with the unique characteristic to color regulatory relations between genes and reveal their phenotype inclination. This unique characteristic makes MinePath a valuable tool for in silico molecular biology experimentation as it serves the biomedical researchers' exploratory needs to reveal and interpret the regulatory mechanisms that underlie and putatively govern the expression of target phenotypes.
通路分析方法将传统的基因表达分析与已建立的分子通路网络中编码的知识相结合,为区分表型的基因进行生物学解释提供了一种很有前景的方法。早期的通路分析方法,称为基因集分析(GSA),仅将通路视为简单的基因列表,而不考虑潜在的通路网络拓扑结构或所涉及的基因调控关系。这些方法即使实现了计算效率和简单性,但在基因富集特征方面,认为涉及相同基因的通路是等效的。最新的通路分析方法通过检查基因调控关系与基因表达谱的一致性并为每个谱计算一个分数,来考虑潜在的基因调控关系。即使采用这种方法,评估和对单一关系进行评分也限制了揭示隐藏在较长通路子路径中的关键基因调控机制的能力。我们引入了MinePath,这是一种解决并克服上述问题的通路分析方法。MinePath有助于将通路分解为其组成子路径。分解导致单一关系转变为复杂的调控子路径。然后将调控子路径与基因表达样本谱进行匹配,以评估其功能状态并评估表型差异能力。差异能力的评估有助于识别最具区分性的谱。此外,MinePath评估通路整体的显著性,并按p值对它们进行排名。与现有最先进的通路分析系统的比较结果表明了MinePath方法的合理性和可靠性。与许多通路分析工具不同,MinePath是一个基于网络的系统(www.minepath.org),提供动态且丰富的通路可视化功能,具有独特的特性,即对基因之间的调控关系进行着色并揭示其表型倾向。这一独特特性使MinePath成为计算机模拟分子生物学实验的宝贵工具,因为它满足了生物医学研究人员探索性的需求,以揭示和解释潜在的并假定控制目标表型表达的调控机制。