IEEE/ACM Trans Comput Biol Bioinform. 2018 May-Jun;15(3):869-877. doi: 10.1109/TCBB.2016.2642184. Epub 2016 Dec 20.
Detecting perturbation in modular structure during HIV-1 disease progression is an important step to understand stage specific infection pattern of HIV-1 virus in human cell. In this article, we proposed a novel methodology on integration of multiple biological information to identify such disruption in human gene module during different stages of HIV-1 infection. We integrate three different biological information: gene expression information, protein-protein interaction information, and gene ontology information in single gene meta-module, through non negative matrix factorization (NMF). As the identified meta-modules inherit those information so, detecting perturbation of these, reflects the changes in expression pattern, in PPI structure and in functional similarity of genes during the infection progression. To integrate modules of different data sources into strong meta-modules, NMF based clustering is utilized here. Perturbation in meta-modular structure is identified by investigating the topological and intramodular properties and putting rank to those meta-modules using a rank aggregation algorithm. We have also analyzed the preservation structure of significant GO terms in which the human proteins of the meta-modules participate. Moreover, we have performed an analysis to show the change of coregulation pattern of identified transcription factors (TFs) over the HIV progression stages.
检测 HIV-1 疾病进展过程中模块结构的扰动是理解 HIV-1 病毒在人体细胞中特定阶段感染模式的重要步骤。在本文中,我们提出了一种新的方法,将多种生物信息整合到人类基因模块中,以识别 HIV-1 感染不同阶段的这种破坏。我们通过非负矩阵分解(NMF)将三种不同的生物信息:基因表达信息、蛋白质-蛋白质相互作用信息和基因本体论信息整合到单个基因元模块中。由于所识别的元模块继承了这些信息,因此检测这些元模块的扰动反映了在感染进展过程中基因表达模式、PPI 结构和功能相似性的变化。为了将不同数据源的模块集成到强元模块中,这里使用了基于 NMF 的聚类。通过研究拓扑和模块内特性以及使用秩聚合算法对这些元模块进行排序,来识别元模块结构的扰动。我们还分析了元模块中参与的人类蛋白质的显著 GO 术语的保存结构。此外,我们还进行了分析,以显示在 HIV 进展阶段识别的转录因子(TFs)的核心调控模式的变化。