Kirk Isa Kristina, Weinhold Nils, Brunak Søren, Belling Kirstine
Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark.
Memorial Sloan Kettering Cancer Center, Computational Biology Program, New York, NY, United States of America.
PLoS One. 2017 Sep 14;12(9):e0179112. doi: 10.1371/journal.pone.0179112. eCollection 2017.
Conserved synteny denotes evolutionary preserved gene order across species. It is not well understood to which degree functional relationships between genes are preserved in syntenic blocks. Here we investigate whether protein-coding genes conserved in mammalian syntenic blocks encode gene products that serve the common functional purpose of interacting at protein level, i.e. connectivity. High connectivity among protein-protein interactions (PPIs) was only moderately associated with conserved synteny on a genome-wide scale. However, we observed a smaller subset of 3.6% of all syntenic blocks with high-confidence PPIs that had significantly higher connectivity than expected by random. Additionally, syntenic blocks with high-confidence PPIs contained significantly more chromatin loops than the remaining blocks, indicating functional preservation among these syntenic blocks. Conserved synteny is typically defined by sequence similarity. In this study, we also examined whether a functional relationship, here PPI connectivity, can identify syntenic blocks independently of orthology. While orthology-based syntenic blocks with high-confident PPIs and the connectivity-based syntenic blocks largely overlapped, the connectivity-based approach identified additional syntenic blocks that were not found by conventional sequence-based methods alone. Additionally, the connectivity-based approach enabled identification of potential orthologous genes between species. Our analyses demonstrate that subsets of syntenic blocks are associated with highly connected proteins, and that PPI connectivity can be used to detect conserved synteny even if sequence conservation drifts beyond what orthology algorithms normally can identify.
保守同线性表示跨物种进化保留的基因顺序。目前尚不清楚基因之间的功能关系在同线性区域中保留的程度。在这里,我们研究了在哺乳动物同线性区域中保守的蛋白质编码基因是否编码在蛋白质水平上具有相互作用这一共同功能目的的基因产物,即连通性。在全基因组范围内,蛋白质 - 蛋白质相互作用(PPI)之间的高连通性仅与保守同线性存在适度关联。然而,我们观察到所有具有高可信度PPI的同线性区域中有3.6%的较小子集,其连通性显著高于随机预期。此外,具有高可信度PPI的同线性区域比其余区域包含更多的染色质环,这表明这些同线性区域之间存在功能保留。保守同线性通常由序列相似性定义。在本研究中,我们还研究了一种功能关系,即这里的PPI连通性,是否可以独立于直系同源性来识别同线性区域。虽然基于直系同源性且具有高可信度PPI的同线性区域与基于连通性的同线性区域在很大程度上重叠,但基于连通性的方法识别出了仅靠传统基于序列的方法未发现的额外同线性区域。此外,基于连通性的方法能够识别物种之间潜在的直系同源基因。我们的分析表明,同线性区域的子集与高度连通的蛋白质相关,并且即使序列保守性偏离了直系同源性算法通常能够识别的范围,PPI连通性也可用于检测保守同线性。