Earth-Life Science Institute, Tokyo Institute of Technology, Tokyo, Japan.
Blue Marble Space Institute of Science, Seattle, Washington, United States of America.
PLoS Comput Biol. 2022 Feb 14;18(2):e1009833. doi: 10.1371/journal.pcbi.1009833. eCollection 2022 Feb.
As sequence and structure comparison algorithms gain sensitivity, the intrinsic interconnectedness of the protein universe has become increasingly apparent. Despite this general trend, β-trefoils have emerged as an uncommon counterexample: They are an isolated protein lineage for which few, if any, sequence or structure associations to other lineages have been identified. If β-trefoils are, in fact, remote islands in sequence-structure space, it implies that the oligomerizing peptide that founded the β-trefoil lineage itself arose de novo. To better understand β-trefoil evolution, and to probe the limits of fragment sharing across the protein universe, we identified both 'β-trefoil bridging themes' (evolutionarily-related sequence segments) and 'β-trefoil-like motifs' (structure motifs with a hallmark feature of the β-trefoil architecture) in multiple, ostensibly unrelated, protein lineages. The success of the present approach stems, in part, from considering β-trefoil sequence segments or structure motifs rather than the β-trefoil architecture as a whole, as has been done previously. The newly uncovered inter-lineage connections presented here suggest a novel hypothesis about the origins of the β-trefoil fold itself-namely, that it is a derived fold formed by 'budding' from an Immunoglobulin-like β-sandwich protein. These results demonstrate how the evolution of a folded domain from a peptide need not be a signature of antiquity and underpin an emerging truth: few protein lineages escape nature's sewing table.
随着序列和结构比较算法的敏感性提高,蛋白质宇宙的内在关联性变得越来越明显。尽管存在这种总体趋势,但β-三叶草结构却成为一个罕见的反例:它们是一个孤立的蛋白质谱系,几乎没有(如果有的话)与其他谱系的序列或结构关联被确定。如果β-三叶草结构实际上是序列-结构空间中的遥远岛屿,这意味着构成β-三叶草谱系的寡聚肽本身是从头出现的。为了更好地理解β-三叶草结构的进化,并探究蛋白质宇宙中片段共享的极限,我们在多个表面上不相关的蛋白质谱系中识别了“β-三叶草桥接主题”(进化相关的序列片段)和“β-三叶草样基序”(具有β-三叶草结构标志性特征的结构基序)。本研究方法的成功部分源于考虑β-三叶草结构的序列片段或结构基序,而不是像以前那样将整个β-三叶草结构作为一个整体。这里新发现的谱系间连接表明了关于β-三叶草折叠本身起源的一个新假设,即它是由免疫球蛋白样β-三明治蛋白“萌芽”形成的衍生折叠。这些结果表明,从肽折叠结构域的进化不一定是古老的标志,并支持一个新兴的事实:很少有蛋白质谱系能逃脱自然的缝合台。