Department of Chemistry, Pennsylvania State University, University Park, PA, United States.
Institute for Computational and Data Sciences, Pennsylvania State University, University Park, PA, United States.
J Mol Biol. 2024 Mar 15;436(6):168459. doi: 10.1016/j.jmb.2024.168459. Epub 2024 Jan 30.
One-third of protein domains in the CATH database contain a recently discovered tertiary topological motif: non-covalent lasso entanglements, in which a segment of the protein backbone forms a loop closed by non-covalent interactions between residues and is threaded one or more times by the N- or C-terminal backbone segment. Unknown is how frequently this structural motif appears across the proteomes of organisms. And the correlation of these motifs with various classes of protein function and biological processes have not been quantified. Here, using a combination of protein crystal structures, AlphaFold2 predictions, and Gene Ontology terms we show that in E. coli, S. cerevisiae and H. sapiens that 71%, 52% and 49% of globular proteins contain one-or-more non-covalent lasso entanglements in their native fold, and that some of these are highly complex with multiple threading events. Further, proteins containing these tertiary motifs are consistently enriched in certain functions and biological processes across these organisms and depleted in others, strongly indicating an influence of evolutionary selection pressures acting positively and negatively on the distribution of these motifs. Together, these results demonstrate that non-covalent lasso entanglements are widespread and indicate they may be extensively utilized for protein function and subcellular processes, thus impacting phenotype.
三分之一的 CATH 数据库中的蛋白质结构域包含最近发现的三级拓扑结构模体:非共价套索环,其中蛋白质主链的一段通过残基之间的非共价相互作用形成一个环,并通过 N-或 C-末端主链段多次穿过。目前还不清楚这种结构模体在生物体的蛋白质组中出现的频率如何。而且,这些模体与各种类别的蛋白质功能和生物过程之间的相关性尚未量化。在这里,我们使用蛋白质晶体结构、AlphaFold2 预测和基因本体论术语的组合,表明在大肠杆菌、酿酒酵母和人类中,71%、52%和 49%的球状蛋白在其天然构象中含有一个或多个非共价套索环,其中一些具有多个穿线事件,非常复杂。此外,在这些生物体中,含有这些三级结构模体的蛋白质在某些功能和生物过程中始终富集,而在其他功能和生物过程中则耗尽,这强烈表明进化选择压力对这些模体的分布产生了积极和消极的影响。总之,这些结果表明非共价套索环非常普遍,并表明它们可能被广泛用于蛋白质功能和亚细胞过程,从而影响表型。