Department of Chemistry, Center for Drug Discovery, Design, and Delivery (CD4), Center for Scientific Computation, Southern Methodist University, Dallas, Texas, 75275.
Protein Sci. 2018 Feb;27(2):421-430. doi: 10.1002/pro.3329. Epub 2017 Nov 2.
PAS domains are widespread in archaea, bacteria, and eukaryota, and play important roles in various functions. In this study, we aim to explore functional evolutionary relationship among proteins in the PAS domain superfamily in view of the sequence-structure-dynamics-function relationship. We collected protein sequences and crystal structure data from RCSB Protein Data Bank of the PAS domain superfamily belonging to three biological functions (nucleotide binding, photoreceptor activity, and transferase activity). Protein sequences were aligned and then used to select sequence-conserved residues and build phylogenetic tree. Three-dimensional structure alignment was also applied to obtain structure-conserved residues. The protein dynamics were analyzed using elastic network model (ENM) and validated by molecular dynamics (MD) simulation. The result showed that the proteins with same function could be grouped by sequence similarity, and proteins in different functional groups displayed statistically significant difference in their vibrational patterns. Interestingly, in all three functional groups, conserved amino acid residues identified by sequence and structure conservation analysis generally have a lower fluctuation than other residues. In addition, the fluctuation of conserved residues in each biological function group was strongly correlated with the corresponding biological function. This research suggested a direct connection in which the protein sequences were related to various functions through structural dynamics. This is a new attempt to delineate functional evolution of proteins using the integrated information of sequence, structure, and dynamics.
PAS 结构域广泛存在于古菌、细菌和真核生物中,在各种功能中发挥着重要作用。在这项研究中,我们旨在根据序列-结构-动力学-功能关系,探索 PAS 结构域超家族中蛋白质的功能进化关系。我们从 RCSB Protein Data Bank 收集了属于三种生物功能(核苷酸结合、光受体活性和转移酶活性)的 PAS 结构域超家族的蛋白质序列和晶体结构数据。对蛋白质序列进行比对,然后用于选择序列保守残基并构建系统发育树。还应用三维结构比对获得结构保守残基。使用弹性网络模型(ENM)分析蛋白质动力学,并通过分子动力学(MD)模拟进行验证。结果表明,具有相同功能的蛋白质可以通过序列相似性进行分组,并且不同功能组的蛋白质在振动模式上存在统计学上显著差异。有趣的是,在所有三个功能组中,通过序列和结构保守性分析鉴定的保守氨基酸残基的波动通常低于其他残基。此外,每个生物学功能组中保守残基的波动与相应的生物学功能强烈相关。这项研究表明,蛋白质序列通过结构动力学与各种功能直接相关。这是使用序列、结构和动力学的综合信息来描绘蛋白质功能进化的新尝试。