Suppr超能文献

PAS结构域。基于结构预测对PAS结构域的重新定义。

The PAS fold. A redefinition of the PAS domain based upon structural prediction.

作者信息

Hefti Marco H, Françoijs Kees-Jan, de Vries Sacco C, Dixon Ray, Vervoort Jacques

机构信息

Laboratory of Biochemistry, Wageningen University, the Netherlands.

出版信息

Eur J Biochem. 2004 Mar;271(6):1198-208. doi: 10.1111/j.1432-1033.2004.04023.x.

Abstract

In the postgenomic era it is essential that protein sequences are annotated correctly in order to help in the assignment of their putative functions. Over 1300 proteins in current protein sequence databases are predicted to contain a PAS domain based upon amino acid sequence alignments. One of the problems with the current annotation of the PAS domain is that this domain exhibits limited similarity at the amino acid sequence level. It is therefore essential, when using proteins with low-sequence similarities, to apply profile hidden Markov model searches for the PAS domain-containing proteins, as for the PFAM database. From recent 3D X-ray and NMR structures, however, PAS domains appear to have a conserved 3D fold as shown here by structural alignment of the six representative 3D-structures from the PDB database. Large-scale modelling of the PAS sequences from the PFAM database against the 3D-structures of these six structural prototypes was performed. All 3D models generated (> 5700) were evaluated using prosaii. We conclude from our large-scale modelling studies that the PAS and PAC motifs (which are separately defined in the PFAM database) are directly linked and that these two motifs form the PAS fold. The existing subdivision in PAS and PAC motifs, as used by the PFAM and SMART databases, appears to be caused by major differences in sequences in the region connecting these two motifs. This region, as has been shown by Gardner and coworkers for human PAS kinase (Amezcua, C.A., Harper, S.M., Rutter, J. & Gardner, K.H. (2002) Structure 10, 1349-1361, [1]), is very flexible and adopts different conformations depending on the bound ligand. Some PAS sequences present in the PFAM database did not produce a good structural model, even after realignment using a structure-based alignment method, suggesting that these representatives are unlikely to have a fold resembling any of the structural prototypes of the PAS domain superfamily.

摘要

在后基因组时代,正确注释蛋白质序列对于帮助确定其假定功能至关重要。基于氨基酸序列比对,目前蛋白质序列数据库中有超过1300种蛋白质被预测含有一个PAS结构域。当前对PAS结构域注释存在的一个问题是,该结构域在氨基酸序列水平上的相似性有限。因此,在使用低序列相似性的蛋白质时,对于含PAS结构域的蛋白质,像在PFAM数据库中那样,应用轮廓隐马尔可夫模型搜索是必不可少的。然而,从最近的3D X射线和核磁共振结构来看,PAS结构域似乎具有保守的3D折叠,如通过PDB数据库中六个代表性3D结构的结构比对所示。针对这六个结构原型的3D结构,对PFAM数据库中的PAS序列进行了大规模建模。使用prosaii对生成的所有3D模型(超过5700个)进行了评估。我们从大规模建模研究中得出结论,PAS和PAC基序(在PFAM数据库中是分别定义的)直接相连,并且这两个基序形成了PAS折叠。PFAM和SMART数据库所采用的PAS和PAC基序的现有细分,似乎是由连接这两个基序的区域中序列的重大差异所导致的。正如Gardner及其同事对人PAS激酶所表明的那样(Amezcua, C.A., Harper, S.M., Rutter, J. & Gardner, K.H. (2002) Structure 10, 1349 - 1361, [1]),这个区域非常灵活,并且根据结合的配体采取不同的构象。即使使用基于结构的比对方法重新比对后,PFAM数据库中存在的一些PAS序列也没有产生良好的结构模型,这表明这些代表序列不太可能具有类似于PAS结构域超家族任何结构原型的折叠。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验