PAS结构域。基于结构预测对PAS结构域的重新定义。

The PAS fold. A redefinition of the PAS domain based upon structural prediction.

作者信息

Hefti Marco H, Françoijs Kees-Jan, de Vries Sacco C, Dixon Ray, Vervoort Jacques

机构信息

Laboratory of Biochemistry, Wageningen University, the Netherlands.

出版信息

Eur J Biochem. 2004 Mar;271(6):1198-208. doi: 10.1111/j.1432-1033.2004.04023.x.

DOI:10.1111/j.1432-1033.2004.04023.x

PMID:15009198

Abstract

In the postgenomic era it is essential that protein sequences are annotated correctly in order to help in the assignment of their putative functions. Over 1300 proteins in current protein sequence databases are predicted to contain a PAS domain based upon amino acid sequence alignments. One of the problems with the current annotation of the PAS domain is that this domain exhibits limited similarity at the amino acid sequence level. It is therefore essential, when using proteins with low-sequence similarities, to apply profile hidden Markov model searches for the PAS domain-containing proteins, as for the PFAM database. From recent 3D X-ray and NMR structures, however, PAS domains appear to have a conserved 3D fold as shown here by structural alignment of the six representative 3D-structures from the PDB database. Large-scale modelling of the PAS sequences from the PFAM database against the 3D-structures of these six structural prototypes was performed. All 3D models generated (> 5700) were evaluated using prosaii. We conclude from our large-scale modelling studies that the PAS and PAC motifs (which are separately defined in the PFAM database) are directly linked and that these two motifs form the PAS fold. The existing subdivision in PAS and PAC motifs, as used by the PFAM and SMART databases, appears to be caused by major differences in sequences in the region connecting these two motifs. This region, as has been shown by Gardner and coworkers for human PAS kinase (Amezcua, C.A., Harper, S.M., Rutter, J. & Gardner, K.H. (2002) Structure 10, 1349-1361, [1]), is very flexible and adopts different conformations depending on the bound ligand. Some PAS sequences present in the PFAM database did not produce a good structural model, even after realignment using a structure-based alignment method, suggesting that these representatives are unlikely to have a fold resembling any of the structural prototypes of the PAS domain superfamily.

摘要

在后基因组时代，正确注释蛋白质序列对于帮助确定其假定功能至关重要。基于氨基酸序列比对，目前蛋白质序列数据库中有超过1300种蛋白质被预测含有一个PAS结构域。当前对PAS结构域注释存在的一个问题是，该结构域在氨基酸序列水平上的相似性有限。因此，在使用低序列相似性的蛋白质时，对于含PAS结构域的蛋白质，像在PFAM数据库中那样，应用轮廓隐马尔可夫模型搜索是必不可少的。然而，从最近的3D X射线和核磁共振结构来看，PAS结构域似乎具有保守的3D折叠，如通过PDB数据库中六个代表性3D结构的结构比对所示。针对这六个结构原型的3D结构，对PFAM数据库中的PAS序列进行了大规模建模。使用prosaii对生成的所有3D模型（超过5700个）进行了评估。我们从大规模建模研究中得出结论，PAS和PAC基序（在PFAM数据库中是分别定义的）直接相连，并且这两个基序形成了PAS折叠。PFAM和SMART数据库所采用的PAS和PAC基序的现有细分，似乎是由连接这两个基序的区域中序列的重大差异所导致的。正如Gardner及其同事对人PAS激酶所表明的那样（Amezcua, C.A., Harper, S.M., Rutter, J. & Gardner, K.H. (2002) Structure 10, 1349 - 1361, [1]），这个区域非常灵活，并且根据结合的配体采取不同的构象。即使使用基于结构的比对方法重新比对后，PFAM数据库中存在的一些PAS序列也没有产生良好的结构模型，这表明这些代表序列不太可能具有类似于PAS结构域超家族任何结构原型的折叠。

相似文献

The PAS fold. A redefinition of the PAS domain based upon structural prediction.

Eur J Biochem. 2004 Mar;271(6):1198-208. doi: 10.1111/j.1432-1033.2004.04023.x.

SUPFAM--a database of potential protein superfamily relationships derived by comparing sequence-based and structure-based families: implications for structural genomics and function annotation in genomes.

Nucleic Acids Res. 2002 Jan 1;30(1):289-93. doi: 10.1093/nar/30.1.289.

Photoactive yellow protein: a structural prototype for the three-dimensional fold of the PAS domain superfamily.

Proc Natl Acad Sci U S A. 1998 May 26;95(11):5884-90. doi: 10.1073/pnas.95.11.5884.

Structural and functional studies of S-adenosyl-L-methionine binding proteins: a ligand-centric approach.

BMC Struct Biol. 2013 Apr 25;13:6. doi: 10.1186/1472-6807-13-6.

Assignment of protein sequences to existing domain and family classification systems: Pfam and the PDB.

Bioinformatics. 2012 Nov 1;28(21):2763-72. doi: 10.1093/bioinformatics/bts533. Epub 2012 Aug 31.

Identifying protein domains with the Pfam database.

Curr Protoc Bioinformatics. 2003 May;Chapter 2:Unit 2.5. doi: 10.1002/0471250953.bi0205s01.

PASS2: an automated database of protein alignments organised as structural superfamilies.

BMC Bioinformatics. 2004 Apr 2;5:35. doi: 10.1186/1471-2105-5-35.

Iterative sequence/secondary structure search for protein homologs: comparison with amino acid sequence alignments and application to fold recognition in genome databases.

Bioinformatics. 2000 Nov;16(11):988-1002. doi: 10.1093/bioinformatics/16.11.988.

Structures of the first representatives of Pfam family PF06938 (DUF1285) reveal a new fold with repeated structural motifs and possible involvement in signal transduction.

Acta Crystallogr Sect F Struct Biol Cryst Commun. 2010 Oct 1;66(Pt 10):1218-25. doi: 10.1107/S1744309109050416. Epub 2010 Mar 5.

Pfam: a comprehensive database of protein domain families based on seed alignments.

Proteins. 1997 Jul;28(3):405-20. doi: 10.1002/(sici)1097-0134(199707)28:3<405::aid-prot10>3.0.co;2-l.

引用本文的文献

Protein functional domain analysis enhances genotype-phenotype associations in comparative genomic studies of .

Front Microbiol. 2025 Aug 6;16:1569118. doi: 10.3389/fmicb.2025.1569118. eCollection 2025.

The cycle gene is essential for both daily responses and seasonal reproduction in the Northern house mosquito, Culex pipiens.

Sci Rep. 2025 Aug 2;15(1):28279. doi: 10.1038/s41598-025-06637-y.

Insects in agricultural greenhouses: a metagenomic analysis of microbes in infesting tomato and cucumber crops.

Front Plant Sci. 2025 May 19;16:1581707. doi: 10.3389/fpls.2025.1581707. eCollection 2025.

Structural assembly of the PAS domain drives the catalytic activation of metazoan PASK.

Proc Natl Acad Sci U S A. 2025 Mar 25;122(12):e2409685122. doi: 10.1073/pnas.2409685122. Epub 2025 Mar 19.

Diverse non-canonical electron bifurcating [FeFe]-hydrogenases of separate evolutionary origins in .

mSystems. 2024 Sep 17;9(9):e0099924. doi: 10.1128/msystems.00999-24. Epub 2024 Aug 27.

Nutrient Signaling-Dependent Quaternary Structure Remodeling Drives the Catalytic Activation of metazoan PASK.

bioRxiv. 2024 Jun 28:2024.06.28.599394. doi: 10.1101/2024.06.28.599394.

A Model of the Full-Length Cytokinin Receptor: New Insights and Prospects.

Int J Mol Sci. 2023 Dec 20;25(1):73. doi: 10.3390/ijms25010073.

Dimerization Rules of Mammalian PAS Proteins.

J Mol Biol. 2024 Feb 1;436(3):168406. doi: 10.1016/j.jmb.2023.168406. Epub 2023 Dec 16.

The GGDEF protein Dgc2 suppresses both motility and biofilm formation in the filamentous cyanobacterium .

Microbiol Spectr. 2023 Sep 1;11(5):e0483722. doi: 10.1128/spectrum.04837-22.

Genomic insights into the c-di-GMP signaling and biofilm development in the saprophytic spirochete Leptospira biflexa.

Arch Microbiol. 2023 Apr 8;205(5):180. doi: 10.1007/s00203-023-03519-7.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

PAS结构域。基于结构预测对PAS结构域的重新定义。

The PAS fold. A redefinition of the PAS domain based upon structural prediction.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献