利用序列衍生预测进行蛋白质折叠识别。

Protein fold recognition using sequence-derived predictions.

作者信息

Fischer D, Eisenberg D

机构信息

UCLA-DOE Laboratory of Structural Biology & Molecular Medicine, Molecular Biology Institute 90095-1570, USA.

出版信息

Protein Sci. 1996 May;5(5):947-55. doi: 10.1002/pro.5560050516.

DOI:10.1002/pro.5560050516

PMID:8732766

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2143416/

Abstract

In protein fold recognition, one assigns a probe amino acid sequence of unknown structure to one of a library of target 3D structures. Correct assignment depends on effective scoring of the probe sequence for its compatibility with each of the target structures. Here we show that, in addition to the amino acid sequence of the probe, sequence-derived properties of the probe sequence (such as the predicted secondary structure) are useful in fold assignment. The additional measure of compatibility between probe and target is the level of agreement between the predicted secondary structure of the probe and the known secondary structure of the target fold. That is, we recommend a sequence-structure compatibility function that combines previously developed compatibility functions (such as the 3D-1D scores of Bowie et al. [1991] or sequence-sequence replacement tables) with the predicted secondary structure of the probe sequence. The effect on fold assignment of adding predicted secondary structure is evaluated here by using a benchmark set of proteins (Fischer et al., 1996a). The 3D structures of the probe sequences of the benchmark are actually known, but are ignored by our method. The results show that the inclusion of the predicted secondary structure improves fold assignment by about 25%. The results also show that, if the true secondary structure of the probe were known, correct fold assignment would increase by an additional 8-32%. We conclude that incorporating sequence-derived predictions significantly improves assignment of sequences to known 3D folds. Finally, we apply the new method to assign folds to sequences in the SWISSPROT database; six fold assignments are given that are not detectable by standard sequence-sequence comparison methods; for two of these, the fold is known from X-ray crystallography and the fold assignment is correct.

摘要

在蛋白质折叠识别中，要将一个未知结构的探测氨基酸序列与一个目标三维结构库中的某一个进行匹配。正确的匹配取决于对探测序列与每个目标结构兼容性的有效评分。在此我们表明，除了探测序列的氨基酸序列外，探测序列的衍生性质（如预测的二级结构）在折叠匹配中也很有用。探测序列与目标结构之间兼容性的额外衡量标准是探测序列预测的二级结构与目标折叠已知二级结构之间的一致程度。也就是说，我们推荐一种序列 - 结构兼容性函数，该函数将先前开发的兼容性函数（如Bowie等人[1991]的三维 - 一维评分或序列 - 序列替换表）与探测序列的预测二级结构相结合。这里通过使用一组蛋白质基准集（Fischer等人，1996a）来评估添加预测二级结构对折叠匹配的影响。基准集中探测序列的三维结构实际上是已知的，但我们的方法忽略了它们。结果表明，纳入预测的二级结构可使折叠匹配的准确率提高约25%。结果还表明，如果探测序列的真实二级结构已知，正确的折叠匹配率将额外提高8 - 32%。我们得出结论，纳入序列衍生预测能显著提高将序列匹配到已知三维折叠的准确率。最后，我们将新方法应用于SWISSPROT数据库中序列的折叠匹配；给出了六个标准序列 - 序列比较方法无法检测到的折叠匹配；其中两个的折叠结构通过X射线晶体学已知且折叠匹配正确。

相似文献

Protein fold recognition using sequence-derived predictions.

Protein Sci. 1996 May;5(5):947-55. doi: 10.1002/pro.5560050516.

A 3D-1D substitution matrix for protein fold recognition that includes predicted secondary structure of the sequence.

J Mol Biol. 1997 Apr 11;267(4):1026-38. doi: 10.1006/jmbi.1997.0924.

Protein structure prediction by threading methods: evaluation of current techniques.

Proteins. 1995 Nov;23(3):337-55. doi: 10.1002/prot.340230308.

Hidden Markov models that use predicted secondary structures for fold recognition.

Proteins. 1999 Jul 1;36(1):68-76.

Protein fold recognition by prediction-based threading.

J Mol Biol. 1997 Jul 18;270(3):471-80. doi: 10.1006/jmbi.1997.1101.

Protein fold recognition by mapping predicted secondary structures.

J Mol Biol. 1996 Jun 14;259(3):349-65. doi: 10.1006/jmbi.1996.0325.

Assessment of a protein fold recognition method that takes into account four physicochemical properties: side-chain packing, solvation, hydrogen-bonding, and local conformation.

Proteins. 1995 Nov;23(3):370-5. doi: 10.1002/prot.340230310.

[A turning point in the knowledge of the structure-function-activity relations of elastin].

J Soc Biol. 2001;195(2):181-93.

Assessment of protein fold predictions from sequence information: the predicted alpha/beta doubly wound fold of the von Willebrand factor type A domain is similar to its crystal structure.

J Mol Biol. 1996 Jul 12;260(2):277-85. doi: 10.1006/jmbi.1996.0398.

Improving fold recognition without folds.

J Mol Biol. 2004 Jul 30;341(1):255-69. doi: 10.1016/j.jmb.2004.05.041.

引用本文的文献

Prediction of protein secondary structure based on an improved channel attention and multiscale convolution module.

Front Bioeng Biotechnol. 2022 Jul 22;10:901018. doi: 10.3389/fbioe.2022.901018. eCollection 2022.

Determination of genetic effects and functional SNPs of bovine HTR1B gene on milk fatty acid traits.

BMC Genomics. 2021 Jul 27;22(1):575. doi: 10.1186/s12864-021-07893-8.

Multifaceted analysis of training and testing convolutional neural networks for protein secondary structure prediction.

PLoS One. 2020 May 6;15(5):e0232528. doi: 10.1371/journal.pone.0232528. eCollection 2020.

Characterization of structural and functional role of selenocysteine in selenoprotein H and its impact on DNA binding.

Amino Acids. 2018 May;50(5):593-607. doi: 10.1007/s00726-018-2543-5. Epub 2018 Feb 26.

Sixty-five years of the long march in protein secondary structure prediction: the final stretch?

Brief Bioinform. 2018 May 1;19(3):482-494. doi: 10.1093/bib/bbw129.

TMFoldRec: a statistical potential-based transmembrane protein fold recognition tool.

BMC Bioinformatics. 2015 Jun 30;16:201. doi: 10.1186/s12859-015-0638-5.

Application of data mining tools for classification of protein structural class from residue based averaged NMR chemical shifts.

Biochim Biophys Acta. 2015 Oct;1854(10 Pt A):1545-52. doi: 10.1016/j.bbapap.2015.02.016. Epub 2015 Mar 7.

PSS-3D1D: an improved 3D1D profile method of protein fold recognition for the annotation of twilight zone sequences.

J Struct Funct Genomics. 2011 Dec;12(4):181-9. doi: 10.1007/s10969-011-9119-x. Epub 2011 Dec 3.

Template-based protein structure modeling using TASSER(VMT.).

Proteins. 2012 Feb;80(2):352-61. doi: 10.1002/prot.23183. Epub 2011 Nov 22.

Including Functional Annotations and Extending the Collection of Structural Classifications of Protein Loops (ArchDB).

Bioinform Biol Insights. 2009 Nov 24;1:77-90.

本文引用的文献

A 3D sequence-independent representation of the protein data bank.

Protein Eng. 1995 Oct;8(10):981-97. doi: 10.1093/protein/8.10.981.

Protein structure prediction by threading methods: evaluation of current techniques.

Proteins. 1995 Nov;23(3):337-55. doi: 10.1002/prot.340230308.

Assigning amino acid sequences to 3-dimensional protein folds.

FASEB J. 1996 Jan;10(1):126-36. doi: 10.1096/fasebj.10.1.8566533.

An empirical energy function for threading protein sequence through the folding motif.

Proteins. 1993 May;16(1):92-112. doi: 10.1002/prot.340160110.

Three-dimensional profiles from residue-pair preferences: identification of sequences with beta/alpha-barrel fold.

Proc Natl Acad Sci U S A. 1993 Feb 15;90(4):1379-83. doi: 10.1073/pnas.90.4.1379.

Prediction of protein structure by evaluation of sequence-structure fitness. Aligning sequences to contact profiles derived from three-dimensional structures.

J Mol Biol. 1993 Aug 5;232(3):805-25. doi: 10.1006/jmbi.1993.1433.

Prediction of protein secondary structure at better than 70% accuracy.

J Mol Biol. 1993 Jul 20;232(2):584-99. doi: 10.1006/jmbi.1993.1413.

Protein fold recognition.

J Comput Aided Mol Des. 1993 Aug;7(4):439-56. doi: 10.1007/BF02337560.

Recognition of related proteins by iterative template refinement (ITR).

Protein Sci. 1994 Aug;3(8):1315-28. doi: 10.1002/pro.5560030818.

Crystal structure of AmiC: the controller of transcription antitermination in the amidase operon of Pseudomonas aeruginosa.

EMBO J. 1994 Dec 15;13(24):5810-7. doi: 10.1002/j.1460-2075.1994.tb06924.x.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用序列衍生预测进行蛋白质折叠识别。

Protein fold recognition using sequence-derived predictions.

作者信息

Fischer D, Eisenberg D

机构信息

UCLA-DOE Laboratory of Structural Biology & Molecular Medicine, Molecular Biology Institute 90095-1570, USA.

出版信息

Protein Sci. 1996 May;5(5):947-55. doi: 10.1002/pro.5560050516.

DOI:10.1002/pro.5560050516

PMID:8732766

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2143416/

Abstract

摘要

利用序列衍生预测进行蛋白质折叠识别。

Protein fold recognition using sequence-derived predictions.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

利用序列衍生预测进行蛋白质折叠识别。

Protein fold recognition using sequence-derived predictions.

作者信息

机构信息

出版信息