Sequence-Based Prediction of Metamorphic Behavior in Proteins.

Author affiliations

Department of Chemistry, University of California, Davis, California.

School of Natural Sciences, University of California, Merced, California.

Publication information

Biophys J. 2020 Oct 6;119(7):1380-1390. doi: 10.1016/j.bpj.2020.07.034. Epub 2020 Aug 14.

DOI:10.1016/j.bpj.2020.07.034

PMID:32937108

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7567988/

Abstract

An increasing number of proteins have been demonstrated in recent years to adopt multiple three-dimensional folds with different functions. These metamorphic proteins are characterized by having two or more folds with significant differences in their secondary structure, in which each fold is stabilized by a distinct local environment. So far, ∼90 metamorphic proteins have been identified in the Protein Data Bank, but we and others hypothesize that a far greater number of metamorphic proteins remain undiscovered. In this work, we introduce a computational model to predict metamorphic behavior in proteins using only knowledge of the sequence. In this model, secondary structure prediction programs are used to calculate diversity indices, which are measures of uncertainty in predicted secondary structure at each position in the sequence; these are then used to assign protein sequences as likely to be metamorphic versus monomorphic (i.e., having just one fold). We constructed a reference data set to train our classification method, which includes a novel compilation of 136 likely monomorphic proteins and a set of 201 metamorphic protein structures taken from the literature. Our model is able to classify proteins as metamorphic versus monomorphic with a Matthews correlation coefficient of ∼0.36 and true positive/true negative rates of ∼65%/80%, suggesting that it is possible to predict metamorphic behavior in proteins using only sequence information.
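As a rough illustration of the two quantitative ideas in the abstract, the sketch below computes a per-residue diversity index and the evaluation metrics. It rests on several assumptions that are not taken from the paper: the diversity index is taken to be the Shannon entropy of three-state (helix/strand/coil) probabilities, the classifier is a simple threshold on the mean index, and the threshold value, the function names, and the toy inputs are all hypothetical. The paper's actual index definition, features, and classifier may differ.

```python
import math

def shannon_diversity(probs):
    """Shannon entropy (in nats) of one residue's predicted secondary
    structure probabilities. ASSUMPTION: entropy stands in for the
    paper's diversity index, whose exact definition is not given here."""
    total = sum(probs)
    return -sum((p / total) * math.log(p / total) for p in probs if p > 0)

def per_residue_diversity(profile):
    """Diversity index at each sequence position.
    `profile` is a list of [P(helix), P(strand), P(coil)] rows."""
    return [shannon_diversity(row) for row in profile]

def classify(profile, threshold=0.5):
    """Toy decision rule (hypothetical): call a sequence metamorphic when
    its mean per-residue diversity exceeds an arbitrary threshold."""
    scores = per_residue_diversity(profile)
    mean_score = sum(scores) / len(scores)
    return "metamorphic" if mean_score > threshold else "monomorphic"

def rates_and_mcc(tp, fn, tn, fp):
    """True positive rate, true negative rate, and Matthews correlation
    coefficient from the four confusion matrix counts."""
    tpr = tp / (tp + fn)
    tnr = tn / (tn + fp)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return tpr, tnr, mcc

# Made-up prediction profiles: one confident, one ambiguous.
confident = [[0.90, 0.05, 0.05]] * 4
ambiguous = [[0.40, 0.40, 0.20]] * 4
print(classify(confident), classify(ambiguous))

# Made-up confusion matrix counts, not the paper's results.
print(rates_and_mcc(tp=13, fn=7, tn=16, fp=4))
```

The entropy is zero when a predictor assigns all probability to one state and reaches ln 3 when the three states are equally likely, so higher values mark positions where the predicted secondary structure is uncertain. The Matthews correlation coefficient is reported alongside the true positive and true negative rates because it stays informative when the two classes differ in size, as with the 201 metamorphic and 136 monomorphic entries in the reference set.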

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcd7/7567988/2f4cc339bb3a/gr1.jpg
