基于共进化的结构预测拓展同源检测的视野。

Extending the Horizon of Homology Detection with Coevolution-based Structure Prediction.

机构信息

Medical Research Council Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh EH4 2XU, UK.

出版信息

J Mol Biol. 2021 Oct 1;433(20):167106. doi: 10.1016/j.jmb.2021.167106. Epub 2021 Jun 15.

DOI:10.1016/j.jmb.2021.167106

PMID:34139218

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8527833/

Abstract

Traditional sequence analysis algorithms fail to identify distant homologies when they lie beyond a detection horizon. In this review, we discuss how co-evolution-based contact and distance prediction methods are pushing back this homology detection horizon, thereby yielding new functional insights and experimentally testable hypotheses. Based on correlated substitutions, these methods divine three-dimensional constraints among amino acids in protein sequences that were previously devoid of all annotated domains and repeats. The new algorithms discern hidden structure in an otherwise featureless sequence landscape. Their revelatory impact promises to be as profound as the use, by archaeologists, of ground-penetrating radar to discern long-hidden, subterranean structures. As examples of this, we describe how triplicated structures reflecting longin domains in MON1A-like proteins, or UVR-like repeats in DISC1, emerge from their predicted contact and distance maps. These methods also help to resolve structures that do not conform to a "beads-on-a-string" model of protein domains. In one such example, we describe CFAP298 whose ubiquitin-like domain was previously challenging to perceive owing to a large sequence insertion within it. More generally, the new algorithms permit an easier appreciation of domain families and folds whose evolution involved structural insertion or rearrangement. As we exemplify with α1-antitrypsin, coevolution-based predicted contacts may also yield insights into protein dynamics and conformational change. This new combination of structure prediction (using innovative co-evolution based methods) and homology inference (using more traditional sequence analysis approaches) shows great promise for bringing into view a sea of evolutionary relationships that had hitherto lain far beyond the horizon of homology detection.

摘要

传统的序列分析算法在探测范围之外无法识别遥远的同源性。在这篇综述中，我们讨论了基于共进化的接触和距离预测方法如何将同源性检测的范围推回，从而产生新的功能见解和可实验验证的假设。基于相关替换，这些方法推断出蛋白质序列中氨基酸之间的三维约束，而这些氨基酸以前没有被注释为所有结构域和重复序列。这些新算法可以在原本没有任何特征的序列景观中发现隐藏的结构。它们的启示性影响有望与考古学家使用地面穿透雷达来识别长期隐藏的地下结构一样深远。作为这方面的例子，我们描述了 MON1A 样蛋白中的长因域反映出的三倍体结构，或 DISC1 中的 UVR 样重复结构如何从它们的预测接触和距离图中出现。这些方法还有助于解决不符合蛋白质结构域“串珠式”模型的结构。在一个这样的例子中，我们描述了 CFAP298，其泛素样结构域以前由于其内部的一个大序列插入而难以感知。更一般地说，新的算法允许更容易理解家族和折叠的结构域，其进化涉及结构插入或重排。正如我们用α1-抗胰蛋白酶所举例的那样，基于共进化的预测接触也可能提供关于蛋白质动力学和构象变化的见解。这种结构预测（使用创新的基于共进化的方法）和同源性推断（使用更传统的序列分析方法）的新组合为我们带来了更广阔的进化关系视角，这些关系以前远在同源性检测范围之外。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b806/8527833/7c5d281100f7/ga1.jpg

相似文献

Extending the Horizon of Homology Detection with Coevolution-based Structure Prediction.

J Mol Biol. 2021 Oct 1;433(20):167106. doi: 10.1016/j.jmb.2021.167106. Epub 2021 Jun 15.

High-throughput 3D structural homology detection via NMR resonance assignment.

Proc IEEE Comput Syst Bioinform Conf. 2004:278-89.

Filling-in void and sparse regions in protein sequence space by protein-like artificial sequences enables remarkable enhancement in remote homology detection capability.

J Mol Biol. 2014 Feb 20;426(4):962-79. doi: 10.1016/j.jmb.2013.11.026. Epub 2013 Dec 4.

Prediction of protein structural classes for low-homology sequences based on predicted secondary structure.

BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S9. doi: 10.1186/1471-2105-11-S1-S9.

Protein contact prediction by integrating deep multiple sequence alignments, coevolution and machine learning.

Proteins. 2018 Mar;86 Suppl 1(Suppl 1):84-96. doi: 10.1002/prot.25405. Epub 2017 Oct 31.

Detecting distant-homology protein structures by aligning deep neural-network based contact maps.

PLoS Comput Biol. 2019 Oct 17;15(10):e1007411. doi: 10.1371/journal.pcbi.1007411. eCollection 2019 Oct.

Rapid and enhanced remote homology detection by cascading hidden Markov model searches in sequence space.

Bioinformatics. 2016 Feb 1;32(3):338-44. doi: 10.1093/bioinformatics/btv538. Epub 2015 Oct 10.

Combining evolutionary information extracted from frequency profiles with sequence-based kernels for protein remote homology detection.

Bioinformatics. 2014 Feb 15;30(4):472-9. doi: 10.1093/bioinformatics/btt709. Epub 2013 Dec 5.

Exploring dynamics of protein structure determination and homology-based prediction to estimate the number of superfamilies and folds.

BMC Struct Biol. 2006 Mar 20;6:6. doi: 10.1186/1472-6807-6-6.

Detection of unrelated proteins in sequences multiple alignments by using predicted secondary structures.

Bioinformatics. 2003 Mar 1;19(4):506-12. doi: 10.1093/bioinformatics/btg016.

引用本文的文献

FBB18 is a ubiquitin-like protein essential for the cytoplasmic preassembly of various ciliary dyneins.

Proc Natl Acad Sci U S A. 2025 Mar 25;122(12):e2423948122. doi: 10.1073/pnas.2423948122. Epub 2025 Mar 19.

Genome-Wide Analysis of Proteases and Protease Inhibitors Using Advanced Informatics Provides Insights into Parasite Biology and Host-Parasite Interactions.

Int J Mol Sci. 2023 Aug 1;24(15):12320. doi: 10.3390/ijms241512320.

OAF: a new member of the BRICHOS family.

Bioinform Adv. 2022 Nov 24;2(1):vbac087. doi: 10.1093/bioadv/vbac087. eCollection 2022.

MES-3 is a highly divergent ortholog of the canonical PRC2 component SUZ12.

iScience. 2022 Jun 17;25(7):104633. doi: 10.1016/j.isci.2022.104633. eCollection 2022 Jul 15.

Collective Variable for Metadynamics Derived From AlphaFold Output.

Front Mol Biosci. 2022 Jun 13;9:878133. doi: 10.3389/fmolb.2022.878133. eCollection 2022.

本文引用的文献

The orthopedic characterization of cfap298 mutants validate zebrafish to faithfully model human AIS.

Sci Rep. 2021 Apr 1;11(1):7392. doi: 10.1038/s41598-021-86856-1.

Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks.

PLoS Comput Biol. 2021 Mar 26;17(3):e1008865. doi: 10.1371/journal.pcbi.1008865. eCollection 2021 Mar.

Protein sequence design by conformational landscape optimization.

Proc Natl Acad Sci U S A. 2021 Mar 16;118(11). doi: 10.1073/pnas.2017228118.

Protein domain identification methods and online resources.

Comput Struct Biotechnol J. 2021 Feb 2;19:1145-1153. doi: 10.1016/j.csbj.2021.01.041. eCollection 2021.

Large-scale discovery of protein interactions at residue resolution using co-evolution calculated from genomic sequences.

Nat Commun. 2021 Mar 2;12(1):1396. doi: 10.1038/s41467-021-21636-z.

Multiple Sequence Alignment Computation Using the T-Coffee Regressive Algorithm Implementation.

Methods Mol Biol. 2021;2231:89-97. doi: 10.1007/978-1-0716-1036-7_6.

GENCODE 2021.

Nucleic Acids Res. 2021 Jan 8;49(D1):D916-D923. doi: 10.1093/nar/gkaa1087.

'It will change everything': DeepMind's AI makes gigantic leap in solving protein structures.

Nature. 2020 Dec;588(7837):203-204. doi: 10.1038/d41586-020-03348-4.

RepeatsDB in 2021: improved data and extended classification for protein tandem repeat structures.

Nucleic Acids Res. 2021 Jan 8;49(D1):D452-D457. doi: 10.1093/nar/gkaa1097.

The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets.

Nucleic Acids Res. 2021 Jan 8;49(D1):D605-D612. doi: 10.1093/nar/gkaa1074.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于共进化的结构预测拓展同源检测的视野。

Extending the Horizon of Homology Detection with Coevolution-based Structure Prediction.

机构信息

Medical Research Council Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh EH4 2XU, UK.

出版信息

J Mol Biol. 2021 Oct 1;433(20):167106. doi: 10.1016/j.jmb.2021.167106. Epub 2021 Jun 15.

DOI:10.1016/j.jmb.2021.167106

PMID:34139218

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8527833/

Abstract

摘要

基于共进化的结构预测拓展同源检测的视野。

Extending the Horizon of Homology Detection with Coevolution-based Structure Prediction.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

基于共进化的结构预测拓展同源检测的视野。

Extending the Horizon of Homology Detection with Coevolution-based Structure Prediction.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献