• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用结构探索远程同源物序列比对空间。

Using structure to explore the sequence alignment space of remote homologs.

机构信息

Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Columbia University, New York, New York, United States of America.

出版信息

PLoS Comput Biol. 2011 Oct;7(10):e1002175. doi: 10.1371/journal.pcbi.1002175. Epub 2011 Oct 6.

DOI:10.1371/journal.pcbi.1002175
PMID:21998567
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3188491/
Abstract

Protein structure modeling by homology requires an accurate sequence alignment between the query protein and its structural template. However, sequence alignment methods based on dynamic programming (DP) are typically unable to generate accurate alignments for remote sequence homologs, thus limiting the applicability of modeling methods. A central problem is that the alignment that is "optimal" in terms of the DP score does not necessarily correspond to the alignment that produces the most accurate structural model. That is, the correct alignment based on structural superposition will generally have a lower score than the optimal alignment obtained from sequence. Variations of the DP algorithm have been developed that generate alternative alignments that are "suboptimal" in terms of the DP score, but these still encounter difficulties in detecting the correct structural alignment. We present here a new alternative sequence alignment method that relies heavily on the structure of the template. By initially aligning the query sequence to individual fragments in secondary structure elements and combining high-scoring fragments that pass basic tests for "modelability", we can generate accurate alignments within a small ensemble. Our results suggest that the set of sequences that can currently be modeled by homology can be greatly extended.

摘要

同源蛋白结构建模需要在查询蛋白与其结构模板之间进行精确的序列比对。然而,基于动态规划 (DP) 的序列比对方法通常无法为远程序列同源物生成准确的比对,从而限制了建模方法的适用性。一个核心问题是,根据 DP 得分“最优”的比对并不一定对应于生成最准确结构模型的比对。也就是说,基于结构叠加的正确比对通常比从序列获得的最优比对得分低。已经开发了 DP 算法的变体,这些变体生成了在 DP 得分方面“次优”的替代比对,但在检测正确的结构比对时仍然存在困难。我们在这里提出了一种新的替代序列比对方法,该方法严重依赖于模板的结构。通过最初将查询序列与二级结构元素中的各个片段对齐,并组合通过“可建模性”基本测试的高得分片段,我们可以在小集合内生成准确的比对。我们的结果表明,目前可以通过同源建模的序列集可以大大扩展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3188491/1c26ca383b4a/pcbi.1002175.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3188491/059473fd7903/pcbi.1002175.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3188491/fbcdfcb5e323/pcbi.1002175.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3188491/9050d56c407c/pcbi.1002175.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3188491/b8eebdcffc05/pcbi.1002175.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3188491/032b84620a75/pcbi.1002175.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3188491/1c26ca383b4a/pcbi.1002175.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3188491/059473fd7903/pcbi.1002175.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3188491/fbcdfcb5e323/pcbi.1002175.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3188491/9050d56c407c/pcbi.1002175.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3188491/b8eebdcffc05/pcbi.1002175.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3188491/032b84620a75/pcbi.1002175.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3188491/1c26ca383b4a/pcbi.1002175.g006.jpg

相似文献

1
Using structure to explore the sequence alignment space of remote homologs.利用结构探索远程同源物序列比对空间。
PLoS Comput Biol. 2011 Oct;7(10):e1002175. doi: 10.1371/journal.pcbi.1002175. Epub 2011 Oct 6.
2
Iterative sequence/secondary structure search for protein homologs: comparison with amino acid sequence alignments and application to fold recognition in genome databases.用于蛋白质同源物的迭代序列/二级结构搜索:与氨基酸序列比对的比较及在基因组数据库中折叠识别的应用
Bioinformatics. 2000 Nov;16(11):988-1002. doi: 10.1093/bioinformatics/16.11.988.
3
Optimal pairwise alignment of fixed protein structures in subquadratic time.在亚二次时间内对固定蛋白质结构进行最优成对比对。
J Bioinform Comput Biol. 2011 Jun;9(3):367-82. doi: 10.1142/s0219720011005562.
4
ProALIGN: Directly Learning Alignments for Protein Structure Prediction via Exploiting Context-Specific Alignment Motifs.ProALIGN:通过利用上下文特定的对齐基序直接学习蛋白质结构预测的对齐。
J Comput Biol. 2022 Feb;29(2):92-105. doi: 10.1089/cmb.2021.0430. Epub 2022 Jan 21.
5
On the accuracy of homology modeling and sequence alignment methods applied to membrane proteins.关于应用于膜蛋白的同源建模和序列比对方法的准确性
Biophys J. 2006 Jul 15;91(2):508-17. doi: 10.1529/biophysj.106.082313. Epub 2006 Apr 28.
6
CAALIGN: a program for pairwise and multiple protein-structure alignment.CAALIGN:一个用于蛋白质结构两两比对和多序列比对的程序。
Acta Crystallogr D Biol Crystallogr. 2007 Apr;63(Pt 4):514-25. doi: 10.1107/S0907444907000844. Epub 2007 Mar 16.
7
Fr-TM-align: a new protein structural alignment method based on fragment alignments and the TM-score.Fr-TM-align:一种基于片段比对和TM分数的新型蛋白质结构比对方法。
BMC Bioinformatics. 2008 Dec 12;9:531. doi: 10.1186/1471-2105-9-531.
8
Detecting distant-homology protein structures by aligning deep neural-network based contact maps.通过对齐基于深度神经网络的接触图来检测远程同源蛋白结构。
PLoS Comput Biol. 2019 Oct 17;15(10):e1007411. doi: 10.1371/journal.pcbi.1007411. eCollection 2019 Oct.
9
Overcoming sequence misalignments with weighted structural superposition.利用加权结构叠加克服序列不对齐。
Proteins. 2012 Nov;80(11):2523-35. doi: 10.1002/prot.24134. Epub 2012 Jul 28.
10
Homology-based modeling of 3D structures of protein-protein complexes using alignments of modified sequence profiles.利用修饰序列谱比对进行蛋白质-蛋白质复合物三维结构的基于同源性的建模。
Int J Biol Macromol. 2008 Aug 15;43(2):198-208. doi: 10.1016/j.ijbiomac.2008.05.004. Epub 2008 May 21.

引用本文的文献

1
Discovering the Ultimate Limits of Protein Secondary Structure Prediction.揭示蛋白质二级结构预测的极限。
Biomolecules. 2021 Nov 3;11(11):1627. doi: 10.3390/biom11111627.
2
p21-activated Kinases (PAKs) Mediate the Phosphorylation of PREX2 Protein to Initiate Feedback Inhibition of Rac1 GTPase.p21激活激酶(PAKs)介导PREX2蛋白的磷酸化,从而启动对Rac1 GTP酶的反馈抑制。
J Biol Chem. 2015 Nov 27;290(48):28915-31. doi: 10.1074/jbc.M115.668244. Epub 2015 Oct 5.
3
How a spatial arrangement of secondary structure elements is dispersed in the universe of protein folds.

本文引用的文献

1
High-throughput computational structure-based characterization of protein families: START domains and implications for structural genomics.基于结构的蛋白质家族高通量计算表征:START结构域及其对结构基因组学的意义。
J Struct Funct Genomics. 2010 Mar;11(1):51-9. doi: 10.1007/s10969-010-9086-7. Epub 2010 Apr 11.
2
Structural relationships among proteins with different global topologies and their implications for function annotation strategies.具有不同整体拓扑结构的蛋白质之间的结构关系及其对功能注释策略的影响。
Proc Natl Acad Sci U S A. 2009 Oct 13;106(41):17377-82. doi: 10.1073/pnas.0907971106. Epub 2009 Sep 24.
3
二级结构元件的空间排列是如何在蛋白质折叠的宇宙中分布的。
PLoS One. 2014 Sep 22;9(9):e107959. doi: 10.1371/journal.pone.0107959. eCollection 2014.
4
Protein sequence alignment analysis by local covariation: coevolution statistics detect benchmark alignment errors.通过局部共变进行蛋白质序列比对分析:协同进化统计检测基准比对错误。
PLoS One. 2012;7(6):e37645. doi: 10.1371/journal.pone.0037645. Epub 2012 Jun 8.
Estimating quality of template-based protein models by alignment stability.
通过比对稳定性评估基于模板的蛋白质模型的质量。
Proteins. 2008 May 15;71(3):1255-74. doi: 10.1002/prot.21819.
4
Critical assessment of methods of protein structure prediction-Round VII.蛋白质结构预测方法的批判性评估——第七轮。
Proteins. 2007;69 Suppl 8(S8):3-9. doi: 10.1002/prot.21767.
5
Fold recognition by concurrent use of solvent accessibility and residue depth.通过同时使用溶剂可及性和残基深度进行折叠识别。
Proteins. 2007 Aug 15;68(3):636-45. doi: 10.1002/prot.21459.
6
Strategies for high-throughput comparative modeling: applications to leverage analysis in structural genomics and protein family organization.高通量比较建模策略:在结构基因组学和蛋白质家族组织中用于杠杆分析的应用
Proteins. 2007 Mar 1;66(4):766-77. doi: 10.1002/prot.21191.
7
Homology modeling using parametric alignment ensemble generation with consensus and energy-based model selection.使用具有一致性和基于能量的模型选择的参数比对集成生成进行同源建模。
Nucleic Acids Res. 2006;34(17):e112. doi: 10.1093/nar/gkl480. Epub 2006 Sep 13.
8
Sequence alignments in the neighborhood of the optimum with general application to dynamic programming.最优邻域序列比对及其在动态规划中的广泛应用。
Proc Natl Acad Sci U S A. 1983 May;80(10):3123-4. doi: 10.1073/pnas.80.10.3123.
9
On the origin and highly likely completeness of single-domain protein structures.关于单结构域蛋白质结构的起源及极有可能的完整性
Proc Natl Acad Sci U S A. 2006 Feb 21;103(8):2605-10. doi: 10.1073/pnas.0509379103. Epub 2006 Feb 14.
10
Variable gap penalty for protein sequence-structure alignment.蛋白质序列-结构比对中的可变空位罚分。
Protein Eng Des Sel. 2006 Mar;19(3):129-33. doi: 10.1093/protein/gzj005. Epub 2006 Jan 19.