• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质序列比较的局限性?

The limits of protein sequence comparison?

作者信息

Pearson William R, Sierk Michael L

机构信息

Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA 22908, USA.

出版信息

Curr Opin Struct Biol. 2005 Jun;15(3):254-60. doi: 10.1016/j.sbi.2005.05.005.

DOI:10.1016/j.sbi.2005.05.005
PMID:15919194
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2845305/
Abstract

Modern sequence alignment algorithms are used routinely to identify homologous proteins, proteins that share a common ancestor. Homologous proteins always share similar structures and often have similar functions. Over the past 20 years, sequence comparison has become both more sensitive, largely because of profile-based methods, and more reliable, because of more accurate statistical estimates. As sequence and structure databases become larger, and comparison methods become more powerful, reliable statistical estimates will become even more important for distinguishing similarities that are due to homology from those that are due to analogy (convergence). The newest sequence alignment methods are more sensitive than older methods, but more accurate statistical estimates are needed for their full power to be realized.

摘要

现代序列比对算法经常被用于识别同源蛋白质,即拥有共同祖先的蛋白质。同源蛋白质总是具有相似的结构,并且常常具有相似的功能。在过去20年里,序列比较在很大程度上由于基于轮廓的方法而变得更加灵敏,又由于更精确的统计估计而变得更加可靠。随着序列和结构数据库不断增大,以及比较方法变得更加强大,可靠的统计估计对于区分同源相似性和类比(趋同)相似性将变得愈发重要。最新的序列比对方法比旧方法更加灵敏,但要充分发挥其强大功能还需要更精确的统计估计。

相似文献

1
The limits of protein sequence comparison?蛋白质序列比较的局限性?
Curr Opin Struct Biol. 2005 Jun;15(3):254-60. doi: 10.1016/j.sbi.2005.05.005.
2
Functional proteomics with biolinguistic methods. n-grams deliver sensitive portrayals of gene similarity.基于生物语言学方法的功能蛋白质组学。词元提供了对基因相似性的敏感描述。
IEEE Eng Med Biol Mag. 2005 May-Jun;24(3):73-80. doi: 10.1109/memb.2005.1436463.
3
Predicting secondary structures of proteins. Recognizing properties of amino acids with the logical analysis of data algorithm.预测蛋白质的二级结构。通过数据算法的逻辑分析识别氨基酸的特性。
IEEE Eng Med Biol Mag. 2005 May-Jun;24(3):88-94. doi: 10.1109/memb.2005.1436465.
4
Detection of unrelated proteins in sequences multiple alignments by using predicted secondary structures.利用预测的二级结构在多序列比对中检测不相关蛋白质。
Bioinformatics. 2003 Mar 1;19(4):506-12. doi: 10.1093/bioinformatics/btg016.
5
Pairwise protein structure alignment based on an orientation-independent backbone representation.基于与方向无关的主链表示的成对蛋白质结构比对。
J Bioinform Comput Biol. 2004 Dec;2(4):699-717. doi: 10.1142/s021972000400082x.
6
DOMAC: an accurate, hybrid protein domain prediction server.DOMAC:一个准确的混合蛋白质结构域预测服务器。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W354-6. doi: 10.1093/nar/gkm390. Epub 2007 Jun 6.
7
Protein function from sequence and structure data.基于序列和结构数据的蛋白质功能
Appl Bioinformatics. 2003;2(1):3-12.
8
Non-sequential structure-based alignments reveal topology-independent core packing arrangements in proteins.基于非序列结构的比对揭示了蛋白质中与拓扑结构无关的核心堆积排列。
Bioinformatics. 2005 Apr 1;21(7):1010-9. doi: 10.1093/bioinformatics/bti128. Epub 2004 Nov 5.
9
DBAli tools: mining the protein structure space.DBAli工具:挖掘蛋白质结构空间
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W393-7. doi: 10.1093/nar/gkm236. Epub 2007 May 3.
10
PSI: indexing protein structures for fast similarity search.PSI:为快速相似性搜索对蛋白质结构进行索引。
Bioinformatics. 2003;19 Suppl 1:i81-3. doi: 10.1093/bioinformatics/btg1009.

引用本文的文献

1
Exploring P450 superfamily diversity with P450Atlas - Online tool for automated subfamily assignment.使用P450Atlas探索细胞色素P450超家族多样性——自动亚家族分类在线工具。
Protein Sci. 2025 Mar;34(3):e70057. doi: 10.1002/pro.70057.
2
Insights into virulence: structure classification of the RIMD mobilome.洞悉毒力:RIMD 可移动组学的结构分类。
mSystems. 2023 Dec 21;8(6):e0079623. doi: 10.1128/msystems.00796-23. Epub 2023 Nov 28.
3
PFP-GO: Integrating protein sequence, domain and protein-protein interaction information for protein function prediction using ranked GO terms.

本文引用的文献

1
Maximum-likelihood estimation of the statistical distribution of Smith-Waterman local sequence similarity scores.史密斯-沃特曼局部序列相似性得分统计分布的最大似然估计。
Bull Math Biol. 1992 Jan;54(1):59-75. doi: 10.1007/BF02458620.
2
Comprehensive evaluation of protein structure alignment methods: scoring by geometric measures.蛋白质结构比对方法的综合评估:基于几何度量的评分
J Mol Biol. 2005 Mar 4;346(4):1173-88. doi: 10.1016/j.jmb.2004.12.032. Epub 2005 Jan 16.
3
GenBank.基因银行
PFP-GO:利用排序后的基因本体(GO)术语整合蛋白质序列、结构域和蛋白质-蛋白质相互作用信息以进行蛋白质功能预测。
Front Genet. 2022 Sep 29;13:969915. doi: 10.3389/fgene.2022.969915. eCollection 2022.
4
Co-component signal transduction systems: Fast-evolving virulence regulation cassettes discovered in enteric bacteria.共组件信号转导系统:肠道细菌中发现的快速进化毒力调节盒。
Proc Natl Acad Sci U S A. 2022 Jun 14;119(24):e2203176119. doi: 10.1073/pnas.2203176119. Epub 2022 Jun 1.
5
Extending the Horizon of Homology Detection with Coevolution-based Structure Prediction.基于共进化的结构预测拓展同源检测的视野。
J Mol Biol. 2021 Oct 1;433(20):167106. doi: 10.1016/j.jmb.2021.167106. Epub 2021 Jun 15.
6
Patterns of Sequence and Expression Diversification Associate Members of the Gene Family With Response to Fungal Pathogens.序列和表达多样化模式将基因家族成员与对真菌病原体的反应联系起来。
Front Genet. 2020 May 29;11:491. doi: 10.3389/fgene.2020.00491. eCollection 2020.
7
Graph Theory-Based Sequence Descriptors as Remote Homology Predictors.基于图论的序列描述符可作为远程同源性预测器。
Biomolecules. 2019 Dec 23;10(1):26. doi: 10.3390/biom10010026.
8
What is an archaeon and are the Archaea really unique?古菌是什么,古菌真的独特吗?
PeerJ. 2018 Oct 18;6:e5770. doi: 10.7717/peerj.5770. eCollection 2018.
9
The genomic underpinnings of eukaryotic virus taxonomy: creating a sequence-based framework for family-level virus classification.真核病毒分类学的基因组基础:创建基于序列的病毒科分类框架。
Microbiome. 2018 Feb 20;6(1):38. doi: 10.1186/s40168-018-0422-7.
10
Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification.核酸和氨基酸序列支持基于结构的病毒分类。
J Virol. 2017 Mar 29;91(8). doi: 10.1128/JVI.02275-16. Print 2017 Apr 15.
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D34-8. doi: 10.1093/nar/gki063.
4
Profile-profile methods provide improved fold-recognition: a study of different profile-profile alignment methods.轮廓-轮廓方法可提供改进的折叠识别:不同轮廓-轮廓比对方法的研究
Proteins. 2004 Oct 1;57(1):188-97. doi: 10.1002/prot.20184.
5
Scoring profile-to-profile sequence alignments.对图谱与图谱之间的序列进行比对评分。
Protein Sci. 2004 Jun;13(6):1612-26. doi: 10.1110/ps.03601504.
6
Protein sequence databases.蛋白质序列数据库。
Curr Opin Chem Biol. 2004 Feb;8(1):76-80. doi: 10.1016/j.cbpa.2003.12.004.
7
Sensitivity and selectivity in protein structure comparison.蛋白质结构比较中的敏感性和选择性。
Protein Sci. 2004 Mar;13(3):773-85. doi: 10.1110/ps.03328504.
8
COACH: profile-profile alignment of protein families using hidden Markov models.COACH:使用隐马尔可夫模型对蛋白质家族进行轮廓-轮廓比对。
Bioinformatics. 2004 May 22;20(8):1309-18. doi: 10.1093/bioinformatics/bth091. Epub 2004 Feb 12.
9
A comparison of scoring functions for protein sequence profile alignment.蛋白质序列谱比对评分函数的比较
Bioinformatics. 2004 May 22;20(8):1301-8. doi: 10.1093/bioinformatics/bth090. Epub 2004 Feb 12.
10
Using evolutionary information for the query and target improves fold recognition.将进化信息用于查询序列和目标序列可提高折叠识别率。
Proteins. 2004 Feb 1;54(2):342-50. doi: 10.1002/prot.10565.