• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

高效蛋白质比对算法在蛋白质搜索中的应用。

Efficient protein alignment algorithm for protein search.

机构信息

Department of Computer Science, University of Texas-Pan American, Edinburg, TX 78539, USA.

出版信息

BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S34. doi: 10.1186/1471-2105-11-S1-S34.

DOI:10.1186/1471-2105-11-S1-S34
PMID:20122207
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3009506/
Abstract

BACKGROUND

Proteins show a great variety of 3D conformations, which can be used to infer their evolutionary relationship and to classify them into more general groups; therefore protein structure alignment algorithms are very helpful for protein biologists. However, an accurate alignment algorithm itself may be insufficient for effective discovering of structural relationships among tens of thousands of proteins. Due to the exponentially increasing amount of protein structural data, a fast and accurate structure alignment tool is necessary to access protein classification and protein similarity search; however, the complexity of current alignment algorithms are usually too high to make a fully alignment-based classification and search practical.

RESULTS

We have developed an efficient protein pairwise alignment algorithm and applied it to our protein search tool, which aligns a query protein structure in the pairwise manner with all protein structures in the Protein Data Bank (PDB) to output similar protein structures. The algorithm can align hundreds of pairs of protein structures in one second. Given a protein structure, the tool efficiently discovers similar structures from tens of thousands of structures stored in the PDB always in 2 minutes in a single machine and 20 seconds in our cluster of 6 machines. The algorithm has been fully implemented and is accessible online at our webserver, which is supported by a cluster of computers.

CONCLUSION

Our algorithm can work out hundreds of pairs of protein alignments in one second. Therefore, it is very suitable for protein search. Our experimental results show that it is more accurate than other well known protein search systems in finding proteins which are structurally similar at SCOP family and superfamily levels, and its speed is also competitive with those systems. In terms of the pairwise alignment performance, it is as good as some well known alignment algorithms.

摘要

背景

蛋白质具有多种 3D 构象,可以用于推断它们的进化关系,并将它们分类为更一般的组别;因此,蛋白质结构比对算法对蛋白质生物学家非常有帮助。然而,一个准确的比对算法本身可能不足以有效地发现数以万计的蛋白质之间的结构关系。由于蛋白质结构数据的数量呈指数级增长,因此需要一个快速而准确的结构比对工具来访问蛋白质分类和蛋白质相似性搜索;然而,当前比对算法的复杂性通常过高,无法实现完全基于比对的分类和搜索。

结果

我们开发了一种高效的蛋白质两两比对算法,并将其应用于我们的蛋白质搜索工具中,该工具以两两的方式将查询蛋白质结构与蛋白质数据库(PDB)中的所有蛋白质结构进行比对,以输出相似的蛋白质结构。该算法可以在一秒钟内比对数百对蛋白质结构。给定一个蛋白质结构,该工具可以在一台机器上 2 分钟内,在我们的 6 台机器集群中 20 秒内,从 PDB 中存储的数万结构中高效地发现相似结构。该算法已完全实现,并可在我们的计算机集群支持的在线网络服务器上使用。

结论

我们的算法可以在一秒钟内处理数百对蛋白质的比对。因此,它非常适合蛋白质搜索。我们的实验结果表明,它在寻找在 SCOP 家族和超家族水平上结构相似的蛋白质方面比其他著名的蛋白质搜索系统更准确,并且它的速度也具有竞争力。在两两比对性能方面,它与一些著名的比对算法一样好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d957/3009506/d9cc15fc3762/1471-2105-11-S1-S34-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d957/3009506/466eba571046/1471-2105-11-S1-S34-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d957/3009506/81a6a9a16bdb/1471-2105-11-S1-S34-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d957/3009506/d9cc15fc3762/1471-2105-11-S1-S34-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d957/3009506/466eba571046/1471-2105-11-S1-S34-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d957/3009506/81a6a9a16bdb/1471-2105-11-S1-S34-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d957/3009506/d9cc15fc3762/1471-2105-11-S1-S34-3.jpg

相似文献

1
Efficient protein alignment algorithm for protein search.高效蛋白质比对算法在蛋白质搜索中的应用。
BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S34. doi: 10.1186/1471-2105-11-S1-S34.
2
Search similar protein structures with classification, sequence and 3d alignments.通过分类、序列和三维比对搜索相似的蛋白质结构。
J Bioinform Comput Biol. 2009 Oct;7(5):755-71. doi: 10.1142/s021972000900431x.
3
RCSB protein Data Bank: exploring protein 3D similarities via comprehensive structural alignments.RCSB 蛋白质数据库:通过全面的结构比对探索蛋白质 3D 相似性。
Bioinformatics. 2024 Jun 3;40(6). doi: 10.1093/bioinformatics/btae370.
4
Large-scale comparison of protein sequence alignment algorithms with structure alignments.蛋白质序列比对算法与结构比对的大规模比较。
Proteins. 2000 Jul 1;40(1):6-22. doi: 10.1002/(sici)1097-0134(20000701)40:1<6::aid-prot30>3.0.co;2-7.
5
Automatic classification of protein structures using low-dimensional structure space mappings.利用低维结构空间映射对蛋白质结构进行自动分类。
BMC Bioinformatics. 2014;15 Suppl 2(Suppl 2):S1. doi: 10.1186/1471-2105-15-S2-S1. Epub 2014 Jan 24.
6
mTM-align: an algorithm for fast and accurate multiple protein structure alignment.mTM-align:一种快速准确的多蛋白质结构比对算法。
Bioinformatics. 2018 May 15;34(10):1719-1725. doi: 10.1093/bioinformatics/btx828.
7
CAALIGN: a program for pairwise and multiple protein-structure alignment.CAALIGN:一个用于蛋白质结构两两比对和多序列比对的程序。
Acta Crystallogr D Biol Crystallogr. 2007 Apr;63(Pt 4):514-25. doi: 10.1107/S0907444907000844. Epub 2007 Mar 16.
8
High quality protein sequence alignment by combining structural profile prediction and profile alignment using SABER-TOOTH.使用 SABER-TOOTH 结合结构轮廓预测和轮廓比对进行高质量蛋白质序列比对。
BMC Bioinformatics. 2010 May 14;11:251. doi: 10.1186/1471-2105-11-251.
9
Cross-over between discrete and continuous protein structure space: insights into automatic classification and networks of protein structures.离散与连续蛋白质结构空间之间的交叉:对蛋白质结构自动分类及网络的见解。
PLoS Comput Biol. 2009 Mar;5(3):e1000331. doi: 10.1371/journal.pcbi.1000331. Epub 2009 Mar 27.
10
Fr-TM-align: a new protein structural alignment method based on fragment alignments and the TM-score.Fr-TM-align:一种基于片段比对和TM分数的新型蛋白质结构比对方法。
BMC Bioinformatics. 2008 Dec 12;9:531. doi: 10.1186/1471-2105-9-531.

引用本文的文献

1
Dynamic programming used to align protein structures with a spectrum is robust.动态规划用于将蛋白质结构与谱进行对齐是稳健的。
Biology (Basel). 2013 Nov 20;2(4):1296-310. doi: 10.3390/biology2041296.

本文引用的文献

1
Iterative non-sequential protein structural alignment.迭代非顺序蛋白质结构比对
Comput Syst Bioinformatics Conf. 2008;7:183-94.
2
Searching protein structure databases with DaliLite v.3.使用DaliLite v.3搜索蛋白质结构数据库。
Bioinformatics. 2008 Dec 1;24(23):2780-1. doi: 10.1093/bioinformatics/btn507. Epub 2008 Sep 25.
3
Feedback algorithm and web-server for protein structure alignment.用于蛋白质结构比对的反馈算法与网络服务器。
J Comput Biol. 2008 Jun;15(5):505-24. doi: 10.1089/cmb.2008.0075.
4
Protein structure-structure alignment with discrete Fréchet distance.基于离散弗雷歇距离的蛋白质结构-结构比对
J Bioinform Comput Biol. 2008 Feb;6(1):51-64. doi: 10.1142/s0219720008003278.
5
Data growth and its impact on the SCOP database: new developments.数据增长及其对SCOP数据库的影响:新进展
Nucleic Acids Res. 2008 Jan;36(Database issue):D419-25. doi: 10.1093/nar/gkm993. Epub 2007 Nov 13.
6
Protein structural similarity search by Ramachandran codes.通过拉马钱德兰编码进行蛋白质结构相似性搜索。
BMC Bioinformatics. 2007 Aug 23;8:307. doi: 10.1186/1471-2105-8-307.
7
Growth of novel protein structural data.新型蛋白质结构数据的增长。
Proc Natl Acad Sci U S A. 2007 Feb 27;104(9):3183-8. doi: 10.1073/pnas.0611678104. Epub 2007 Feb 20.
8
Protein structure database search and evolutionary classification.蛋白质结构数据库搜索与进化分类。
Nucleic Acids Res. 2006 Aug 2;34(13):3646-59. doi: 10.1093/nar/gkl395. Print 2006.
9
TM-align: a protein structure alignment algorithm based on the TM-score.TM-align:一种基于TM分数的蛋白质结构比对算法。
Nucleic Acids Res. 2005 Apr 22;33(7):2302-9. doi: 10.1093/nar/gki524. Print 2005.
10
The URMS-RMS hybrid algorithm for fast and sensitive local protein structure alignment.用于快速灵敏的局部蛋白质结构比对的URMS-RMS混合算法。
J Comput Biol. 2005;12(1):12-32. doi: 10.1089/cmb.2005.12.12.