• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质序列的同时比对与折叠

Simultaneous alignment and folding of protein sequences.

作者信息

Waldispühl Jérôme, O'Donnell Charles W, Will Sebastian, Devadas Srinivas, Backofen Rolf, Berger Bonnie

机构信息

1 School of Computer Science, McGill University , Montreal, Canada .

出版信息

J Comput Biol. 2014 Jul;21(7):477-91. doi: 10.1089/cmb.2013.0163. Epub 2014 Apr 25.

DOI:10.1089/cmb.2013.0163
PMID:24766258
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4082353/
Abstract

Accurate comparative analysis tools for low-homology proteins remains a difficult challenge in computational biology, especially sequence alignment and consensus folding problems. We present partiFold-Align, the first algorithm for simultaneous alignment and consensus folding of unaligned protein sequences; the algorithm's complexity is polynomial in time and space. Algorithmically, partiFold-Align exploits sparsity in the set of super-secondary structure pairings and alignment candidates to achieve an effectively cubic running time for simultaneous pairwise alignment and folding. We demonstrate the efficacy of these techniques on transmembrane β-barrel proteins, an important yet difficult class of proteins with few known three-dimensional structures. Testing against structurally derived sequence alignments, partiFold-Align significantly outperforms state-of-the-art pairwise and multiple sequence alignment tools in the most difficult low-sequence homology case. It also improves secondary structure prediction where current approaches fail. Importantly, partiFold-Align requires no prior training. These general techniques are widely applicable to many more protein families (partiFold-Align is available at http://partifold.csail.mit.edu/ ).

摘要

对于低同源性蛋白质而言,精确的比较分析工具在计算生物学领域仍是一项艰巨挑战,尤其是在序列比对和共有折叠问题方面。我们提出了partiFold - Align,这是首个用于未比对蛋白质序列同时进行比对和共有折叠的算法;该算法在时间和空间上的复杂度均为多项式。从算法角度来看,partiFold - Align利用超二级结构配对和比对候选集中的稀疏性,实现了同时进行成对序列比对和折叠的有效立方运行时间。我们在跨膜β桶蛋白上展示了这些技术的有效性,跨膜β桶蛋白是一类重要但难度较大的蛋白质,已知三维结构很少。与基于结构推导的序列比对进行测试时,在最困难的低序列同源性情况下,partiFold - Align显著优于当前最先进的成对和多序列比对工具。在当前方法失效的二级结构预测方面,它也有所改进。重要的是,partiFold - Align无需预先训练。这些通用技术广泛适用于更多蛋白质家族(partiFold - Align可在http://partifold.csail.mit.edu/获取)。

相似文献

1
Simultaneous alignment and folding of protein sequences.蛋白质序列的同时比对与折叠
J Comput Biol. 2014 Jul;21(7):477-91. doi: 10.1089/cmb.2013.0163. Epub 2014 Apr 25.
2
Modeling ensembles of transmembrane beta-barrel proteins.跨膜β桶状蛋白的模型集成
Proteins. 2008 May 15;71(3):1097-112. doi: 10.1002/prot.21788.
3
A max-margin model for efficient simultaneous alignment and folding of RNA sequences.一种用于RNA序列高效同时比对和折叠的最大间隔模型。
Bioinformatics. 2008 Jul 1;24(13):i68-76. doi: 10.1093/bioinformatics/btn177.
4
Structure alignment of membrane proteins: Accuracy of available tools and a consensus strategy.膜蛋白的结构比对:现有工具的准确性及一种共识策略
Proteins. 2015 Sep;83(9):1720-32. doi: 10.1002/prot.24857. Epub 2015 Aug 1.
5
Iterative sequence/secondary structure search for protein homologs: comparison with amino acid sequence alignments and application to fold recognition in genome databases.用于蛋白质同源物的迭代序列/二级结构搜索:与氨基酸序列比对的比较及在基因组数据库中折叠识别的应用
Bioinformatics. 2000 Nov;16(11):988-1002. doi: 10.1093/bioinformatics/16.11.988.
6
mTM-align: an algorithm for fast and accurate multiple protein structure alignment.mTM-align:一种快速准确的多蛋白质结构比对算法。
Bioinformatics. 2018 May 15;34(10):1719-1725. doi: 10.1093/bioinformatics/btx828.
7
SPEM: improving multiple sequence alignment with sequence profiles and predicted secondary structures.SPEM:利用序列概况和预测的二级结构改进多序列比对
Bioinformatics. 2005 Sep 15;21(18):3615-21. doi: 10.1093/bioinformatics/bti582. Epub 2005 Jul 14.
8
PRALINETM: a strategy for improved multiple alignment of transmembrane proteins.PRALINETM:一种改进跨膜蛋白多重比对的策略。
Bioinformatics. 2008 Feb 15;24(4):492-7. doi: 10.1093/bioinformatics/btm636. Epub 2008 Jan 2.
9
Alignment of protein sequences by their profiles.通过蛋白质序列的图谱进行比对。
Protein Sci. 2004 Apr;13(4):1071-87. doi: 10.1110/ps.03379804.
10
Consensus folding of unaligned RNA sequences revisited.重新审视未比对RNA序列的一致性折叠
J Comput Biol. 2006 Mar;13(2):283-95. doi: 10.1089/cmb.2006.13.283.

本文引用的文献

1
Pfam: the protein families database.Pfam:蛋白质家族数据库。
Nucleic Acids Res. 2014 Jan;42(Database issue):D222-30. doi: 10.1093/nar/gkt1223. Epub 2013 Nov 27.
2
Efficient traversal of beta-sheet protein folding pathways using ensemble models.使用集成模型高效遍历β-折叠蛋白质折叠途径
J Comput Biol. 2011 Nov;18(11):1635-47. doi: 10.1089/cmb.2011.0176. Epub 2011 Sep 29.
3
A method for probing the mutational landscape of amyloid structure.一种探测淀粉样结构突变特征的方法。
Bioinformatics. 2011 Jul 1;27(13):i34-42. doi: 10.1093/bioinformatics/btr238.
4
A max-margin model for efficient simultaneous alignment and folding of RNA sequences.一种用于RNA序列高效同时比对和折叠的最大间隔模型。
Bioinformatics. 2008 Jul 1;24(13):i68-76. doi: 10.1093/bioinformatics/btn177.
5
Matt: local flexibility aids protein multiple structure alignment.马特:局部灵活性有助于蛋白质多结构比对。
PLoS Comput Biol. 2008 Jan;4(1):e10. doi: 10.1371/journal.pcbi.0040010.
6
Modeling ensembles of transmembrane beta-barrel proteins.跨膜β桶状蛋白的模型集成
Proteins. 2008 May 15;71(3):1097-112. doi: 10.1002/prot.21788.
7
Fast pairwise structural RNA alignments by pruning of the dynamical programming matrix.通过修剪动态规划矩阵实现快速成对结构RNA比对。
PLoS Comput Biol. 2007 Oct;3(10):1896-908. doi: 10.1371/journal.pcbi.0030193. Epub 2007 Aug 20.
8
Variations on RNA folding and alignment: lessons from Benasque.RNA折叠与比对的变体:来自贝纳斯克的经验教训。
J Math Biol. 2008 Jan;56(1-2):129-44. doi: 10.1007/s00285-007-0107-5. Epub 2007 Jul 5.
9
Inferring noncoding RNA families and classes by means of genome-scale structure-based clustering.通过基于基因组规模结构的聚类推断非编码RNA家族和类别。
PLoS Comput Biol. 2007 Apr 13;3(4):e65. doi: 10.1371/journal.pcbi.0030065. Epub 2007 Feb 22.
10
Quantification of the variation in percentage identity for protein sequence alignments.蛋白质序列比对中百分比一致性变化的量化。
BMC Bioinformatics. 2006 Sep 19;7:415. doi: 10.1186/1471-2105-7-415.