• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

犰狳:基于氨基酸组成的结构域边界预测

Armadillo: domain boundary prediction by amino acid composition.

作者信息

Dumontier Michel, Yao Rong, Feldman Howard J, Hogue Christopher W V

机构信息

Department of Biochemistry, University of Toronto, Toronto, Ont., Canada M5S 1A8.

出版信息

J Mol Biol. 2005 Jul 29;350(5):1061-73. doi: 10.1016/j.jmb.2005.05.037.

DOI:10.1016/j.jmb.2005.05.037
PMID:15978619
Abstract

The identification and annotation of protein domains provides a critical step in the accurate determination of molecular function. Both computational and experimental methods of protein structure determination may be deterred by large multi-domain proteins or flexible linker regions. Knowledge of domains and their boundaries may reduce the experimental cost of protein structure determination by allowing researchers to work on a set of smaller and possibly more successful alternatives. Current domain prediction methods often rely on sequence similarity to conserved domains and as such are poorly suited to detect domain structure in poorly conserved or orphan proteins. We present here a simple computational method to identify protein domain linkers and their boundaries from sequence information alone. Our domain predictor, Armadillo (http://armadillo.blueprint.org), uses any amino acid index to convert a protein sequence to a smoothed numeric profile from which domains and domain boundaries may be predicted. We derived an amino acid index called the domain linker propensity index (DLI) from the amino acid composition of domain linkers using a non-redundant structure dataset. The index indicates that Pro and Gly show a propensity for linker residues while small hydrophobic residues do not. Armadillo predicts domain linker boundaries from Z-score distributions and obtains 35% sensitivity with DLI in a two-domain, single-linker dataset (within +/-20 residues from linker). The combination of DLI and an entropy-based amino acid index increases the overall Armadillo sensitivity to 56% for two domain proteins. Moreover, Armadillo achieves 37% sensitivity for multi-domain proteins, surpassing most other prediction methods. Armadillo provides a simple, but effective method by which prediction of domain boundaries can be obtained with reasonable sensitivity. Armadillo should prove to be a valuable tool for rapidly delineating protein domains in poorly conserved proteins or those with no sequence neighbors. As a first-line predictor, domain meta-predictors could yield improved results with Armadillo predictions.

摘要

蛋白质结构域的识别与注释是准确确定分子功能的关键步骤。蛋白质结构测定的计算方法和实验方法都可能受到大型多结构域蛋白质或柔性连接区域的阻碍。了解结构域及其边界可以让研究人员处理一组更小且可能更成功的替代方案,从而降低蛋白质结构测定的实验成本。当前的结构域预测方法通常依赖于与保守结构域的序列相似性,因此不太适合检测保守性较差或孤儿蛋白中的结构域结构。我们在此提出一种简单的计算方法,仅从序列信息中识别蛋白质结构域连接子及其边界。我们的结构域预测工具犰狳(http://armadillo.blueprint.org)使用任何氨基酸指数将蛋白质序列转换为平滑的数字轮廓,从中可以预测结构域和结构域边界。我们使用非冗余结构数据集从结构域连接子的氨基酸组成中推导了一种名为结构域连接子倾向指数(DLI)的氨基酸指数。该指数表明脯氨酸和甘氨酸显示出作为连接子残基的倾向,而小的疏水残基则不然。犰狳根据Z分数分布预测结构域连接子边界,在双结构域、单连接子数据集中(连接子两侧±20个残基范围内)使用DLI时灵敏度达到35%。DLI与基于熵的氨基酸指数相结合,使犰狳对双结构域蛋白质的总体灵敏度提高到56%。此外,犰狳对多结构域蛋白质的灵敏度达到37%,超过了大多数其他预测方法。犰狳提供了一种简单但有效的方法,通过该方法可以以合理的灵敏度获得结构域边界的预测。对于快速描绘保守性较差的蛋白质或没有序列邻域的蛋白质中的结构域,犰狳应被证明是一个有价值的工具。作为一线预测工具,结构域元预测工具结合犰狳的预测可能会产生更好的结果。

相似文献

1
Armadillo: domain boundary prediction by amino acid composition.犰狳:基于氨基酸组成的结构域边界预测
J Mol Biol. 2005 Jul 29;350(5):1061-73. doi: 10.1016/j.jmb.2005.05.037.
2
Domain boundary prediction based on profile domain linker propensity index.基于序列轮廓结构域连接子倾向指数的结构域边界预测
Comput Biol Chem. 2006 Apr;30(2):127-33. doi: 10.1016/j.compbiolchem.2006.01.001. Epub 2006 Mar 13.
3
SnapDRAGON: a method to delineate protein structural domains from sequence data.SnapDRAGON:一种从序列数据中描绘蛋白质结构域的方法。
J Mol Biol. 2002 Feb 22;316(3):839-51. doi: 10.1006/jmbi.2001.5387.
4
DomNet: protein domain boundary prediction using enhanced general regression network and new profiles.DomNet:使用增强型通用回归网络和新轮廓进行蛋白质结构域边界预测
IEEE Trans Nanobioscience. 2008 Jun;7(2):172-81. doi: 10.1109/TNB.2008.2000747.
5
[Prediction of protein domain boundaries based on statistics of appearance of amino acid residues].基于氨基酸残基出现统计的蛋白质结构域边界预测
Mol Biol (Mosk). 2006 Jan-Feb;40(1):111-21.
6
Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index.利用结构域间连接子指数提高蛋白质结构域边界分配的结构域发现性能。
BMC Bioinformatics. 2006 Dec 18;7 Suppl 5(Suppl 5):S6. doi: 10.1186/1471-2105-7-S5-S6.
7
Identification of putative domain linkers by a neural network - application to a large sequence database.通过神经网络识别假定的结构域连接子——应用于大型序列数据库
BMC Bioinformatics. 2006 Jun 27;7:323. doi: 10.1186/1471-2105-7-323.
8
Improvement of domain linker prediction by incorporating loop-length-dependent characteristics.通过纳入环长度依赖性特征改进结构域连接子预测。
Biopolymers. 2006;84(2):161-8. doi: 10.1002/bip.20361.
9
PPRODO: prediction of protein domain boundaries using neural networks.PPRODO:使用神经网络预测蛋白质结构域边界
Proteins. 2005 May 15;59(3):627-32. doi: 10.1002/prot.20442.
10
Protein structure prediction based on sequence similarity.基于序列相似性的蛋白质结构预测。
Methods Mol Biol. 2009;569:129-56. doi: 10.1007/978-1-59745-524-4_7.

引用本文的文献

1
ThreaDomEx: a unified platform for predicting continuous and discontinuous protein domains by multiple-threading and segment assembly.ThreaDomEx:一个通过多线程和片段组装预测连续和不连续蛋白质结构域的统一平台。
Nucleic Acids Res. 2017 Jul 3;45(W1):W400-W407. doi: 10.1093/nar/gkx410.
2
DIRProt: a computational approach for discriminating insecticide resistant proteins from non-resistant proteins.DIRProt:一种区分抗杀虫剂蛋白和非抗杀虫剂蛋白的计算方法。
BMC Bioinformatics. 2017 Mar 24;18(1):190. doi: 10.1186/s12859-017-1587-y.
3
Fast H-DROP: A thirty times accelerated version of H-DROP for interactive SVM-based prediction of helical domain linkers.
快速H-DROP:H-DROP的30倍加速版本,用于基于支持向量机的螺旋结构域连接子的交互式预测。
J Comput Aided Mol Des. 2017 Feb;31(2):237-244. doi: 10.1007/s10822-016-9999-8. Epub 2016 Dec 27.
4
PDP-CON: prediction of domain/linker residues in protein sequences using a consensus approach.PDP-CON:使用共识方法预测蛋白质序列中的结构域/连接子残基。
J Mol Model. 2016 Apr;22(4):72. doi: 10.1007/s00894-016-2933-0. Epub 2016 Mar 11.
5
H-DROP: an SVM based helical domain linker predictor trained with features optimized by combining random forest and stepwise selection.H-DROP:一种基于支持向量机的螺旋结构域连接子预测器,通过结合随机森林和逐步选择优化特征进行训练。
J Comput Aided Mol Des. 2014 Aug;28(8):831-9. doi: 10.1007/s10822-014-9763-x. Epub 2014 Jun 26.
6
Prediction of aptamer-target interacting pairs with pseudo-amino acid composition.基于伪氨基酸组成预测适体-靶标相互作用对
PLoS One. 2014 Jan 22;9(1):e86729. doi: 10.1371/journal.pone.0086729. eCollection 2014.
7
ThreaDom: extracting protein domain boundary information from multiple threading alignments.ThreaDom:从多重序列比对中提取蛋白质结构域边界信息。
Bioinformatics. 2013 Jul 1;29(13):i247-56. doi: 10.1093/bioinformatics/btt209.
8
IS-Dom: a dataset of independent structural domains automatically delineated from protein structures.IS-Dom:一个从蛋白质结构中自动划分的独立结构域数据集。
J Comput Aided Mol Des. 2013 May;27(5):419-26. doi: 10.1007/s10822-013-9654-6. Epub 2013 May 29.
9
DomHR: accurately identifying domain boundaries in proteins using a hinge region strategy.使用铰链区策略准确识别蛋白质的结构域边界。
PLoS One. 2013 Apr 11;8(4):e60559. doi: 10.1371/journal.pone.0060559. Print 2013.
10
Prediction of antimicrobial peptides based on sequence alignment and feature selection methods.基于序列比对和特征选择方法的抗菌肽预测。
PLoS One. 2011 Apr 13;6(4):e18476. doi: 10.1371/journal.pone.0018476.