• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于蛋白质结构域边界整合分析的模块化内核方法。

A modular kernel approach for integrative analysis of protein domain boundaries.

机构信息

Advanced Networks Research Group, School of Information Technologies (J12), the University of Sydney, NSW 2006, Australia.

出版信息

BMC Genomics. 2009 Dec 3;10 Suppl 3(Suppl 3):S21. doi: 10.1186/1471-2164-10-S3-S21.

DOI:10.1186/1471-2164-10-S3-S21
PMID:19958485
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2788374/
Abstract

BACKGROUND

In this paper, we introduce a novel inter-range interaction integrated approach for protein domain boundary prediction. It involves (1) the design of modular kernel algorithm, which is able to effectively exploit the information of non-local interactions in amino acids, and (2) the development of a novel profile that can provide suitable information to the algorithm. One of the key features of this profiling technique is the use of multiple structural alignments of remote homologues to create an extended sequence profile and combines the structural information with suitable chemical information that plays an important role in protein stability. This profile can capture the sequence characteristics of an entire structural superfamily and extend a range of profiles generated from sequence similarity alone.

RESULTS

Our novel profile that combines homology information with hydrophobicity from SARAH1 scale was successful in providing more structural and chemical information. In addition, the modular approach adopted in our algorithm proved to be effective in capturing information from non-local interactions. Our approach achieved 82.1%, 50.9% and 31.5% accuracies for one-domain, two-domain, and three- and more domain proteins respectively.

CONCLUSION

The experimental results in this study are encouraging, however, more work is need to extend it to a broader range of applications. We are currently developing a novel interactive (human in the loop) profiling that can provide information from more distantly related homology. This approach will further enhance the current study.

摘要

背景

在本文中,我们介绍了一种新颖的基于跨区间相互作用的蛋白质结构域边界预测方法。该方法包括:(1)模块化核算法的设计,该算法能够有效地利用氨基酸的非局部相互作用信息;(2)新型轮廓的开发,该轮廓可以为算法提供合适的信息。该轮廓技术的一个关键特征是使用多个远程同源物的结构比对来创建扩展的序列轮廓,并将结构信息与合适的化学信息相结合,这些化学信息在蛋白质稳定性中起着重要作用。该轮廓可以捕获整个结构超家族的序列特征,并扩展仅基于序列相似性生成的一系列轮廓。

结果

我们的新型轮廓将同源信息与 SARAH1 标度的疏水性相结合,成功地提供了更多的结构和化学信息。此外,我们算法中采用的模块化方法在捕获非局部相互作用信息方面也非常有效。我们的方法在单结构域、双结构域以及三结构域和更多结构域蛋白的预测中分别实现了 82.1%、50.9%和 31.5%的准确率。

结论

本研究的实验结果令人鼓舞,但需要进一步的工作将其扩展到更广泛的应用领域。我们目前正在开发一种新的交互式(人机交互)轮廓技术,可以从更远缘的同源物中获取信息。这种方法将进一步增强当前的研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2b2/2788374/d71fae36130f/1471-2164-10-S3-S21-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2b2/2788374/5e20dc553963/1471-2164-10-S3-S21-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2b2/2788374/d71fae36130f/1471-2164-10-S3-S21-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2b2/2788374/5e20dc553963/1471-2164-10-S3-S21-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2b2/2788374/d71fae36130f/1471-2164-10-S3-S21-2.jpg

相似文献

1
A modular kernel approach for integrative analysis of protein domain boundaries.一种用于蛋白质结构域边界整合分析的模块化内核方法。
BMC Genomics. 2009 Dec 3;10 Suppl 3(Suppl 3):S21. doi: 10.1186/1471-2164-10-S3-S21.
2
Use of covariance analysis for the prediction of structural domain boundaries from multiple protein sequence alignments.利用协方差分析从多个蛋白质序列比对预测结构域边界。
Protein Eng. 2002 Feb;15(2):65-77. doi: 10.1093/protein/15.2.65.
3
SnapDRAGON: a method to delineate protein structural domains from sequence data.SnapDRAGON:一种从序列数据中描绘蛋白质结构域的方法。
J Mol Biol. 2002 Feb 22;316(3):839-51. doi: 10.1006/jmbi.2001.5387.
4
Amino acid alphabet reduction preserves fold information contained in contact interactions in proteins.氨基酸字母表精简保留了蛋白质中接触相互作用所包含的折叠信息。
Proteins. 2015 Dec;83(12):2198-216. doi: 10.1002/prot.24936.
5
Domain boundary prediction based on profile domain linker propensity index.基于序列轮廓结构域连接子倾向指数的结构域边界预测
Comput Biol Chem. 2006 Apr;30(2):127-33. doi: 10.1016/j.compbiolchem.2006.01.001. Epub 2006 Mar 13.
6
Protein inter-domain linker prediction using Random Forest and amino acid physiochemical properties.利用随机森林和氨基酸理化性质进行蛋白质结构域间连接子预测。
BMC Bioinformatics. 2014;15 Suppl 16(Suppl 16):S8. doi: 10.1186/1471-2105-15-S16-S8. Epub 2014 Dec 8.
7
Context similarity scoring improves protein sequence alignments in the midnight zone.语境相似性评分提高了午夜区的蛋白质序列比对。
Bioinformatics. 2015 Mar 1;31(5):674-81. doi: 10.1093/bioinformatics/btu697. Epub 2014 Oct 22.
8
Armadillo: domain boundary prediction by amino acid composition.犰狳:基于氨基酸组成的结构域边界预测
J Mol Biol. 2005 Jul 29;350(5):1061-73. doi: 10.1016/j.jmb.2005.05.037.
9
Within the twilight zone: a sensitive profile-profile comparison tool based on information theory.在模糊区域内:一种基于信息论的灵敏的轮廓-轮廓比较工具。
J Mol Biol. 2002 Feb 1;315(5):1257-75. doi: 10.1006/jmbi.2001.5293.
10
High quality protein sequence alignment by combining structural profile prediction and profile alignment using SABER-TOOTH.使用 SABER-TOOTH 结合结构轮廓预测和轮廓比对进行高质量蛋白质序列比对。
BMC Bioinformatics. 2010 May 14;11:251. doi: 10.1186/1471-2105-11-251.

引用本文的文献

1
Protein inter-domain linker prediction using Random Forest and amino acid physiochemical properties.利用随机森林和氨基酸理化性质进行蛋白质结构域间连接子预测。
BMC Bioinformatics. 2014;15 Suppl 16(Suppl 16):S8. doi: 10.1186/1471-2105-15-S16-S8. Epub 2014 Dec 8.
2
Extending Asia Pacific bioinformatics into new realms in the "-omics" era.将亚太生物信息学拓展到“组学”时代的新领域。
BMC Genomics. 2009 Dec 3;10 Suppl 3(Suppl 3):S1. doi: 10.1186/1471-2164-10-S3-S1.

本文引用的文献

1
DomNet: protein domain boundary prediction using enhanced general regression network and new profiles.DomNet:使用增强型通用回归网络和新轮廓进行蛋白质结构域边界预测
IEEE Trans Nanobioscience. 2008 Jun;7(2):172-81. doi: 10.1109/TNB.2008.2000747.
2
SiteSeek: post-translational modification analysis using adaptive locality-effective kernel methods and new profiles.SiteSeek:使用自适应局部有效核方法和新概况进行翻译后修饰分析。
BMC Bioinformatics. 2008 Jun 10;9:272. doi: 10.1186/1471-2105-9-272.
3
Improved general regression network for protein domain boundary prediction.
用于蛋白质结构域边界预测的改进型通用回归网络。
BMC Bioinformatics. 2008;9 Suppl 1(Suppl 1):S12. doi: 10.1186/1471-2105-9-S1-S12.
4
Learning long-term dependencies with gradient descent is difficult.使用梯度下降法学习长期依赖关系是困难的。
IEEE Trans Neural Netw. 1994;5(2):157-66. doi: 10.1109/72.279181.
5
Cascaded bidirectional recurrent neural networks for protein secondary structure prediction.用于蛋白质二级结构预测的级联双向递归神经网络
IEEE/ACM Trans Comput Biol Bioinform. 2007 Oct-Dec;4(4):572-582. doi: 10.1109/tcbb.2007.1055.
6
Towards a comprehensive structural coverage of completed genomes: a structural genomics viewpoint.迈向已完成基因组的全面结构覆盖:结构基因组学视角
BMC Bioinformatics. 2007 Mar 9;8:86. doi: 10.1186/1471-2105-8-86.
7
Partitioning protein structures into domains: why is it so difficult?将蛋白质结构划分为结构域:为何如此困难?
J Mol Biol. 2006 Aug 18;361(3):562-90. doi: 10.1016/j.jmb.2006.05.060. Epub 2006 Jun 22.
8
Multiple sequence alignment.多序列比对
Curr Opin Struct Biol. 2006 Jun;16(3):368-73. doi: 10.1016/j.sbi.2006.04.004. Epub 2006 May 5.
9
Phosphorylation and concomitant structural changes in human 2-Cys peroxiredoxin isotype I differentially regulate its peroxidase and molecular chaperone functions.人I型2-半胱氨酸过氧化物酶中的磷酸化及伴随的结构变化对其过氧化物酶和分子伴侣功能有不同的调节作用。
FEBS Lett. 2006 Jan 9;580(1):351-5. doi: 10.1016/j.febslet.2005.12.030. Epub 2005 Dec 19.
10
PROFcon: novel prediction of long-range contacts.PROFcon:长程接触的新型预测方法
Bioinformatics. 2005 Jul 1;21(13):2960-8. doi: 10.1093/bioinformatics/bti454. Epub 2005 May 12.