DomSVR: domain boundary prediction with support vector regression from sequence information alone.

Affiliation

Department of Systems and Computer Science, Howard University, 2400 Sixth Street, NW, Washington, DC 20059, USA.

Publication information

Amino Acids. 2010 Aug;39(3):713-26. doi: 10.1007/s00726-010-0506-6. Epub 2010 Feb 18.

Abstract

Protein domains are structural and fundamental functional units of proteins. Knowledge of protein domain boundaries helps in understanding the evolution, structure and function of proteins, and also plays an important role in protein classification. In this paper, we propose a support vector regression-based method for identifying protein domain boundaries, using novel input profiles extracted from the AAindex database. Our method achieves an average sensitivity of approximately 36.5% and an average specificity of approximately 81% for multi-domain protein chains, which is overall better than the performance of published approaches to domain boundary identification. Because our method uses sequence information alone, it is simpler and faster.
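As a rough illustration of what a sequence-only input profile for a regression model can look like, the sketch below encodes each residue as a sliding window of amino acid index values and builds a per-residue regression target. It is a minimal sketch under stated assumptions, not the DomSVR feature set: the choice of the Kyte-Doolittle hydropathy scale (one of the indices collected in AAindex), the window size, and the linearly decaying target are all hypothetical, as are the function names `window_profile` and `boundary_targets`.

```python
# Kyte-Doolittle hydropathy values, used here as a stand-in amino acid index.
# ASSUMPTION: the abstract does not say which AAindex entries DomSVR uses.
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def window_profile(sequence, half_width=2, pad=0.0):
    """Return one feature vector per residue.

    Each vector holds the index values of the residues in a window
    centered on that position; positions beyond the chain ends and
    unknown residue codes are filled with `pad`.
    """
    values = [HYDROPATHY.get(res, pad) for res in sequence.upper()]
    padded = [pad] * half_width + values + [pad] * half_width
    size = 2 * half_width + 1
    return [padded[i:i + size] for i in range(len(values))]


def boundary_targets(length, boundaries, reach=3):
    """Hypothetical regression target for each residue.

    The target is 1.0 at an annotated boundary position (0-based) and
    falls linearly to 0.0 at `reach` residues away from the nearest one.
    """
    targets = []
    for i in range(length):
        dist = min((abs(i - b) for b in boundaries), default=reach)
        targets.append(max(0.0, 1.0 - dist / reach))
    return targets


if __name__ == "__main__":
    seq = "MKVLAAGIVG"  # made-up example sequence
    features = window_profile(seq)
    targets = boundary_targets(len(seq), boundaries=[5])
    for residue, x, y in zip(seq, features, targets):
        print(residue, x, round(y, 2))
```

The resulting feature vectors and targets could then be passed to any support vector regression implementation (for example, `sklearn.svm.SVR` in scikit-learn), with high predicted scores read as likely boundary regions.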

Similar articles

Domain boundary prediction based on profile domain linker propensity index.
Comput Biol Chem. 2006 Apr;30(2):127-33. doi: 10.1016/j.compbiolchem.2006.01.001. Epub 2006 Mar 13.

Rebelling for a reason: protein structural "outliers".
PLoS One. 2013 Sep 20;8(9):e74416. doi: 10.1371/journal.pone.0074416. eCollection 2013.

Cited by

Protein domain identification methods and online resources.
Comput Struct Biotechnol J. 2021 Feb 2;19:1145-1153. doi: 10.1016/j.csbj.2021.01.041. eCollection 2021.

The MULTICOM toolbox for protein structure prediction.
BMC Bioinformatics. 2012 Apr 30;13:65. doi: 10.1186/1471-2105-13-65.

References

AAindex: amino acid index database, progress report 2008.
Nucleic Acids Res. 2008 Jan;36(Database issue):D202-5. doi: 10.1093/nar/gkm998. Epub 2007 Nov 12.

CDD: a conserved domain database for interactive domain family analysis.
Nucleic Acids Res. 2007 Jan;35(Database issue):D237-40. doi: 10.1093/nar/gkl951. Epub 2006 Nov 29.

Protein structure prediction servers at University College London.
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W36-8. doi: 10.1093/nar/gki410.
