• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用铰链区策略准确识别蛋白质的结构域边界。

DomHR: accurately identifying domain boundaries in proteins using a hinge region strategy.

机构信息

Department of Chemistry, Tongji University, Shanghai, China.

出版信息

PLoS One. 2013 Apr 11;8(4):e60559. doi: 10.1371/journal.pone.0060559. Print 2013.

DOI:10.1371/journal.pone.0060559
PMID:23593247
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3623903/
Abstract

MOTIVATION

The precise prediction of protein domains, which are the structural, functional and evolutionary units of proteins, has been a research focus in recent years. Although many methods have been presented for predicting protein domains and boundaries, the accuracy of predictions could be improved.

RESULTS

In this study we present a novel approach, DomHR, which is an accurate predictor of protein domain boundaries based on a creative hinge region strategy. A hinge region was defined as a segment of amino acids that covers part of a domain region and a boundary region. We developed a strategy to construct profiles of domain-hinge-boundary (DHB) features generated by sequence-domain/hinge/boundary alignment against a database of known domain structures. The DHB features had three elements: normalized domain, hinge, and boundary probabilities. The DHB features were used as input to identify domain boundaries in a sequence. DomHR used a nonredundant dataset as the training set, the DHB and predicted shape string as features, and a conditional random field as the classification algorithm. In predicted hinge regions, a residue was determined to be a domain or a boundary according to a decision threshold. After decision thresholds were optimized, DomHR was evaluated by cross-validation, large-scale prediction, independent test and CASP (Critical Assessment of Techniques for Protein Structure Prediction) tests. All results confirmed that DomHR outperformed other well-established, publicly available domain boundary predictors for prediction accuracy.

AVAILABILITY

The DomHR is available at http://cal.tongji.edu.cn/domain/.

摘要

动机

蛋白质结构域的精确预测,即蛋白质的结构、功能和进化单位,是近年来的研究热点。尽管已经提出了许多用于预测蛋白质结构域和边界的方法,但预测的准确性仍有待提高。

结果

在这项研究中,我们提出了一种新的方法 DomHR,它是一种基于铰链区域策略的蛋白质结构域边界的准确预测器。铰链区被定义为覆盖部分结构域和边界区域的氨基酸片段。我们开发了一种策略,用于构建通过序列-结构域/铰链/边界比对生成的结构域-铰链-边界(DHB)特征的轮廓,这些特征与已知结构域结构的数据库相对应。DHB 特征有三个元素:归一化结构域、铰链和边界概率。DHB 特征被用作识别序列中结构域边界的输入。DomHR 使用非冗余数据集作为训练集,DHB 和预测形状字符串作为特征,条件随机场作为分类算法。在预测的铰链区中,根据决策阈值确定残基是结构域还是边界。在优化决策阈值后,通过交叉验证、大规模预测、独立测试和 CASP(蛋白质结构预测技术的关键评估)测试对 DomHR 进行了评估。所有结果都证实,与其他成熟的、可公开获得的结构域边界预测器相比,DomHR 在预测准确性方面表现更为出色。

可用性

DomHR 可在 http://cal.tongji.edu.cn/domain/ 上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68da/3623903/a5d5c61ef338/pone.0060559.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68da/3623903/37457ab0c6f5/pone.0060559.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68da/3623903/d448c9a27c74/pone.0060559.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68da/3623903/a5d796de801c/pone.0060559.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68da/3623903/a5d5c61ef338/pone.0060559.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68da/3623903/37457ab0c6f5/pone.0060559.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68da/3623903/d448c9a27c74/pone.0060559.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68da/3623903/a5d796de801c/pone.0060559.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68da/3623903/a5d5c61ef338/pone.0060559.g004.jpg

相似文献

1
DomHR: accurately identifying domain boundaries in proteins using a hinge region strategy.使用铰链区策略准确识别蛋白质的结构域边界。
PLoS One. 2013 Apr 11;8(4):e60559. doi: 10.1371/journal.pone.0060559. Print 2013.
2
DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach.DisoMCS:使用多类保守评分方法准确预测蛋白质内在无序区域
PLoS One. 2015 Jun 19;10(6):e0128334. doi: 10.1371/journal.pone.0128334. eCollection 2015.
3
Ab initio and homology based prediction of protein domains by recursive neural networks.利用递归神经网络对蛋白质结构域进行从头预测和基于同源性的预测。
BMC Bioinformatics. 2009 Jun 26;10:195. doi: 10.1186/1471-2105-10-195.
4
Evaluation of domain prediction in CASP6.CASP6中结构域预测的评估
Proteins. 2005;61 Suppl 7:183-192. doi: 10.1002/prot.20736.
5
Domain boundary prediction based on profile domain linker propensity index.基于序列轮廓结构域连接子倾向指数的结构域边界预测
Comput Biol Chem. 2006 Apr;30(2):127-33. doi: 10.1016/j.compbiolchem.2006.01.001. Epub 2006 Mar 13.
6
SnapDRAGON: a method to delineate protein structural domains from sequence data.SnapDRAGON:一种从序列数据中描绘蛋白质结构域的方法。
J Mol Biol. 2002 Feb 22;316(3):839-51. doi: 10.1006/jmbi.2001.5387.
7
Automated prediction of domain boundaries in CASP6 targets using Ginzu and RosettaDOM.使用Ginzu和RosettaDOM自动预测CASP6目标中的结构域边界。
Proteins. 2005;61 Suppl 7:193-200. doi: 10.1002/prot.20737.
8
Prediction of protein domain boundaries from sequence alone.仅从序列预测蛋白质结构域边界。
Protein Sci. 2003 Apr;12(4):696-701. doi: 10.1110/ps.0233103.
9
H-DROP: an SVM based helical domain linker predictor trained with features optimized by combining random forest and stepwise selection.H-DROP:一种基于支持向量机的螺旋结构域连接子预测器,通过结合随机森林和逐步选择优化特征进行训练。
J Comput Aided Mol Des. 2014 Aug;28(8):831-9. doi: 10.1007/s10822-014-9763-x. Epub 2014 Jun 26.
10
StoneHinge: hinge prediction by network analysis of individual protein structures.StoneHinge:通过对单个蛋白质结构进行网络分析来预测铰链区
Protein Sci. 2009 Feb;18(2):359-71. doi: 10.1002/pro.38.

引用本文的文献

1
Multi-head attention-based U-Nets for predicting protein domain boundaries using 1D sequence features and 2D distance maps.基于多头注意力的 U-Net 模型,利用 1D 序列特征和 2D 距离图预测蛋白质结构域边界。
BMC Bioinformatics. 2022 Jul 19;23(1):283. doi: 10.1186/s12859-022-04829-1.
2
Protein domain identification methods and online resources.蛋白质结构域鉴定方法及在线资源。
Comput Struct Biotechnol J. 2021 Feb 2;19:1145-1153. doi: 10.1016/j.csbj.2021.01.041. eCollection 2021.
3
PDP-CON: prediction of domain/linker residues in protein sequences using a consensus approach.

本文引用的文献

1
Prediction of protein domain with mRMR feature selection and analysis.基于 mRMR 特征选择的蛋白质结构域预测及分析。
PLoS One. 2012;7(6):e39308. doi: 10.1371/journal.pone.0039308. Epub 2012 Jun 15.
2
DSP: a protein shape string and its profile prediction server.DSP:蛋白质形状字符串及其轮廓预测服务器。
Nucleic Acids Res. 2012 Jul;40(Web Server issue):W298-302. doi: 10.1093/nar/gks361. Epub 2012 May 2.
3
SPINE-D: accurate prediction of short and long disordered regions by a single neural-network based method.SPINE-D:基于单个神经网络的方法准确预测短和长的无序区域。
PDP-CON:使用共识方法预测蛋白质序列中的结构域/连接子残基。
J Mol Model. 2016 Apr;22(4):72. doi: 10.1007/s00894-016-2933-0. Epub 2016 Mar 11.
4
Analysis and prediction of the critical regions of antimicrobial peptides based on conditional random fields.基于条件随机场的抗菌肽关键区域分析与预测
PLoS One. 2015 Mar 24;10(3):e0119490. doi: 10.1371/journal.pone.0119490. eCollection 2015.
5
Sequence based prediction of DNA-binding proteins based on hybrid feature selection using random forest and Gaussian naïve Bayes.基于随机森林和高斯朴素贝叶斯混合特征选择的DNA结合蛋白序列预测
PLoS One. 2014 Jan 24;9(1):e86703. doi: 10.1371/journal.pone.0086703. eCollection 2014.
6
NMRDSP: an accurate prediction of protein shape strings from NMR chemical shifts and sequence data.NMRDSP:基于核磁共振化学位移和序列数据对蛋白质形状字符串进行准确预测
PLoS One. 2013 Dec 23;8(12):e83532. doi: 10.1371/journal.pone.0083532. eCollection 2013.
J Biomol Struct Dyn. 2012;29(4):799-813. doi: 10.1080/073911012010525022.
4
A novel structural position-specific scoring matrix for the prediction of protein secondary structures.一种新型结构位置特异性评分矩阵,用于预测蛋白质二级结构。
Bioinformatics. 2012 Jan 1;28(1):32-9. doi: 10.1093/bioinformatics/btr611. Epub 2011 Nov 7.
5
Improving the performance of β-turn prediction using predicted shape strings and a two-layer support vector machine model.利用预测形状字符串和两层支持向量机模型提高 β-转角预测的性能。
BMC Bioinformatics. 2011 Jul 13;12:283. doi: 10.1186/1471-2105-12-283.
6
Using predicted shape string to enhance the accuracy of γ-turn prediction.利用预测的形状字符串提高 γ-转角预测的准确性。
Amino Acids. 2012 May;42(5):1749-55. doi: 10.1007/s00726-011-0889-z. Epub 2011 Mar 22.
7
Protein domain assignment from the recurrence of locally similar structures.基于局部相似结构的重复对蛋白质结构域进行分配。
Proteins. 2011 Mar;79(3):853-66. doi: 10.1002/prot.22923. Epub 2010 Dec 22.
8
DoBo: Protein domain boundary prediction by integrating evolutionary signals and machine learning.多宝:通过整合进化信号和机器学习进行蛋白质结构域边界预测。
BMC Bioinformatics. 2011 Feb 1;12:43. doi: 10.1186/1471-2105-12-43.
9
DROP: an SVM domain linker predictor trained with optimal features selected by random forest.DROP:一种使用随机森林选择的最佳特征训练的 SVM 域链接器预测器。
Bioinformatics. 2011 Feb 15;27(4):487-94. doi: 10.1093/bioinformatics/btq700. Epub 2010 Dec 17.
10
DomSVR: domain boundary prediction with support vector regression from sequence information alone.DomSVR:仅从序列信息进行支持向量回归的域边界预测。
Amino Acids. 2010 Aug;39(3):713-26. doi: 10.1007/s00726-010-0506-6. Epub 2010 Feb 18.