• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

快速H-DROP:H-DROP的30倍加速版本,用于基于支持向量机的螺旋结构域连接子的交互式预测。

Fast H-DROP: A thirty times accelerated version of H-DROP for interactive SVM-based prediction of helical domain linkers.

作者信息

Richa Tambi, Ide Soichiro, Suzuki Ryosuke, Ebina Teppei, Kuroda Yutaka

机构信息

Department of Biotechnology and Life Science, Tokyo University of Agriculture and Technology, 12-24-16 Nakamachi, Koganei-shi, Tokyo, 184-8588, Japan.

Department of Physiology, Graduate school of Medicine, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-0033, Japan.

出版信息

J Comput Aided Mol Des. 2017 Feb;31(2):237-244. doi: 10.1007/s10822-016-9999-8. Epub 2016 Dec 27.

DOI:10.1007/s10822-016-9999-8
PMID:28028736
Abstract

Efficient and rapid prediction of domain regions from amino acid sequence information alone is often required for swift structural and functional characterization of large multi-domain proteins. Here we introduce Fast H-DROP, a thirty times accelerated version of our previously reported H-DROP (Helical Domain linker pRediction using OPtimal features), which is unique in specifically predicting helical domain linkers (boundaries). Fast H-DROP, analogously to H-DROP, uses optimum features selected from a set of 3000 ones by combining a random forest and a stepwise feature selection protocol. We reduced the computational time from 8.5 min per sequence in H-DROP to 14 s per sequence in Fast H-DROP on an 8 Xeon processor Linux server by using SWISS-PROT instead of Genbank non-redundant (nr) database for generating the PSSMs. The sensitivity and precision of Fast H-DROP assessed by cross-validation were 33.7 and 36.2%, which were merely ~2% lower than that of H-DROP. The reduced computational time of Fast H-DROP, without affecting prediction performances, makes it more interactive and user-friendly. Fast H-DROP and H-DROP are freely available from http://domserv.lab.tuat.ac.jp/ .

摘要

仅根据氨基酸序列信息高效快速地预测结构域区域,对于大型多结构域蛋白质的快速结构和功能表征通常是必需的。在此,我们介绍Fast H-DROP,它是我们之前报道的H-DROP(使用最优特征预测螺旋结构域连接子)的加速30倍版本,其独特之处在于专门预测螺旋结构域连接子(边界)。与H-DROP类似,Fast H-DROP通过结合随机森林和逐步特征选择协议,使用从3000个特征中选出的最优特征。通过使用SWISS-PROT而非Genbank非冗余(nr)数据库来生成位置特异性得分矩阵(PSSM),我们在8个至强处理器的Linux服务器上,将计算时间从H-DROP中每个序列8.5分钟减少到Fast H-DROP中的每个序列14秒。通过交叉验证评估,Fast H-DROP的灵敏度和精确率分别为33.7%和36.2%,仅比H-DROP低约2%。Fast H-DROP在不影响预测性能的情况下减少了计算时间,使其更具交互性和用户友好性。Fast H-DROP和H-DROP可从http://domserv.lab.tuat.ac.jp/免费获取。

相似文献

1
Fast H-DROP: A thirty times accelerated version of H-DROP for interactive SVM-based prediction of helical domain linkers.快速H-DROP:H-DROP的30倍加速版本,用于基于支持向量机的螺旋结构域连接子的交互式预测。
J Comput Aided Mol Des. 2017 Feb;31(2):237-244. doi: 10.1007/s10822-016-9999-8. Epub 2016 Dec 27.
2
H-DROP: an SVM based helical domain linker predictor trained with features optimized by combining random forest and stepwise selection.H-DROP:一种基于支持向量机的螺旋结构域连接子预测器,通过结合随机森林和逐步选择优化特征进行训练。
J Comput Aided Mol Des. 2014 Aug;28(8):831-9. doi: 10.1007/s10822-014-9763-x. Epub 2014 Jun 26.
3
DROP: an SVM domain linker predictor trained with optimal features selected by random forest.DROP:一种使用随机森林选择的最佳特征训练的 SVM 域链接器预测器。
Bioinformatics. 2011 Feb 15;27(4):487-94. doi: 10.1093/bioinformatics/btq700. Epub 2010 Dec 17.
4
Loop-length-dependent SVM prediction of domain linkers for high-throughput structural proteomics.用于高通量结构蛋白质组学的结构域连接子的环长依赖性支持向量机预测
Biopolymers. 2009;92(1):1-8. doi: 10.1002/bip.21105.
5
PSSM-based prediction of DNA binding sites in proteins.基于位置特异性得分矩阵的蛋白质中DNA结合位点预测
BMC Bioinformatics. 2005 Feb 19;6:33. doi: 10.1186/1471-2105-6-33.
6
Armadillo: domain boundary prediction by amino acid composition.犰狳:基于氨基酸组成的结构域边界预测
J Mol Biol. 2005 Jul 29;350(5):1061-73. doi: 10.1016/j.jmb.2005.05.037.
7
FastRNABindR: Fast and Accurate Prediction of Protein-RNA Interface Residues.FastRNABindR:蛋白质-RNA 界面残基的快速准确预测
PLoS One. 2016 Jul 6;11(7):e0158445. doi: 10.1371/journal.pone.0158445. eCollection 2016.
8
DOMAC: an accurate, hybrid protein domain prediction server.DOMAC:一个准确的混合蛋白质结构域预测服务器。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W354-6. doi: 10.1093/nar/gkm390. Epub 2007 Jun 6.
9
Domain boundary prediction based on profile domain linker propensity index.基于序列轮廓结构域连接子倾向指数的结构域边界预测
Comput Biol Chem. 2006 Apr;30(2):127-33. doi: 10.1016/j.compbiolchem.2006.01.001. Epub 2006 Mar 13.
10
Characteristics and prediction of domain linker sequences in multi-domain proteins.多结构域蛋白中结构域连接子序列的特征与预测
J Struct Funct Genomics. 2003;4(2-3):79-85. doi: 10.1023/a:1026163008203.

本文引用的文献

1
FastRNABindR: Fast and Accurate Prediction of Protein-RNA Interface Residues.FastRNABindR:蛋白质-RNA 界面残基的快速准确预测
PLoS One. 2016 Jul 6;11(7):e0158445. doi: 10.1371/journal.pone.0158445. eCollection 2016.
2
Prediction of nucleic acid binding probability in proteins: a neighboring residue network based score.蛋白质中核酸结合概率的预测:基于相邻残基网络的评分
Nucleic Acids Res. 2015 Jun 23;43(11):5340-51. doi: 10.1093/nar/gkv446. Epub 2015 May 4.
3
Prediction of membrane transport proteins and their substrate specificities using primary sequence information.
利用一级序列信息预测膜转运蛋白及其底物特异性。
PLoS One. 2014 Jun 26;9(6):e100278. doi: 10.1371/journal.pone.0100278. eCollection 2014.
4
H-DROP: an SVM based helical domain linker predictor trained with features optimized by combining random forest and stepwise selection.H-DROP:一种基于支持向量机的螺旋结构域连接子预测器,通过结合随机森林和逐步选择优化特征进行训练。
J Comput Aided Mol Des. 2014 Aug;28(8):831-9. doi: 10.1007/s10822-014-9763-x. Epub 2014 Jun 26.
5
ThreaDom: extracting protein domain boundary information from multiple threading alignments.ThreaDom:从多重序列比对中提取蛋白质结构域边界信息。
Bioinformatics. 2013 Jul 1;29(13):i247-56. doi: 10.1093/bioinformatics/btt209.
6
IS-Dom: a dataset of independent structural domains automatically delineated from protein structures.IS-Dom:一个从蛋白质结构中自动划分的独立结构域数据集。
J Comput Aided Mol Des. 2013 May;27(5):419-26. doi: 10.1007/s10822-013-9654-6. Epub 2013 May 29.
7
Structure of LdtMt2, an L,D-transpeptidase from Mycobacterium tuberculosis.结核分枝杆菌的L,D-转肽酶LdtMt2的结构
Acta Crystallogr D Biol Crystallogr. 2013 Mar;69(Pt 3):432-41. doi: 10.1107/S0907444912049268. Epub 2013 Feb 16.
8
Linkers in the structural biology of protein-protein interactions.蛋白质-蛋白质相互作用结构生物学中的连接子。
Protein Sci. 2013 Feb;22(2):153-67. doi: 10.1002/pro.2206. Epub 2013 Jan 8.
9
Molecular adaptation of the DegQ protease to exert protein quality control in the bacterial cell envelope.DegQ 蛋白酶的分子适应在细菌细胞包膜中发挥蛋白质质量控制作用。
J Biol Chem. 2011 Sep 2;286(35):30680-30690. doi: 10.1074/jbc.M111.243832. Epub 2011 Jun 17.
10
SH3YL1 regulates dorsal ruffle formation by a novel phosphoinositide-binding domain.SH3YL1 通过一种新型的磷酯结合域调节背侧皱襞的形成。
J Cell Biol. 2011 May 30;193(5):901-16. doi: 10.1083/jcb.201012161.