• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
A General Method for Predicting Amino Acid Residues Experiencing Hydrogen Exchange.一种预测发生氢交换的氨基酸残基的通用方法。
IEEE EMBS Int Conf Biomed Health Inform. 2018 Mar;2018:341-344. doi: 10.1109/BHI.2018.8333438. Epub 2018 Apr 9.
2
Start2Fold: a database of hydrogen/deuterium exchange data on protein folding and stability.Start2Fold:一个关于蛋白质折叠和稳定性的氢/氘交换数据数据库。
Nucleic Acids Res. 2016 Jan 4;44(D1):D429-34. doi: 10.1093/nar/gkv1185. Epub 2015 Nov 17.
3
Prediction of amino acid residues protected from hydrogen-deuterium exchange in a protein chain.预测蛋白质链中受氢-氘交换保护的氨基酸残基。
Biochemistry (Mosc). 2009 Aug;74(8):888-97. doi: 10.1134/s0006297909080100.
4
Rigorous assessment and integration of the sequence and structure based features to predict hot spots.严格评估和整合基于序列和结构的特征,以预测热点。
BMC Bioinformatics. 2011 Jul 29;12:311. doi: 10.1186/1471-2105-12-311.
5
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
6
Predicting DNA-binding sites of proteins from amino acid sequence.从氨基酸序列预测蛋白质的DNA结合位点。
BMC Bioinformatics. 2006 May 19;7:262. doi: 10.1186/1471-2105-7-262.
7
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
8
Structure-based prediction of protein- peptide binding regions using Random Forest.基于结构的随机森林预测蛋白肽结合区域。
Bioinformatics. 2018 Feb 1;34(3):477-484. doi: 10.1093/bioinformatics/btx614.
9
PREvaIL, an integrative approach for inferring catalytic residues using sequence, structural, and network features in a machine-learning framework.PREvaIL,一种基于机器学习框架,使用序列、结构和网络特征推断催化残基的综合方法。
J Theor Biol. 2018 Apr 14;443:125-137. doi: 10.1016/j.jtbi.2018.01.023. Epub 2018 Feb 1.
10
A Machine Learning-Based Model to Predict Acute Traumatic Coagulopathy in Trauma Patients Upon Emergency Hospitalization.基于机器学习的模型预测创伤患者急诊住院时的急性创伤性凝血病。
Clin Appl Thromb Hemost. 2020 Jan-Dec;26:1076029619897827. doi: 10.1177/1076029619897827.

本文引用的文献

1
Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins.探索基于序列的蛋白质折叠起始位点预测。
Sci Rep. 2017 Aug 18;7(1):8826. doi: 10.1038/s41598-017-08366-3.
2
PROPKA3: Consistent Treatment of Internal and Surface Residues in Empirical pKa Predictions.PROPKA3:经验 pKa 预测中内部残基和表面残基的一致处理。
J Chem Theory Comput. 2011 Feb 8;7(2):525-37. doi: 10.1021/ct100578z. Epub 2011 Jan 6.
3
Start2Fold: a database of hydrogen/deuterium exchange data on protein folding and stability.Start2Fold:一个关于蛋白质折叠和稳定性的氢/氘交换数据数据库。
Nucleic Acids Res. 2016 Jan 4;44(D1):D429-34. doi: 10.1093/nar/gkv1185. Epub 2015 Nov 17.
4
Applications of hydrogen/deuterium exchange MS from 2012 to 2014.2012年至2014年氢/氘交换质谱的应用。
Anal Chem. 2015 Jan 6;87(1):99-118. doi: 10.1021/ac5040242. Epub 2014 Nov 14.
5
Maximum allowed solvent accessibilites of residues in proteins.蛋白质中残基的最大允许溶剂可及性。
PLoS One. 2013 Nov 21;8(11):e80635. doi: 10.1371/journal.pone.0080635. eCollection 2013.
6
Hydrogen-exchange mass spectrometry for the study of intrinsic disorder in proteins.用于研究蛋白质内在无序性的氢交换质谱法。
Biochim Biophys Acta. 2013 Jun;1834(6):1202-9. doi: 10.1016/j.bbapap.2012.10.009. Epub 2012 Oct 22.
7
Letter to the editor: Stability of Random Forest importance measures.致编辑的信:随机森林重要性度量的稳定性。
Brief Bioinform. 2011 Jan;12(1):86-9. doi: 10.1093/bib/bbq011. Epub 2010 Mar 31.
8
Stability and fluctuations of amide hydrogen bonds in a bacterial cytochrome c: a molecular dynamics study.细菌细胞色素c中酰胺氢键的稳定性与波动:一项分子动力学研究
J Biol Inorg Chem. 2006 Jan;11(1):26-40. doi: 10.1007/s00775-005-0041-1. Epub 2005 Nov 16.
9
Rare fluctuations of native proteins sampled by equilibrium hydrogen exchange.
J Am Chem Soc. 2003 Dec 24;125(51):15686-7. doi: 10.1021/ja036523z.
10
CASTp: Computed Atlas of Surface Topography of proteins.CASTp:蛋白质表面拓扑结构计算图谱
Nucleic Acids Res. 2003 Jul 1;31(13):3352-5. doi: 10.1093/nar/gkg512.

一种预测发生氢交换的氨基酸残基的通用方法。

A General Method for Predicting Amino Acid Residues Experiencing Hydrogen Exchange.

作者信息

Wang Boshen, Perez-Rathke Alan, Li Renhao, Liang Jie

机构信息

Bioinformatics Program, Department of Bioengineering, University of Illinois at Chicago, Chicago, IL 60607, USA.

Aflac Cancer and Blood Disorders Center, Department of Pediatrics, Emory University School of Medicine, Atlanta, GA 30322, USA.

出版信息

IEEE EMBS Int Conf Biomed Health Inform. 2018 Mar;2018:341-344. doi: 10.1109/BHI.2018.8333438. Epub 2018 Apr 9.

DOI:10.1109/BHI.2018.8333438
PMID:29780972
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5957487/
Abstract

Information on protein hydrogen exchange can help delineate key regions involved in protein-protein interactions and provides important insight towards determining functional roles of genetic variants and their possible mechanisms in disease processes. Previous studies have shown that the degree of hydrogen exchange is affected by hydrogen bond formations, solvent accessibility, proximity to other residues, and experimental conditions. However, a general predictive method for identifying residues capable of hydrogen exchange transferable to a broad set of proteins is lacking. We have developed a machine learning method based on random forest that can predict whether a residue experiences hydrogen exchange. Using data from the Start2Fold database, which contains information on 13,306 residues (3,790 of which experience hydrogen exchange and 9,516 which do not exchange), our method achieves good performance. Specifically, we achieve an overall out-of-bag (OOB) error, an unbiased estimate of the test set error, of 20.3 percent. Using a randomly selected test data set consisting of 500 residues experiencing hydrogen exchange and 500 which do not, our method achieves an accuracy of 0.79, a recall of 0.74, a precision of 0.82, and an F1 score of 0.78.

摘要

蛋白质氢交换的信息有助于描绘参与蛋白质-蛋白质相互作用的关键区域,并为确定基因变异的功能作用及其在疾病过程中的可能机制提供重要见解。先前的研究表明,氢交换的程度受氢键形成、溶剂可及性、与其他残基的接近程度以及实验条件的影响。然而,目前缺乏一种可广泛应用于多种蛋白质的、用于识别能够进行氢交换的残基的通用预测方法。我们开发了一种基于随机森林的机器学习方法,该方法可以预测一个残基是否会发生氢交换。利用来自Start2Fold数据库的数据(该数据库包含13306个残基的信息,其中3790个残基发生氢交换,9516个残基不发生交换),我们的方法取得了良好的性能。具体而言,我们得到的总体袋外(OOB)误差(测试集误差的无偏估计)为20.3%。使用一个由500个发生氢交换的残基和500个不发生氢交换的残基组成的随机选择的测试数据集,我们的方法实现了0.79的准确率、0.74的召回率、0.82的精确率和0.78的F1分数。