• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过解决基于机器学习预测器的输出偏好问题,提高跨膜螺旋蛋白残基真实相对可及表面积的预测能力。

Prediction Enhancement of Residue Real-Value Relative Accessible Surface Area in Transmembrane Helical Proteins by Solving the Output Preference Problem of Machine Learning-Based Predictors.

机构信息

Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China , Shanghai 200240, China.

出版信息

J Chem Inf Model. 2015 Nov 23;55(11):2464-74. doi: 10.1021/acs.jcim.5b00246. Epub 2015 Oct 20.

DOI:10.1021/acs.jcim.5b00246
PMID:26455366
Abstract

The α-helical transmembrane proteins constitute 25% of the entire human proteome space and are difficult targets in high-resolution wet-lab structural studies, calling for accurate computational predictors. We present a novel sequence-based method called MemBrain-Rasa to predict relative solvent accessibility surface area (rASA) from primary sequences. MemBrain-Rasa features by an ensemble prediction protocol composed of a statistical machine-learning engine, which is trained in the sequential feature space, and a segment template similarity-based engine, which is constructed with solved structures and sequence alignment. We locally constructed a comprehensive database of residue relative solvent accessibility surface area from the solved protein 3D structures in the PDB database. It is searched against for segment templates that are expected to be structurally similar to the query sequence's segments. The segment template-based prediction is then fused with the support vector regression outputs using knowledge rules. Our experiments show that pure machine learning output cannot cover the entire rASA solution space and will have a serious prediction preference problem due to the relatively small size of membrane protein structures that can be used as the training samples. The template-based engine solves this problem very well, resulting in significant improvement of the prediction performance. MemBrain-Rasa achieves a Pearson correlation coefficient of 0.733 and mean absolute error of 13.593 on the benchmark dataset, which are 26.4% and 26.1% better than existing predictors. MemBrain-Rasa represents a new progress in structure modeling of α-helical transmembrane proteins. MemBrain-Rasa is available at www.csbio.sjtu.edu.cn/bioinf/MemBrain/.

摘要

α-螺旋跨膜蛋白构成了人类蛋白质组空间的 25%,是高分辨率湿实验室结构研究中的困难目标,因此需要准确的计算预测器。我们提出了一种新的基于序列的方法,称为 MemBrain-Rasa,用于从原始序列预测相对溶剂可及表面积 (rASA)。MemBrain-Rasa 的特点是采用集成预测协议,该协议由统计机器学习引擎组成,该引擎在顺序特征空间中进行训练,以及基于片段模板相似性的引擎,该引擎使用已解决的结构和序列比对构建。我们在本地构建了一个包含 PDB 数据库中已解决蛋白质 3D 结构的残基相对溶剂可及表面积的综合数据库。它用于搜索预期与查询序列片段在结构上相似的片段模板。然后使用知识规则将基于片段的预测与支持向量回归输出融合。我们的实验表明,纯机器学习输出不能涵盖整个 rASA 解决方案空间,并且由于可以用作训练样本的膜蛋白结构相对较小,因此会出现严重的预测偏好问题。基于模板的引擎很好地解决了这个问题,从而显著提高了预测性能。MemBrain-Rasa 在基准数据集上实现了 0.733 的 Pearson 相关系数和 13.593 的平均绝对误差,分别比现有预测器好 26.4%和 26.1%。MemBrain-Rasa 代表了α-螺旋跨膜蛋白结构建模的新进展。MemBrain-Rasa 可在 www.csbio.sjtu.edu.cn/bioinf/MemBrain/ 上获得。

相似文献

1
Prediction Enhancement of Residue Real-Value Relative Accessible Surface Area in Transmembrane Helical Proteins by Solving the Output Preference Problem of Machine Learning-Based Predictors.通过解决基于机器学习预测器的输出偏好问题,提高跨膜螺旋蛋白残基真实相对可及表面积的预测能力。
J Chem Inf Model. 2015 Nov 23;55(11):2464-74. doi: 10.1021/acs.jcim.5b00246. Epub 2015 Oct 20.
2
MemBrain: An Easy-to-Use Online Webserver for Transmembrane Protein Structure Prediction.MemBrain:一个易于使用的用于跨膜蛋白结构预测的在线网络服务器。
Nanomicro Lett. 2018;10(1):2. doi: 10.1007/s40820-017-0156-2. Epub 2017 Sep 27.
3
Topology Prediction Improvement of α-helical Transmembrane Proteins Through Helix-tail Modeling and Multiscale Deep Learning Fusion.通过螺旋-尾部建模和多尺度深度学习融合提高 α-螺旋跨膜蛋白的拓扑预测。
J Mol Biol. 2020 Feb 14;432(4):1279-1296. doi: 10.1016/j.jmb.2019.12.007. Epub 2019 Dec 21.
4
Enhancing the Prediction of Transmembrane β-Barrel Segments with Chain Learning and Feature Sparse Representation.利用链学习和特征稀疏表示增强跨膜β桶片段的预测
IEEE/ACM Trans Comput Biol Bioinform. 2016 Nov-Dec;13(6):1016-1026. doi: 10.1109/TCBB.2016.2528000. Epub 2016 Feb 11.
5
MemBrain-contact 2.0: a new two-stage machine learning model for the prediction enhancement of transmembrane protein residue contacts in the full chain.MemBrain-contact 2.0:一种用于增强全链跨膜蛋白残基接触预测的新型两阶段机器学习模型。
Bioinformatics. 2018 Jan 15;34(2):230-238. doi: 10.1093/bioinformatics/btx593.
6
High-accuracy prediction of transmembrane inter-helix contacts and application to GPCR 3D structure modeling.高精准度跨膜螺旋间接触预测及其在 GPCR 三维结构建模中的应用。
Bioinformatics. 2013 Oct 15;29(20):2579-87. doi: 10.1093/bioinformatics/btt440. Epub 2013 Aug 14.
7
MemBrain: improving the accuracy of predicting transmembrane helices.MemBrain:提高预测跨膜螺旋的准确性。
PLoS One. 2008 Jun 11;3(6):e2399. doi: 10.1371/journal.pone.0002399.
8
Enhanced recognition of protein transmembrane domains with prediction-based structural profiles.利用基于预测的结构轮廓增强对蛋白质跨膜结构域的识别。
Bioinformatics. 2006 Feb 1;22(3):303-9. doi: 10.1093/bioinformatics/bti784. Epub 2005 Nov 17.
9
Lipid exposure prediction enhances the inference of rotational angles of transmembrane helices.脂质暴露预测增强了跨膜螺旋旋转角度的推断。
BMC Bioinformatics. 2013 Oct 11;14:304. doi: 10.1186/1471-2105-14-304.
10
Constructing query-driven dynamic machine learning model with application to protein-ligand binding sites prediction.构建查询驱动的动态机器学习模型及其在蛋白质-配体结合位点预测中的应用。
IEEE Trans Nanobioscience. 2015 Jan;14(1):45-58. doi: 10.1109/TNB.2015.2394328.

引用本文的文献

1
TMP- SSurface2: A Novel Deep Learning-Based Surface Accessibility Predictor for Transmembrane Protein Sequence.TMP - SSurface2:一种基于深度学习的新型跨膜蛋白序列表面可及性预测工具。
Front Genet. 2021 Mar 15;12:656140. doi: 10.3389/fgene.2021.656140. eCollection 2021.
2
TMPSS: A Deep Learning-Based Predictor for Secondary Structure and Topology Structure Prediction of Alpha-Helical Transmembrane Proteins.TMPSS:一种基于深度学习的α-螺旋跨膜蛋白二级结构和拓扑结构预测工具
Front Bioeng Biotechnol. 2021 Jan 25;8:629937. doi: 10.3389/fbioe.2020.629937. eCollection 2020.
3
Approaches to ab initio molecular replacement of α-helical transmembrane proteins.
从头开始的α-螺旋跨膜蛋白的分子置换方法。
Acta Crystallogr D Struct Biol. 2017 Dec 1;73(Pt 12):985-996. doi: 10.1107/S2059798317016436. Epub 2017 Nov 22.
4
Predictive Structure and Topology of Peroxisomal ATP-Binding Cassette (ABC) Transporters.过氧化物酶体ATP结合盒(ABC)转运蛋白的预测结构与拓扑学
Int J Mol Sci. 2017 Jul 22;18(7):1593. doi: 10.3390/ijms18071593.