• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

COMSAT:基于支持向量机和混合整数线性规划的跨膜蛋白残基接触预测

COMSAT: Residue contact prediction of transmembrane proteins based on support vector machines and mixed integer linear programming.

作者信息

Zhang Huiling, Huang Qingsheng, Bei Zhendong, Wei Yanjie, Floudas Christodoulos A

机构信息

Centre for High Performance Computing, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, China.

Center for Cloud Computing, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, China.

出版信息

Proteins. 2016 Mar;84(3):332-48. doi: 10.1002/prot.24979. Epub 2016 Jan 20.

DOI:10.1002/prot.24979
PMID:26756402
Abstract

In this article, we present COMSAT, a hybrid framework for residue contact prediction of transmembrane (TM) proteins, integrating a support vector machine (SVM) method and a mixed integer linear programming (MILP) method. COMSAT consists of two modules: COMSAT_SVM which is trained mainly on position-specific scoring matrix features, and COMSAT_MILP which is an ab initio method based on optimization models. Contacts predicted by the SVM model are ranked by SVM confidence scores, and a threshold is trained to improve the reliability of the predicted contacts. For TM proteins with no contacts above the threshold, COMSAT_MILP is used. The proposed hybrid contact prediction scheme was tested on two independent TM protein sets based on the contact definition of 14 Å between Cα-Cα atoms. First, using a rigorous leave-one-protein-out cross validation on the training set of 90 TM proteins, an accuracy of 66.8%, a coverage of 12.3%, a specificity of 99.3% and a Matthews' correlation coefficient (MCC) of 0.184 were obtained for residue pairs that are at least six amino acids apart. Second, when tested on a test set of 87 TM proteins, the proposed method showed a prediction accuracy of 64.5%, a coverage of 5.3%, a specificity of 99.4% and a MCC of 0.106. COMSAT shows satisfactory results when compared with 12 other state-of-the-art predictors, and is more robust in terms of prediction accuracy as the length and complexity of TM protein increase. COMSAT is freely accessible at http://hpcc.siat.ac.cn/COMSAT/.

摘要

在本文中,我们介绍了COMSAT,这是一种用于跨膜(TM)蛋白残基接触预测的混合框架,它整合了支持向量机(SVM)方法和混合整数线性规划(MILP)方法。COMSAT由两个模块组成:主要基于位置特异性评分矩阵特征进行训练的COMSAT_SVM,以及基于优化模型的从头算方法COMSAT_MILP。SVM模型预测的接触通过SVM置信度得分进行排序,并训练一个阈值以提高预测接触的可靠性。对于阈值以上没有接触的TM蛋白,则使用COMSAT_MILP。基于Cα-Cα原子之间14 Å的接触定义,在两个独立的TM蛋白集上对所提出的混合接触预测方案进行了测试。首先,在90个TM蛋白的训练集上使用严格的留一蛋白交叉验证,对于至少相隔六个氨基酸的残基对,获得了66.8%的准确率、12.3%的覆盖率、99.3%的特异性和0.184的马修斯相关系数(MCC)。其次,在87个TM蛋白的测试集上进行测试时,所提出的方法显示出64.5%的预测准确率、5.3%的覆盖率、99.4%的特异性和0.106的MCC。与其他12种最先进的预测器相比,COMSAT显示出令人满意的结果,并且随着TM蛋白的长度和复杂性增加,在预测准确性方面更具稳健性。可通过http://hpcc.siat.ac.cn/COMSAT/免费访问COMSAT。

相似文献

1
COMSAT: Residue contact prediction of transmembrane proteins based on support vector machines and mixed integer linear programming.COMSAT:基于支持向量机和混合整数线性规划的跨膜蛋白残基接触预测
Proteins. 2016 Mar;84(3):332-48. doi: 10.1002/prot.24979. Epub 2016 Jan 20.
2
Prediction of transmembrane regions of beta-barrel proteins using ANN- and SVM-based methods.使用基于人工神经网络和支持向量机的方法预测β-桶状蛋白的跨膜区域。
Proteins. 2004 Jul 1;56(1):11-8. doi: 10.1002/prot.20092.
3
COMTOP: Protein Residue-Residue Contact Prediction through Mixed Integer Linear Optimization.COMTOP:通过混合整数线性优化进行蛋白质残基-残基接触预测。
Membranes (Basel). 2021 Jun 30;11(7):503. doi: 10.3390/membranes11070503.
4
Improved method for predicting beta-turn using support vector machine.使用支持向量机预测β-转角的改进方法。
Bioinformatics. 2005 May 15;21(10):2370-4. doi: 10.1093/bioinformatics/bti358. Epub 2005 Mar 29.
5
Prediction of beta-turns at over 80% accuracy based on an ensemble of predicted secondary structures and multiple alignments.基于预测的二级结构集合和多重比对,以超过80%的准确率预测β转角。
BMC Bioinformatics. 2008 Oct 10;9:430. doi: 10.1186/1471-2105-9-430.
6
Evaluation of methods for predicting the topology of beta-barrel outer membrane proteins and a consensus prediction method.β-桶状外膜蛋白拓扑结构预测方法的评估及一种共识预测方法
BMC Bioinformatics. 2005 Jan 12;6:7. doi: 10.1186/1471-2105-6-7.
7
Protein-RNA interface residue prediction using machine learning: an assessment of the state of the art.基于机器学习的蛋白质-RNA 界面残基预测:现状评估。
BMC Bioinformatics. 2012 May 10;13:89. doi: 10.1186/1471-2105-13-89.
8
Enhanced recognition of protein transmembrane domains with prediction-based structural profiles.利用基于预测的结构轮廓增强对蛋白质跨膜结构域的识别。
Bioinformatics. 2006 Feb 1;22(3):303-9. doi: 10.1093/bioinformatics/bti784. Epub 2005 Nov 17.
9
A novel method of protein secondary structure prediction with high segment overlap measure: support vector machine approach.一种具有高片段重叠度量的蛋白质二级结构预测新方法:支持向量机方法。
J Mol Biol. 2001 Apr 27;308(2):397-407. doi: 10.1006/jmbi.2001.4580.
10
Transmembrane protein topology prediction using support vector machines.使用支持向量机进行跨膜蛋白拓扑结构预测。
BMC Bioinformatics. 2009 May 26;10:159. doi: 10.1186/1471-2105-10-159.

引用本文的文献

1
Inter-Residue Distance Prediction From Duet Deep Learning Models.基于二重深度学习模型的残基间距离预测
Front Genet. 2022 May 16;13:887491. doi: 10.3389/fgene.2022.887491. eCollection 2022.
2
COMTOP: Protein Residue-Residue Contact Prediction through Mixed Integer Linear Optimization.COMTOP:通过混合整数线性优化进行蛋白质残基-残基接触预测。
Membranes (Basel). 2021 Jun 30;11(7):503. doi: 10.3390/membranes11070503.
3
Evaluation of residue-residue contact prediction methods: From retrospective to prospective.评估残基残基接触预测方法:从回顾性到前瞻性。
PLoS Comput Biol. 2021 May 24;17(5):e1009027. doi: 10.1371/journal.pcbi.1009027. eCollection 2021 May.
4
IMPContact: An Interhelical Residue Contact Prediction Method.IMPContact:一种螺旋间残基接触预测方法。
Biomed Res Int. 2020 Mar 25;2020:4569037. doi: 10.1155/2020/4569037. eCollection 2020.
5
DeepBindPoc: a deep learning method to rank ligand binding pockets using molecular vector representation.DeepBindPoc:一种使用分子向量表示对配体结合口袋进行排名的深度学习方法。
PeerJ. 2020 Apr 6;8:e8864. doi: 10.7717/peerj.8864. eCollection 2020.
6
Predicting protein residue-residue contacts using random forests and deep networks.利用随机森林和深度网络预测蛋白质残基-残基接触。
BMC Bioinformatics. 2019 Mar 14;20(Suppl 2):100. doi: 10.1186/s12859-019-2627-6.
7
Folding Membrane Proteins by Deep Transfer Learning.利用深度迁移学习折叠膜蛋白。
Cell Syst. 2017 Sep 27;5(3):202-211.e3. doi: 10.1016/j.cels.2017.09.001.
8
Improved protein structure reconstruction using secondary structures, contacts at higher distance thresholds, and non-contacts.利用二级结构、更高距离阈值下的接触以及非接触改进蛋白质结构重建。
BMC Bioinformatics. 2017 Aug 29;18(1):380. doi: 10.1186/s12859-017-1807-5.
9
Co-evolution techniques are reshaping the way we do structural bioinformatics.协同进化技术正在重塑我们进行结构生物信息学研究的方式。
F1000Res. 2017 Jul 25;6:1224. doi: 10.12688/f1000research.11543.1. eCollection 2017.
10
Membrane protein contact and structure prediction using co-evolution in conjunction with machine learning.结合机器学习利用协同进化进行膜蛋白接触和结构预测。
PLoS One. 2017 May 24;12(5):e0177866. doi: 10.1371/journal.pone.0177866. eCollection 2017.