• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用加权极限学习机结合蛋白质进化信息预测蛋白质自相互作用的计算方法。

Computational methods using weighed-extreme learning machine to predict protein self-interactions with protein evolutionary information.

作者信息

An Ji-Yong, Zhang Lei, Zhou Yong, Zhao Yu-Jun, Wang Da-Fu

机构信息

School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, 21116, Jiangsu, China.

出版信息

J Cheminform. 2017 Aug 18;9(1):47. doi: 10.1186/s13321-017-0233-z.

DOI:10.1186/s13321-017-0233-z
PMID:29086182
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5561767/
Abstract

Self-interactions Proteins (SIPs) is important for their biological activity owing to the inherent interaction amongst their secondary structures or domains. However, due to the limitations of experimental Self-interactions detection, one major challenge in the study of prediction SIPs is how to exploit computational approaches for SIPs detection based on evolutionary information contained protein sequence. In the work, we presented a novel computational approach named WELM-LAG, which combined the Weighed-Extreme Learning Machine (WELM) classifier with Local Average Group (LAG) to predict SIPs based on protein sequence. The major improvement of our method lies in presenting an effective feature extraction method used to represent candidate Self-interactions proteins by exploring the evolutionary information embedded in PSI-BLAST-constructed position specific scoring matrix (PSSM); and then employing a reliable and robust WELM classifier to carry out classification. In addition, the Principal Component Analysis (PCA) approach is used to reduce the impact of noise. The WELM-LAG method gave very high average accuracies of 92.94 and 96.74% on yeast and human datasets, respectively. Meanwhile, we compared it with the state-of-the-art support vector machine (SVM) classifier and other existing methods on human and yeast datasets, respectively. Comparative results indicated that our approach is very promising and may provide a cost-effective alternative for predicting SIPs. In addition, we developed a freely available web server called WELM-LAG-SIPs to predict SIPs. The web server is available at http://219.219.62.123:8888/WELMLAG/ .

摘要

自相互作用蛋白(SIPs)因其二级结构或结构域之间的内在相互作用而对其生物学活性至关重要。然而,由于实验性自相互作用检测的局限性,预测SIPs研究中的一个主要挑战是如何基于蛋白质序列中包含的进化信息开发用于SIPs检测的计算方法。在这项工作中,我们提出了一种名为WELM-LAG的新型计算方法,该方法将加权极限学习机(WELM)分类器与局部平均组(LAG)相结合,以基于蛋白质序列预测SIPs。我们方法的主要改进在于提出了一种有效的特征提取方法,通过探索嵌入在PSI-BLAST构建的位置特异性评分矩阵(PSSM)中的进化信息来表示候选自相互作用蛋白;然后采用可靠且稳健的WELM分类器进行分类。此外,主成分分析(PCA)方法用于减少噪声的影响。WELM-LAG方法在酵母和人类数据集上分别给出了92.94%和96.74%的非常高的平均准确率。同时,我们分别在人类和酵母数据集上将其与最先进的支持向量机(SVM)分类器和其他现有方法进行了比较。比较结果表明我们的方法非常有前景,可能为预测SIPs提供一种经济高效的替代方案。此外,我们开发了一个名为WELM-LAG-SIPs的免费网络服务器来预测SIPs。该网络服务器可在http://219.219.62.123:8888/WELMLAG/ 获得。

相似文献

1
Computational methods using weighed-extreme learning machine to predict protein self-interactions with protein evolutionary information.使用加权极限学习机结合蛋白质进化信息预测蛋白质自相互作用的计算方法。
J Cheminform. 2017 Aug 18;9(1):47. doi: 10.1186/s13321-017-0233-z.
2
Identification of self-interacting proteins by exploring evolutionary information embedded in PSI-BLAST-constructed position specific scoring matrix.通过探索嵌入在PSI-BLAST构建的位置特异性评分矩阵中的进化信息来鉴定自相互作用蛋白。
Oncotarget. 2016 Dec 13;7(50):82440-82449. doi: 10.18632/oncotarget.12517.
3
An efficient computational method for predicting drug-target interactions using weighted extreme learning machine and speed up robot features.一种使用加权极限学习机和加速机器人特征预测药物-靶点相互作用的高效计算方法。
BioData Min. 2021 Jan 20;14(1):3. doi: 10.1186/s13040-021-00242-1.
4
Highly accurate prediction of protein self-interactions by incorporating the average block and PSSM information into the general PseAAC.通过将平均块和位置特异性得分矩阵(PSSM)信息纳入通用伪氨基酸组成(PseAAC)来实现对蛋白质自相互作用的高度准确预测。
J Theor Biol. 2017 Nov 7;432:80-86. doi: 10.1016/j.jtbi.2017.08.009. Epub 2017 Aug 9.
5
Predicting Self-Interacting Proteins Using a Recurrent Neural Network and Protein Evolutionary Information.使用递归神经网络和蛋白质进化信息预测自我相互作用蛋白质。
Evol Bioinform Online. 2020 May 28;16:1176934320924674. doi: 10.1177/1176934320924674. eCollection 2020.
6
Identification of self-interacting proteins by integrating random projection classifier and finite impulse response filter.通过整合随机投影分类器和有限脉冲响应滤波器来鉴定自我相互作用的蛋白质。
BMC Genomics. 2019 Dec 27;20(Suppl 13):928. doi: 10.1186/s12864-019-6301-1.
7
Robust and accurate prediction of protein self-interactions from amino acids sequence using evolutionary information.利用进化信息从氨基酸序列对蛋白质自身相互作用进行稳健且准确的预测。
Mol Biosyst. 2016 Nov 15;12(12):3702-3710. doi: 10.1039/c6mb00599c.
8
An Efficient Feature Extraction Technique Based on Local Coding PSSM and Multifeatures Fusion for Predicting Protein-Protein Interactions.一种基于局部编码位置特异性得分矩阵和多特征融合的高效蛋白质-蛋白质相互作用预测特征提取技术
Evol Bioinform Online. 2019 Oct 3;15:1176934319879920. doi: 10.1177/1176934319879920. eCollection 2019.
9
Improving protein-protein interactions prediction accuracy using protein evolutionary information and relevance vector machine model.利用蛋白质进化信息和相关向量机模型提高蛋白质-蛋白质相互作用预测准确性
Protein Sci. 2016 Oct;25(10):1825-33. doi: 10.1002/pro.2991. Epub 2016 Aug 9.
10
NLPEI: A Novel Self-Interacting Protein Prediction Model Based on Natural Language Processing and Evolutionary Information.NLPEI:一种基于自然语言处理和进化信息的新型自相互作用蛋白预测模型。
Evol Bioinform Online. 2020 Dec 26;16:1176934320984171. doi: 10.1177/1176934320984171. eCollection 2020.

引用本文的文献

1
PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles.PSSMCOOL:一个用于从PSSM谱生成基于进化的蛋白质序列描述符的综合R包。
Biol Methods Protoc. 2022 Mar 30;7(1):bpac008. doi: 10.1093/biomethods/bpac008. eCollection 2022.
2
An efficient computational method for predicting drug-target interactions using weighted extreme learning machine and speed up robot features.一种使用加权极限学习机和加速机器人特征预测药物-靶点相互作用的高效计算方法。
BioData Min. 2021 Jan 20;14(1):3. doi: 10.1186/s13040-021-00242-1.
3
Predicting Self-Interacting Proteins Using a Recurrent Neural Network and Protein Evolutionary Information.

本文引用的文献

1
Highly Accurate Prediction of Protein-Protein Interactions via Incorporating Evolutionary Information and Physicochemical Characteristics.通过整合进化信息和物理化学特征实现蛋白质-蛋白质相互作用的高精度预测
Int J Mol Sci. 2016 Aug 25;17(9):1396. doi: 10.3390/ijms17091396.
2
Sequence-based prediction of protein-protein interactions using weighted sparse representation model combined with global encoding.基于序列的蛋白质-蛋白质相互作用预测:结合全局编码的加权稀疏表示模型
BMC Bioinformatics. 2016 Apr 26;17(1):184. doi: 10.1186/s12859-016-1035-4.
3
SPAR: a random forest-based predictor for self-interacting proteins with fine-grained domain information.
使用递归神经网络和蛋白质进化信息预测自我相互作用蛋白质。
Evol Bioinform Online. 2020 May 28;16:1176934320924674. doi: 10.1177/1176934320924674. eCollection 2020.
4
Protein Interface Complementarity and Gene Duplication Improve Link Prediction of Protein-Protein Interaction Network.蛋白质界面互补性和基因复制改善蛋白质-蛋白质相互作用网络的链接预测
Front Genet. 2020 Apr 2;11:291. doi: 10.3389/fgene.2020.00291. eCollection 2020.
5
An Efficient Feature Extraction Technique Based on Local Coding PSSM and Multifeatures Fusion for Predicting Protein-Protein Interactions.一种基于局部编码位置特异性得分矩阵和多特征融合的高效蛋白质-蛋白质相互作用预测特征提取技术
Evol Bioinform Online. 2019 Oct 3;15:1176934319879920. doi: 10.1177/1176934319879920. eCollection 2019.
6
Sequence-based Prediction of Protein-Protein Interactions Using Gray Wolf Optimizer-Based Relevance Vector Machine.基于灰狼优化器的相关向量机的蛋白质-蛋白质相互作用的序列预测
Evol Bioinform Online. 2019 May 2;15:1176934319844522. doi: 10.1177/1176934319844522. eCollection 2019.
SPAR:一种基于随机森林的具有细粒度结构域信息的自相互作用蛋白预测器。
Amino Acids. 2016 Jul;48(7):1655-65. doi: 10.1007/s00726-016-2226-z. Epub 2016 Apr 13.
4
iPPBS-Opt: A Sequence-Based Ensemble Classifier for Identifying Protein-Protein Binding Sites by Optimizing Imbalanced Training Datasets.iPPBS-Opt:一种基于序列的集成分类器,通过优化不平衡训练数据集来识别蛋白质-蛋白质结合位点。
Molecules. 2016 Jan 19;21(1):E95. doi: 10.3390/molecules21010095.
5
Identification of protein-protein binding sites by incorporating the physicochemical properties and stationary wavelet transforms into pseudo amino acid composition.通过将物理化学性质和静态小波变换纳入伪氨基酸组成来鉴定蛋白质-蛋白质结合位点。
J Biomol Struct Dyn. 2016 Sep;34(9):1946-61. doi: 10.1080/07391102.2015.1095116. Epub 2015 Oct 29.
6
iPPI-Esml: An ensemble classifier for identifying the interactions of proteins by incorporating their physicochemical properties and wavelet transforms into PseAAC.iPPI-Esml:一种通过将蛋白质的物理化学性质和小波变换纳入伪氨基酸组成来识别蛋白质相互作用的集成分类器。
J Theor Biol. 2015 Jul 21;377:47-56. doi: 10.1016/j.jtbi.2015.04.011. Epub 2015 Apr 20.
7
Prediction of Protein-Protein Interactions with Physicochemical Descriptors and Wavelet Transform via Random Forests.通过随机森林利用物理化学描述符和小波变换预测蛋白质-蛋白质相互作用
J Lab Autom. 2016 Jun;21(3):368-77. doi: 10.1177/2211068215581487. Epub 2015 Apr 16.
8
LocFuse: human protein-protein interaction prediction via classifier fusion using protein localization information.LocFuse:利用蛋白质定位信息通过分类器融合进行人类蛋白质-蛋白质相互作用预测
Genomics. 2014 Dec;104(6 Pt B):496-503. doi: 10.1016/j.ygeno.2014.10.006. Epub 2014 Oct 16.
9
MatrixDB, the extracellular matrix interaction database: updated content, a new navigator and expanded functionalities.MatrixDB,细胞外基质相互作用数据库:更新内容、新导航器及扩展功能
Nucleic Acids Res. 2015 Jan;43(Database issue):D321-7. doi: 10.1093/nar/gku1091. Epub 2014 Nov 6.
10
UniProt: a hub for protein information.通用蛋白质数据库(UniProt):蛋白质信息中心。
Nucleic Acids Res. 2015 Jan;43(Database issue):D204-12. doi: 10.1093/nar/gku989. Epub 2014 Oct 27.