• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用 PSSM 特征的堆叠稀疏自编码器改进自相互作用蛋白的预测。

Improving Prediction of Self-interacting Proteins Using Stacked Sparse Auto-Encoder with PSSM profiles.

机构信息

University of Chinese Academy of Sciences, Beijing 100049, China.

Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Science, Urumqi 830011, China.

出版信息

Int J Biol Sci. 2018 May 23;14(8):983-991. doi: 10.7150/ijbs.23817. eCollection 2018.

DOI:10.7150/ijbs.23817
PMID:29989064
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6036743/
Abstract

Self-interacting proteins (SIPs) play a significant role in the execution of most important molecular processes in cells, such as signal transduction, gene expression regulation, immune response and enzyme activation. Although the traditional experimental methods can be used to generate SIPs data, it is very expensive and time-consuming based only on biological technique. Therefore, it is important and urgent to develop an efficient computational method for SIPs detection. In this study, we present a novel SIPs identification method based on machine learning technology by combing the Zernike Moments (ZMs) descriptor on Position Specific Scoring Matrix (PSSM) with Probabilistic Classification Vector Machines (PCVM) and Stacked Sparse Auto-Encoder (SSAE). More specifically, an efficient feature extraction technique called ZMs is firstly utilized to generate feature vectors on Position Specific Scoring Matrix (PSSM); Then, Deep neural network is employed for reducing the feature dimensions and noise; Finally, the Probabilistic Classification Vector Machine is used to execute the classification. The prediction performance of the proposed method is evaluated on and SIPs datasets via cross-validation. The experimental results indicate that the proposed method can achieve good accuracies of 92.55% and 97.47%, respectively. To further evaluate the advantage of our scheme for SIPs prediction, we also compared the PCVM classifier with the Support Vector Machine (SVM) and other existing techniques on the same data sets. Comparison results reveal that the proposed strategy is outperforms other methods and could be a used tool for identifying SIPs.

摘要

自相互作用蛋白 (SIPs) 在细胞中执行大多数重要的分子过程中发挥着重要作用,例如信号转导、基因表达调控、免疫反应和酶激活。虽然传统的实验方法可用于生成 SIPs 数据,但仅基于生物技术,其成本非常高且耗时。因此,开发一种有效的 SIPs 检测计算方法非常重要和紧迫。在这项研究中,我们提出了一种基于机器学习技术的新型 SIPs 识别方法,该方法结合了位置特异性评分矩阵 (PSSM) 上的 Zernike 矩 (ZMs) 描述符与概率分类向量机 (PCVM) 和堆叠稀疏自动编码器 (SSAE)。更具体地说,首先利用一种称为 ZMs 的高效特征提取技术,在位置特异性评分矩阵 (PSSM) 上生成特征向量;然后,采用深度神经网络来降低特征维度和噪声;最后,使用概率分类向量机进行分类。通过交叉验证,在 和 数据集上评估了所提出方法的预测性能。实验结果表明,所提出的方法分别可以达到 92.55%和 97.47%的良好精度。为了进一步评估我们的 SIPs 预测方案的优势,我们还在相同的数据集中将 PCVM 分类器与支持向量机 (SVM) 和其他现有技术进行了比较。比较结果表明,所提出的策略优于其他方法,可作为识别 SIPs 的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e27/6036743/455a9f71a8cb/ijbsv14p0983g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e27/6036743/79856144a964/ijbsv14p0983g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e27/6036743/4ab4dd70cfa3/ijbsv14p0983g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e27/6036743/6814cd383c0a/ijbsv14p0983g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e27/6036743/1e681251d134/ijbsv14p0983g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e27/6036743/32842d3ed61f/ijbsv14p0983g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e27/6036743/455a9f71a8cb/ijbsv14p0983g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e27/6036743/79856144a964/ijbsv14p0983g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e27/6036743/4ab4dd70cfa3/ijbsv14p0983g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e27/6036743/6814cd383c0a/ijbsv14p0983g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e27/6036743/1e681251d134/ijbsv14p0983g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e27/6036743/32842d3ed61f/ijbsv14p0983g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e27/6036743/455a9f71a8cb/ijbsv14p0983g006.jpg

相似文献

1
Improving Prediction of Self-interacting Proteins Using Stacked Sparse Auto-Encoder with PSSM profiles.利用 PSSM 特征的堆叠稀疏自编码器改进自相互作用蛋白的预测。
Int J Biol Sci. 2018 May 23;14(8):983-991. doi: 10.7150/ijbs.23817. eCollection 2018.
2
Prediction of protein self-interactions using stacked long short-term memory from protein sequences information.利用来自蛋白质序列信息的堆叠长短期记忆预测蛋白质自相互作用。
BMC Syst Biol. 2018 Dec 21;12(Suppl 8):129. doi: 10.1186/s12918-018-0647-x.
3
PCVMZM: Using the Probabilistic Classification Vector Machines Model Combined with a Zernike Moments Descriptor to Predict Protein-Protein Interactions from Protein Sequences.PCVMZM:使用概率分类向量机模型结合泽尼克矩描述符从蛋白质序列预测蛋白质-蛋白质相互作用
Int J Mol Sci. 2017 May 11;18(5):1029. doi: 10.3390/ijms18051029.
4
Identification of self-interacting proteins by integrating random projection classifier and finite impulse response filter.通过整合随机投影分类器和有限脉冲响应滤波器来鉴定自我相互作用的蛋白质。
BMC Genomics. 2019 Dec 27;20(Suppl 13):928. doi: 10.1186/s12864-019-6301-1.
5
Predicting protein-protein interactions from protein sequences by a stacked sparse autoencoder deep neural network.通过堆叠式稀疏自动编码器深度神经网络从蛋白质序列预测蛋白质-蛋白质相互作用。
Mol Biosyst. 2017 Jun 27;13(7):1336-1344. doi: 10.1039/c7mb00188f.
6
Detection of Interactions between Proteins by Using Legendre Moments Descriptor to Extract Discriminatory Information Embedded in PSSM.利用勒让德矩描述符提取PSSM中嵌入的鉴别信息来检测蛋白质之间的相互作用
Molecules. 2017 Aug 18;22(8):1366. doi: 10.3390/molecules22081366.
7
Identification of self-interacting proteins by exploring evolutionary information embedded in PSI-BLAST-constructed position specific scoring matrix.通过探索嵌入在PSI-BLAST构建的位置特异性评分矩阵中的进化信息来鉴定自相互作用蛋白。
Oncotarget. 2016 Dec 13;7(50):82440-82449. doi: 10.18632/oncotarget.12517.
8
PSPEL: In Silico Prediction of Self-Interacting Proteins from Amino Acids Sequences Using Ensemble Learning.PSPEL:使用集成学习从氨基酸序列预测蛋白质相互作用。
IEEE/ACM Trans Comput Biol Bioinform. 2017 Sep-Oct;14(5):1165-1172. doi: 10.1109/TCBB.2017.2649529. Epub 2017 Jan 10.
9
Robust and accurate prediction of self-interacting proteins from protein sequence information by exploiting weighted sparse representation based classifier.利用基于加权稀疏表示的分类器从蛋白质序列信息中准确预测自我相互作用的蛋白质。
BMC Bioinformatics. 2022 Dec 1;23(Suppl 7):518. doi: 10.1186/s12859-022-04880-y.
10
Highly accurate prediction of protein self-interactions by incorporating the average block and PSSM information into the general PseAAC.通过将平均块和位置特异性得分矩阵(PSSM)信息纳入通用伪氨基酸组成(PseAAC)来实现对蛋白质自相互作用的高度准确预测。
J Theor Biol. 2017 Nov 7;432:80-86. doi: 10.1016/j.jtbi.2017.08.009. Epub 2017 Aug 9.

引用本文的文献

1
ISTRF: Identification of sucrose transporter using random forest.ISTRF:利用随机森林鉴定蔗糖转运蛋白
Front Genet. 2022 Sep 12;13:1012828. doi: 10.3389/fgene.2022.1012828. eCollection 2022.
2
Predicting Protein-Protein Interactions Based on Ensemble Learning-Based Model from Protein Sequence.基于蛋白质序列的集成学习模型预测蛋白质-蛋白质相互作用
Biology (Basel). 2022 Jun 30;11(7):995. doi: 10.3390/biology11070995.
3
SIPGCN: A Novel Deep Learning Model for Predicting Self-Interacting Proteins from Sequence Information Using Graph Convolutional Networks.

本文引用的文献

1
Predicting protein-protein interactions from protein sequences by a stacked sparse autoencoder deep neural network.通过堆叠式稀疏自动编码器深度神经网络从蛋白质序列预测蛋白质-蛋白质相互作用。
Mol Biosyst. 2017 Jun 27;13(7):1336-1344. doi: 10.1039/c7mb00188f.
2
PCVMZM: Using the Probabilistic Classification Vector Machines Model Combined with a Zernike Moments Descriptor to Predict Protein-Protein Interactions from Protein Sequences.PCVMZM:使用概率分类向量机模型结合泽尼克矩描述符从蛋白质序列预测蛋白质-蛋白质相互作用
Int J Mol Sci. 2017 May 11;18(5):1029. doi: 10.3390/ijms18051029.
3
Incorporation of Efficient Second-Order Solvers Into Latent Factor Models for Accurate Prediction of Missing QoS Data.
SIPGCN:一种使用图卷积网络从序列信息预测自相互作用蛋白的新型深度学习模型。
Biomedicines. 2022 Jun 29;10(7):1543. doi: 10.3390/biomedicines10071543.
4
NLPEI: A Novel Self-Interacting Protein Prediction Model Based on Natural Language Processing and Evolutionary Information.NLPEI:一种基于自然语言处理和进化信息的新型自相互作用蛋白预测模型。
Evol Bioinform Online. 2020 Dec 26;16:1176934320984171. doi: 10.1177/1176934320984171. eCollection 2020.
5
BGFE: A Deep Learning Model for ncRNA-Protein Interaction Predictions Based on Improved Sequence Information.BGFE:一种基于改进序列信息的 ncRNA-蛋白质相互作用预测深度学习模型。
Int J Mol Sci. 2019 Feb 23;20(4):978. doi: 10.3390/ijms20040978.
6
Prediction of Self-Interacting Proteins from Protein Sequence Information Based on Random Projection Model and Fast Fourier Transform.基于随机投影模型和快速傅里叶变换的蛋白质序列信息自相互作用蛋白预测
Int J Mol Sci. 2019 Feb 21;20(4):930. doi: 10.3390/ijms20040930.
7
Special issue on Computational Resources and Methods in Biological Sciences.生物科学中的计算资源和方法特刊
Int J Biol Sci. 2018 Jul 1;14(8):807-810. doi: 10.7150/ijbs.27554. eCollection 2018.
将高效二阶求解器纳入潜在因素模型,以准确预测缺失的 QoS 数据。
IEEE Trans Cybern. 2018 Apr;48(4):1216-1228. doi: 10.1109/TCYB.2017.2685521. Epub 2017 Apr 14.
4
SPAR: a random forest-based predictor for self-interacting proteins with fine-grained domain information.SPAR:一种基于随机森林的具有细粒度结构域信息的自相互作用蛋白预测器。
Amino Acids. 2016 Jul;48(7):1655-65. doi: 10.1007/s00726-016-2226-z. Epub 2016 Apr 13.
5
Prediction of Protein Structural Classes for Low-Similarity Sequences Based on Consensus Sequence and Segmented PSSM.基于一致序列和分段位置特异性得分矩阵预测低相似性序列的蛋白质结构类别
Comput Math Methods Med. 2015;2015:370756. doi: 10.1155/2015/370756. Epub 2015 Dec 15.
6
Identification of Proteins Implicated in the Increased Heart Rate in ShenSongYangXin-Treated Bradycardia Rabbits by iTRAQ-Based Quantitative Proteomics.基于iTRAQ定量蛋白质组学鉴定参松养心治疗家兔心动过缓时心率增加相关的蛋白质
Evid Based Complement Alternat Med. 2015;2015:385953. doi: 10.1155/2015/385953. Epub 2015 Dec 3.
7
Stacked Sparse Autoencoder (SSAE) for Nuclei Detection on Breast Cancer Histopathology Images.用于乳腺癌组织病理学图像细胞核检测的堆叠稀疏自动编码器(SSAE)
IEEE Trans Med Imaging. 2016 Jan;35(1):119-30. doi: 10.1109/TMI.2015.2458702. Epub 2015 Jul 20.
8
Inverse-Free Extreme Learning Machine With Optimal Information Updating.具有最优信息更新的无逆极限学习机。
IEEE Trans Cybern. 2016 May;46(5):1229-41. doi: 10.1109/TCYB.2015.2434841. Epub 2015 Jun 1.
9
A Nonnegative Latent Factor Model for Large-Scale Sparse Matrices in Recommender Systems via Alternating Direction Method.通过交替方向法对推荐系统中大规模稀疏矩阵进行非负潜在因子建模。
IEEE Trans Neural Netw Learn Syst. 2016 Mar;27(3):579-92. doi: 10.1109/TNNLS.2015.2415257. Epub 2015 May 22.
10
LocFuse: human protein-protein interaction prediction via classifier fusion using protein localization information.LocFuse:利用蛋白质定位信息通过分类器融合进行人类蛋白质-蛋白质相互作用预测
Genomics. 2014 Dec;104(6 Pt B):496-503. doi: 10.1016/j.ygeno.2014.10.006. Epub 2014 Oct 16.