• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于预测 HIV-1 与人类蛋白质相互作用的概率加权集成转移学习。

Probability weighted ensemble transfer learning for predicting interactions between HIV-1 and human proteins.

机构信息

Software College, Shenyang Normal University, Shenyang, China.

出版信息

PLoS One. 2013 Nov 18;8(11):e79606. doi: 10.1371/journal.pone.0079606. eCollection 2013.

DOI:10.1371/journal.pone.0079606
PMID:24260261
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3832534/
Abstract

Reconstruction of host-pathogen protein interaction networks is of great significance to reveal the underlying microbic pathogenesis. However, the current experimentally-derived networks are generally small and should be augmented by computational methods for less-biased biological inference. From the point of view of computational modelling, data scarcity, data unavailability and negative data sampling are the three major problems for host-pathogen protein interaction networks reconstruction. In this work, we are motivated to address the three concerns and propose a probability weighted ensemble transfer learning model for HIV-human protein interaction prediction (PWEN-TLM), where support vector machine (SVM) is adopted as the individual classifier of the ensemble model. In the model, data scarcity and data unavailability are tackled by homolog knowledge transfer. The importance of homolog knowledge is measured by the ROC-AUC metric of the individual classifiers, whose outputs are probability weighted to yield the final decision. In addition, we further validate the assumption that only the homolog knowledge is sufficient to train a satisfactory model for host-pathogen protein interaction prediction. Thus the model is more robust against data unavailability with less demanding data constraint. As regards with negative data construction, experiments show that exclusiveness of subcellular co-localized proteins is unbiased and more reliable than random sampling. Last, we conduct analysis of overlapped predictions between our model and the existing models, and apply the model to novel host-pathogen PPIs recognition for further biological research.

摘要

宿主-病原体蛋白相互作用网络的重建对于揭示潜在的微生物发病机制具有重要意义。然而,目前实验得出的网络通常较小,应该通过计算方法来进行扩充,以进行更无偏的生物学推断。从计算模型的角度来看,宿主-病原体蛋白相互作用网络的重建存在数据稀缺、数据不可用和负样本数据采样三个主要问题。在这项工作中,我们旨在解决这三个问题,并提出了一种用于 HIV-人类蛋白相互作用预测的概率加权集成转移学习模型(PWEN-TLM),其中支持向量机(SVM)被用作集成模型的个体分类器。在该模型中,通过同源知识转移来解决数据稀缺和数据不可用的问题。同源知识的重要性通过个体分类器的 ROC-AUC 度量来衡量,其输出被概率加权以得出最终决策。此外,我们进一步验证了这样的假设,即只有同源知识就足以训练出用于宿主-病原体蛋白相互作用预测的令人满意的模型。因此,该模型对数据不可用具有更强的鲁棒性,对数据约束的要求也较低。关于负样本数据的构建,实验表明,亚细胞共定位蛋白的排他性是无偏的,比随机采样更可靠。最后,我们分析了我们的模型与现有模型之间的重叠预测,并将该模型应用于新的宿主-病原体 PPIs 识别,以进行进一步的生物学研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/37a2/3832534/b4f8428e255c/pone.0079606.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/37a2/3832534/efe325f73392/pone.0079606.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/37a2/3832534/5dd088419bbd/pone.0079606.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/37a2/3832534/c7f8beca6e37/pone.0079606.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/37a2/3832534/b887c196b16b/pone.0079606.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/37a2/3832534/6c6bd5f59946/pone.0079606.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/37a2/3832534/0eab035df0e7/pone.0079606.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/37a2/3832534/b4f8428e255c/pone.0079606.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/37a2/3832534/efe325f73392/pone.0079606.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/37a2/3832534/5dd088419bbd/pone.0079606.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/37a2/3832534/c7f8beca6e37/pone.0079606.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/37a2/3832534/b887c196b16b/pone.0079606.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/37a2/3832534/6c6bd5f59946/pone.0079606.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/37a2/3832534/0eab035df0e7/pone.0079606.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/37a2/3832534/b4f8428e255c/pone.0079606.g007.jpg

相似文献

1
Probability weighted ensemble transfer learning for predicting interactions between HIV-1 and human proteins.用于预测 HIV-1 与人类蛋白质相互作用的概率加权集成转移学习。
PLoS One. 2013 Nov 18;8(11):e79606. doi: 10.1371/journal.pone.0079606. eCollection 2013.
2
A new sequence based encoding for prediction of host-pathogen protein interactions.一种基于新序列的编码方法,用于预测宿主-病原体蛋白相互作用。
Comput Biol Chem. 2019 Feb;78:170-177. doi: 10.1016/j.compbiolchem.2018.12.001. Epub 2018 Dec 5.
3
Computational reconstruction of proteome-wide protein interaction networks between HTLV retroviruses and Homo sapiens.计算重建 HTLV 逆转录病毒和智人物种之间的蛋白质组范围的蛋白质相互作用网络。
BMC Bioinformatics. 2014 Jul 18;15(1):245. doi: 10.1186/1471-2105-15-245.
4
AdaBoost based multi-instance transfer learning for predicting proteome-wide interactions between Salmonella and human proteins.基于AdaBoost的多实例迁移学习用于预测沙门氏菌与人类蛋白质之间的全蛋白质组相互作用。
PLoS One. 2014 Oct 17;9(10):e110488. doi: 10.1371/journal.pone.0110488. eCollection 2014.
5
SVM ensemble based transfer learning for large-scale membrane proteins discrimination.基于 SVM 集成的迁移学习在大规模膜蛋白区分中的应用。
J Theor Biol. 2014 Jan 7;340:105-10. doi: 10.1016/j.jtbi.2013.09.007. Epub 2013 Sep 16.
6
Computationally predicting protein-RNA interactions using only positive and unlabeled examples.仅使用正例和未标记示例进行蛋白质-RNA相互作用的计算预测。
J Bioinform Comput Biol. 2015 Jun;13(3):1541005. doi: 10.1142/S021972001541005X. Epub 2015 Feb 8.
7
Predicting protein-protein interactions between human and hepatitis C virus via an ensemble learning method.通过集成学习方法预测人类与丙型肝炎病毒之间的蛋白质-蛋白质相互作用。
Mol Biosyst. 2014 Dec;10(12):3147-54. doi: 10.1039/c4mb00410h. Epub 2014 Sep 18.
8
Effectively Identifying Compound-Protein Interactions by Learning from Positive and Unlabeled Examples.通过从正例和无标签样例中学习来有效识别化合物-蛋白质相互作用。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Nov-Dec;15(6):1832-1843. doi: 10.1109/TCBB.2016.2570211. Epub 2016 May 18.
9
Accurate prediction of protein-protein interactions by integrating potential evolutionary information embedded in PSSM profile and discriminative vector machine classifier.通过整合PSSM概况中嵌入的潜在进化信息和判别向量机分类器来准确预测蛋白质-蛋白质相互作用。
Oncotarget. 2017 Apr 4;8(14):23638-23649. doi: 10.18632/oncotarget.15564.
10
Gene ontology based transfer learning for protein subcellular localization.基于基因本体论的蛋白质亚细胞定位迁移学习。
BMC Bioinformatics. 2011 Feb 2;12:44. doi: 10.1186/1471-2105-12-44.

引用本文的文献

1
Computational Approach for Detection and Analysis of Human-Virus Protein-Protein Interactions.用于检测和分析人-病毒蛋白质-蛋白质相互作用的计算方法
Methods Mol Biol. 2025;2927:115-126. doi: 10.1007/978-1-0716-4546-8_6.
2
Deep Learning for Genomics: From Early Neural Nets to Modern Large Language Models.深度学习在基因组学中的应用:从早期神经网络到现代大型语言模型。
Int J Mol Sci. 2023 Nov 1;24(21):15858. doi: 10.3390/ijms242115858.
3
Segmentation-Assisted Fully Convolutional Neural Network Enhances Deep Learning Performance to Identify Proliferative Diabetic Retinopathy.

本文引用的文献

1
Multi-label multi-kernel transfer learning for human protein subcellular localization.多标签多内核迁移学习在人类蛋白质亚细胞定位中的应用。
PLoS One. 2012;7(6):e37716. doi: 10.1371/journal.pone.0037716. Epub 2012 Jun 13.
2
A novel biclustering approach to association rule mining for predicting HIV-1-human protein interactions.一种用于预测 HIV-1 人类蛋白质相互作用的关联规则挖掘的新型双聚类方法。
PLoS One. 2012;7(4):e32289. doi: 10.1371/journal.pone.0032289. Epub 2012 Apr 23.
3
Computational prediction of host-parasite protein interactions between P. falciparum and H. sapiens.
分割辅助全卷积神经网络提高深度学习性能以识别增殖性糖尿病视网膜病变。
J Clin Med. 2023 Jan 3;12(1):385. doi: 10.3390/jcm12010385.
4
In silico prediction of HIV-1-host molecular interactions and their directionality.基于计算机的 HIV-1 宿主分子相互作用及其方向性的预测。
PLoS Comput Biol. 2022 Feb 8;18(2):e1009720. doi: 10.1371/journal.pcbi.1009720. eCollection 2022 Feb.
5
A multitask transfer learning framework for the prediction of virus-human protein-protein interactions.一种用于预测病毒-人类蛋白质-蛋白质相互作用的多任务迁移学习框架。
BMC Bioinformatics. 2021 Nov 27;22(1):572. doi: 10.1186/s12859-021-04484-y.
6
Computational Biology and Machine Learning Approaches to Understand Mechanistic Microbiome-Host Interactions.理解微生物组与宿主相互作用机制的计算生物学和机器学习方法
Front Microbiol. 2021 May 11;12:618856. doi: 10.3389/fmicb.2021.618856. eCollection 2021.
7
Prediction of host-pathogen protein interactions by extended network model.通过扩展网络模型预测宿主-病原体蛋白相互作用
Turk J Biol. 2021 Apr 20;45(2):138-148. doi: 10.3906/biy-2009-4. eCollection 2021.
8
SARS-CoV-2-human protein-protein interaction network.严重急性呼吸综合征冠状病毒2与人的蛋白质-蛋白质相互作用网络。
Inform Med Unlocked. 2020;20:100413. doi: 10.1016/j.imu.2020.100413. Epub 2020 Aug 13.
9
In silico unravelling pathogen-host signaling cross-talks via pathogen mimicry and human protein-protein interaction networks.通过病原体模拟和人类蛋白质-蛋白质相互作用网络在计算机上解析病原体-宿主信号转导的相互作用。
Comput Struct Biotechnol J. 2019 Dec 27;18:100-113. doi: 10.1016/j.csbj.2019.12.008. eCollection 2020.
10
Neglog: Homology-Based Negative Data Sampling Method for Genome-Scale Reconstruction of Human Protein-Protein Interaction Networks.Neglog:基于同源性的负样本数据采样方法,用于人类蛋白质-蛋白质相互作用网络的全基因组规模重建。
Int J Mol Sci. 2019 Oct 12;20(20):5075. doi: 10.3390/ijms20205075.
疟原虫与人类宿主-寄生虫蛋白相互作用的计算预测。
PLoS One. 2011;6(11):e26960. doi: 10.1371/journal.pone.0026960. Epub 2011 Nov 17.
4
Gene Ontology-driven inference of protein-protein interactions using inducers.基于基因本体论的诱导剂驱动的蛋白质-蛋白质相互作用推断。
Bioinformatics. 2012 Jan 1;28(1):69-75. doi: 10.1093/bioinformatics/btr610. Epub 2011 Nov 4.
5
HIV-1 glycoprotein 41 ectodomain induces activation of the CD74 protein-mediated extracellular signal-regulated kinase/mitogen-activated protein kinase pathway to enhance viral infection.HIV-1 糖蛋白 41 胞外结构域诱导 CD74 蛋白介导的细胞外信号调节激酶/丝裂原活化蛋白激酶通路的激活,从而增强病毒感染。
J Biol Chem. 2011 Dec 30;286(52):44869-77. doi: 10.1074/jbc.M111.267393. Epub 2011 Oct 28.
6
Multi-kernel transfer learning based on Chou's PseAAC formulation for protein submitochondria localization.基于 Chou 的 PseAAC 构建的多核转移学习在蛋白质亚线粒体定位中的应用。
J Theor Biol. 2012 Jan 21;293:121-30. doi: 10.1016/j.jtbi.2011.10.015. Epub 2011 Oct 21.
7
Mechanism of host cell MAPK/ERK-2 incorporation into lentivirus particles: characterization of the interaction between MAPK/ERK-2 and proline-rich-domain containing capsid region of structural protein Gag.宿主细胞 MAPK/ERK-2 整合入慢病毒颗粒的机制:MAPK/ERK-2 与结构蛋白 Gag 的富含脯氨酸结构域之间相互作用的鉴定。
J Mol Biol. 2011 Jul 22;410(4):681-97. doi: 10.1016/j.jmb.2011.03.022.
8
Interactome networks and human disease.相互作用组网络与人类疾病。
Cell. 2011 Mar 18;144(6):986-98. doi: 10.1016/j.cell.2011.02.016.
9
Supervised learning and prediction of physical interactions between human and HIV proteins.人类与 HIV 蛋白间物理相互作用的有监督学习和预测。
Infect Genet Evol. 2011 Jul;11(5):917-23. doi: 10.1016/j.meegid.2011.02.022. Epub 2011 Mar 5.
10
Gene ontology based transfer learning for protein subcellular localization.基于基因本体论的蛋白质亚细胞定位迁移学习。
BMC Bioinformatics. 2011 Feb 2;12:44. doi: 10.1186/1471-2105-12-44.