• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于网络拓扑特性和生物信息识别必需蛋白质的新计算策略。

A new computational strategy for identifying essential proteins based on network topological properties and biological information.

作者信息

Qin Chao, Sun Yongqi, Dong Yadong

机构信息

Beijing Key Lab of Traffic Data Analysis and Mining, School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China.

出版信息

PLoS One. 2017 Jul 28;12(7):e0182031. doi: 10.1371/journal.pone.0182031. eCollection 2017.

DOI:10.1371/journal.pone.0182031
PMID:28753682
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5533339/
Abstract

Essential proteins are the proteins that are indispensable to the survival and development of an organism. Deleting a single essential protein will cause lethality or infertility. Identifying and analysing essential proteins are key to understanding the molecular mechanisms of living cells. There are two types of methods for predicting essential proteins: experimental methods, which require considerable time and resources, and computational methods, which overcome the shortcomings of experimental methods. However, the prediction accuracy of computational methods for essential proteins requires further improvement. In this paper, we propose a new computational strategy named CoTB for identifying essential proteins based on a combination of topological properties, subcellular localization information and orthologous protein information. First, we introduce several topological properties of the protein-protein interaction (PPI) network. Second, we propose new methods for measuring orthologous information and subcellular localization and a new computational strategy that uses a random forest prediction model to obtain a probability score for the proteins being essential. Finally, we conduct experiments on four different Saccharomyces cerevisiae datasets. The experimental results demonstrate that our strategy for identifying essential proteins outperforms traditional computational methods and the most recently developed method, SON. In particular, our strategy improves the prediction accuracy to 89, 78, 79, and 85 percent on the YDIP, YMIPS, YMBD and YHQ datasets at the top 100 level, respectively.

摘要

必需蛋白质是生物体生存和发育所不可或缺的蛋白质。删除单个必需蛋白质会导致致死性或不育。识别和分析必需蛋白质是理解活细胞分子机制的关键。预测必需蛋白质有两种方法:实验方法,需要大量时间和资源;计算方法,克服了实验方法的缺点。然而,必需蛋白质计算方法的预测准确性还需要进一步提高。在本文中,我们提出了一种名为CoTB的新计算策略,用于基于拓扑性质、亚细胞定位信息和直系同源蛋白质信息的组合来识别必需蛋白质。首先,我们介绍了蛋白质-蛋白质相互作用(PPI)网络的几种拓扑性质。其次,我们提出了测量直系同源信息和亚细胞定位的新方法以及一种新的计算策略,该策略使用随机森林预测模型来获得蛋白质为必需蛋白质的概率得分。最后,我们在四个不同的酿酒酵母数据集上进行了实验。实验结果表明,我们识别必需蛋白质的策略优于传统计算方法和最近开发的SON方法。特别是,我们的策略在YDIP、YMIPS、YMBD和YHQ数据集上,在排名前100的水平上分别将预测准确率提高到了89%、78%、79%和85%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/827a6835ddc3/pone.0182031.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/cbcb69dd1634/pone.0182031.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/824e315c087e/pone.0182031.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/23edc1dca159/pone.0182031.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/9d9f6fbbe95c/pone.0182031.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/17c9fb2a4c06/pone.0182031.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/4e587312986e/pone.0182031.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/8473ceff1e71/pone.0182031.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/156d97921f98/pone.0182031.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/dd9935d1d7dc/pone.0182031.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/954cba57bc82/pone.0182031.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/58d54d89c370/pone.0182031.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/5e6508b63455/pone.0182031.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/ec2f8d28bd41/pone.0182031.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/bcb6b03371f5/pone.0182031.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/827a6835ddc3/pone.0182031.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/cbcb69dd1634/pone.0182031.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/824e315c087e/pone.0182031.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/23edc1dca159/pone.0182031.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/9d9f6fbbe95c/pone.0182031.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/17c9fb2a4c06/pone.0182031.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/4e587312986e/pone.0182031.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/8473ceff1e71/pone.0182031.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/156d97921f98/pone.0182031.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/dd9935d1d7dc/pone.0182031.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/954cba57bc82/pone.0182031.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/58d54d89c370/pone.0182031.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/5e6508b63455/pone.0182031.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/ec2f8d28bd41/pone.0182031.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/bcb6b03371f5/pone.0182031.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4d/5533339/827a6835ddc3/pone.0182031.g015.jpg

相似文献

1
A new computational strategy for identifying essential proteins based on network topological properties and biological information.一种基于网络拓扑特性和生物信息识别必需蛋白质的新计算策略。
PLoS One. 2017 Jul 28;12(7):e0182031. doi: 10.1371/journal.pone.0182031. eCollection 2017.
2
A New Method for Identifying Essential Proteins Based on Network Topology Properties and Protein Complexes.一种基于网络拓扑性质和蛋白质复合物识别关键蛋白质的新方法。
PLoS One. 2016 Aug 16;11(8):e0161042. doi: 10.1371/journal.pone.0161042. eCollection 2016.
3
Predicting essential proteins based on subcellular localization, orthology and PPI networks.基于亚细胞定位、直系同源性和蛋白质-蛋白质相互作用网络预测必需蛋白质
BMC Bioinformatics. 2016 Aug 31;17 Suppl 8(Suppl 8):279. doi: 10.1186/s12859-016-1115-5.
4
Prediction of essential proteins based on subcellular localization and gene expression correlation.基于亚细胞定位和基因表达相关性的必需蛋白质预测
BMC Bioinformatics. 2017 Dec 1;18(Suppl 13):470. doi: 10.1186/s12859-017-1876-5.
5
Identification of Essential Proteins Based on a New Combination of Local Interaction Density and Protein Complexes.基于局部相互作用密度与蛋白质复合物新组合的必需蛋白质鉴定
PLoS One. 2015 Jun 30;10(6):e0131418. doi: 10.1371/journal.pone.0131418. eCollection 2015.
6
UDoNC: An Algorithm for Identifying Essential Proteins Based on Protein Domains and Protein-Protein Interaction Networks.UDoNC:一种基于蛋白质结构域和蛋白质-蛋白质相互作用网络识别必需蛋白质的算法。
IEEE/ACM Trans Comput Biol Bioinform. 2015 Mar-Apr;12(2):276-88. doi: 10.1109/TCBB.2014.2338317.
7
Identifying essential proteins based on sub-network partition and prioritization by integrating subcellular localization information.基于子网络划分和基于细胞内定位信息的优先级排序来识别必需蛋白质。
J Theor Biol. 2018 Jun 14;447:65-73. doi: 10.1016/j.jtbi.2018.03.029. Epub 2018 Mar 21.
8
Identifying protein complexes based on multiple topological structures in PPI networks.基于 PPI 网络中的多种拓扑结构识别蛋白质复合物。
IEEE Trans Nanobioscience. 2013 Sep;12(3):165-72. doi: 10.1109/TNB.2013.2264097. Epub 2013 Aug 21.
9
United Complex Centrality for Identification of Essential Proteins from PPI Networks.用于从蛋白质-蛋白质相互作用网络中识别关键蛋白质的联合复杂中心性
IEEE/ACM Trans Comput Biol Bioinform. 2017 Mar-Apr;14(2):370-380. doi: 10.1109/TCBB.2015.2394487.
10
An iteration model for identifying essential proteins by combining comprehensive PPI network with biological information.一种通过整合综合蛋白质-蛋白质相互作用网络和生物信息来识别必需蛋白质的迭代模型。
BMC Bioinformatics. 2021 Sep 8;22(1):430. doi: 10.1186/s12859-021-04300-7.

引用本文的文献

1
Mitochondrial Dysfunction Pathway Alterations Offer Potential Biomarkers and Therapeutic Targets for Ovarian Cancer.线粒体功能障碍通路改变为卵巢癌提供了有潜力的生物标志物和治疗靶点。
Oxid Med Cell Longev. 2022 Apr 20;2022:5634724. doi: 10.1155/2022/5634724. eCollection 2022.
2
A Novel Collaborative Filtering Model-Based Method for Identifying Essential Proteins.一种基于新型协同过滤模型的关键蛋白质识别方法。
Front Genet. 2021 Oct 21;12:763153. doi: 10.3389/fgene.2021.763153. eCollection 2021.
3
Essential gene prediction using limited gene essentiality information-An integrative semi-supervised machine learning strategy.

本文引用的文献

1
United Complex Centrality for Identification of Essential Proteins from PPI Networks.用于从蛋白质-蛋白质相互作用网络中识别关键蛋白质的联合复杂中心性
IEEE/ACM Trans Comput Biol Bioinform. 2017 Mar-Apr;14(2):370-380. doi: 10.1109/TCBB.2015.2394487.
2
Predicting essential proteins based on subcellular localization, orthology and PPI networks.基于亚细胞定位、直系同源性和蛋白质-蛋白质相互作用网络预测必需蛋白质
BMC Bioinformatics. 2016 Aug 31;17 Suppl 8(Suppl 8):279. doi: 10.1186/s12859-016-1115-5.
3
A New Method for Identifying Essential Proteins Based on Network Topology Properties and Protein Complexes.
利用有限的基因必需性信息进行必需基因预测——一种综合的半监督机器学习策略。
PLoS One. 2020 Nov 30;15(11):e0242943. doi: 10.1371/journal.pone.0242943. eCollection 2020.
一种基于网络拓扑性质和蛋白质复合物识别关键蛋白质的新方法。
PLoS One. 2016 Aug 16;11(8):e0161042. doi: 10.1371/journal.pone.0161042. eCollection 2016.
4
Essential protein discovery based on a combination of modularity and conservatism.基于模块性和保守性相结合的必需蛋白质发现。
Methods. 2016 Nov 1;110:54-63. doi: 10.1016/j.ymeth.2016.07.005. Epub 2016 Jul 9.
5
A Topology Potential-Based Method for Identifying Essential Proteins from PPI Networks.一种基于拓扑势的从蛋白质-蛋白质相互作用网络中识别必需蛋白质的方法。
IEEE/ACM Trans Comput Biol Bioinform. 2015 Mar-Apr;12(2):372-83. doi: 10.1109/TCBB.2014.2361350.
6
Predicting Essential Proteins Based on Weighted Degree Centrality.基于加权度中心性预测必需蛋白质
IEEE/ACM Trans Comput Biol Bioinform. 2014 Mar-Apr;11(2):407-18. doi: 10.1109/TCBB.2013.2295318.
7
Identification of Essential Proteins Based on a New Combination of Local Interaction Density and Protein Complexes.基于局部相互作用密度与蛋白质复合物新组合的必需蛋白质鉴定
PLoS One. 2015 Jun 30;10(6):e0131418. doi: 10.1371/journal.pone.0131418. eCollection 2015.
8
Rechecking the Centrality-Lethality Rule in the Scope of Protein Subcellular Localization Interaction Networks.在蛋白质亚细胞定位相互作用网络范围内重新审视中心性-致死性规则。
PLoS One. 2015 Jun 26;10(6):e0130743. doi: 10.1371/journal.pone.0130743. eCollection 2015.
9
CytoNCA: a cytoscape plugin for centrality analysis and evaluation of protein interaction networks.CytoNCA:一种用于蛋白质相互作用网络中心性分析与评估的Cytoscape插件。
Biosystems. 2015 Jan;127:67-72. doi: 10.1016/j.biosystems.2014.11.005. Epub 2014 Nov 15.
10
COMPARTMENTS: unification and visualization of protein subcellular localization evidence.区室:蛋白质亚细胞定位证据的整合与可视化
Database (Oxford). 2014 Feb 25;2014:bau012. doi: 10.1093/database/bau012. Print 2014.