• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于边缘特征和多源生物信息融合的必需蛋白质鉴定。

Identification of essential proteins based on edge features and the fusion of multiple-source biological information.

机构信息

School of Computer Science and Technology, Shandong Technology and Business University, Yantai, China.

College of Oceanography and Space Informatics, China University of Petroleum (East China), Qingdao, China.

出版信息

BMC Bioinformatics. 2023 May 17;24(1):203. doi: 10.1186/s12859-023-05315-y.

DOI:10.1186/s12859-023-05315-y
PMID:37198530
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10193741/
Abstract

BACKGROUND

A major current focus in the analysis of protein-protein interaction (PPI) data is how to identify essential proteins. As massive PPI data are available, this warrants the design of efficient computing methods for identifying essential proteins. Previous studies have achieved considerable performance. However, as a consequence of the features of high noise and structural complexity in PPIs, it is still a challenge to further upgrade the performance of the identification methods.

METHODS

This paper proposes an identification method, named CTF, which identifies essential proteins based on edge features including h-quasi-cliques and uv-triangle graphs and the fusion of multiple-source information. We first design an edge-weight function, named EWCT, for computing the topological scores of proteins based on quasi-cliques and triangle graphs. Then, we generate an edge-weighted PPI network using EWCT and dynamic PPI data. Finally, we compute the essentiality of proteins by the fusion of topological scores and three scores of biological information.

RESULTS

We evaluated the performance of the CTF method by comparison with 16 other methods, such as MON, PeC, TEGS, and LBCC, the experiment results on three datasets of Saccharomyces cerevisiae show that CTF outperforms the state-of-the-art methods. Moreover, our method indicates that the fusion of other biological information is beneficial to improve the accuracy of identification.

摘要

背景

目前,蛋白质-蛋白质相互作用(PPI)数据分析的一个主要焦点是如何识别必需蛋白质。由于大量的 PPI 数据可用,这就需要设计有效的计算方法来识别必需蛋白质。以前的研究已经取得了相当大的成果。然而,由于 PPI 中存在高噪声和结构复杂性的特点,进一步提高识别方法的性能仍然是一个挑战。

方法

本文提出了一种识别方法,称为 CTF,它基于边特征(包括 h-拟簇和 uv-三角形图)和多源信息融合来识别必需蛋白质。我们首先设计了一种边权重函数,称为 EWCT,用于基于拟簇和三角形图计算蛋白质的拓扑分数。然后,我们使用 EWCT 和动态 PPI 数据生成一个边加权 PPI 网络。最后,我们通过融合拓扑分数和三种生物信息分数来计算蛋白质的必需性。

结果

我们通过与其他 16 种方法(如 MON、PeC、TEGS 和 LBCC)进行比较,评估了 CTF 方法的性能,实验结果表明,在三个酿酒酵母数据集上,CTF 方法优于最先进的方法。此外,我们的方法表明融合其他生物信息有助于提高识别的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/867b/10193741/9598727f72e4/12859_2023_5315_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/867b/10193741/44344ea8e719/12859_2023_5315_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/867b/10193741/7f7cb5c39243/12859_2023_5315_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/867b/10193741/4acb81b574d5/12859_2023_5315_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/867b/10193741/fd56c8c8f54c/12859_2023_5315_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/867b/10193741/921d32608b8d/12859_2023_5315_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/867b/10193741/9598727f72e4/12859_2023_5315_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/867b/10193741/44344ea8e719/12859_2023_5315_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/867b/10193741/7f7cb5c39243/12859_2023_5315_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/867b/10193741/4acb81b574d5/12859_2023_5315_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/867b/10193741/fd56c8c8f54c/12859_2023_5315_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/867b/10193741/921d32608b8d/12859_2023_5315_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/867b/10193741/9598727f72e4/12859_2023_5315_Fig6_HTML.jpg

相似文献

1
Identification of essential proteins based on edge features and the fusion of multiple-source biological information.基于边缘特征和多源生物信息融合的必需蛋白质鉴定。
BMC Bioinformatics. 2023 May 17;24(1):203. doi: 10.1186/s12859-023-05315-y.
2
A new method for the discovery of essential proteins.一种发现必需蛋白的新方法。
PLoS One. 2013;8(3):e58763. doi: 10.1371/journal.pone.0058763. Epub 2013 Mar 21.
3
Predicting Essential Proteins by Integrating Network Topology, Subcellular Localization Information, Gene Expression Profile and GO Annotation Data.通过整合网络拓扑结构、亚细胞定位信息、基因表达谱和 GO 注释数据预测必需蛋白。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Nov-Dec;17(6):2053-2061. doi: 10.1109/TCBB.2019.2916038. Epub 2020 Dec 8.
4
A new method for predicting essential proteins based on participation degree in protein complex and subgraph density.基于参与蛋白复合物和子图密度的预测必需蛋白的新方法。
PLoS One. 2018 Jun 12;13(6):e0198998. doi: 10.1371/journal.pone.0198998. eCollection 2018.
5
Identifying essential proteins from protein-protein interaction networks based on influence maximization.基于影响最大化的蛋白质-蛋白质相互作用网络中关键蛋白质的识别。
BMC Bioinformatics. 2022 Aug 16;23(Suppl 8):339. doi: 10.1186/s12859-022-04874-w.
6
Identification of Essential Proteins Based on a New Combination of Local Interaction Density and Protein Complexes.基于局部相互作用密度与蛋白质复合物新组合的必需蛋白质鉴定
PLoS One. 2015 Jun 30;10(6):e0131418. doi: 10.1371/journal.pone.0131418. eCollection 2015.
7
An iteration method for identifying yeast essential proteins from heterogeneous network.从异质网络中鉴定酵母必需蛋白的迭代方法。
BMC Bioinformatics. 2019 Jun 24;20(1):355. doi: 10.1186/s12859-019-2930-2.
8
A novel essential protein identification method based on PPI networks and gene expression data.一种基于 PPI 网络和基因表达数据的新型必需蛋白鉴定方法。
BMC Bioinformatics. 2021 May 13;22(1):248. doi: 10.1186/s12859-021-04175-8.
9
A novel method to predict essential proteins based on tensor and HITS algorithm.一种基于张量和HITS算法预测必需蛋白质的新方法。
Hum Genomics. 2020 Apr 6;14(1):14. doi: 10.1186/s40246-020-00263-7.
10
A multi-network clustering method for detecting protein complexes from multiple heterogeneous networks.一种用于从多个异构网络中检测蛋白质复合物的多网络聚类方法。
BMC Bioinformatics. 2017 Dec 1;18(Suppl 13):463. doi: 10.1186/s12859-017-1877-4.

引用本文的文献

1
ECDEP: identifying essential proteins based on evolutionary community discovery and subcellular localization.ECDEP:基于进化群落发现和亚细胞定位识别必需蛋白质
BMC Genomics. 2024 Jan 26;25(1):117. doi: 10.1186/s12864-024-10019-5.

本文引用的文献

1
A deep learning framework for identifying essential proteins based on multiple biological information.基于多种生物信息识别必需蛋白的深度学习框架。
BMC Bioinformatics. 2022 Aug 4;23(1):318. doi: 10.1186/s12859-022-04868-8.
2
Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network.基于新型蛋白质结构域网络中蛋白质关键特征识别必需蛋白质的方法
Front Genet. 2021 Jun 29;12:708162. doi: 10.3389/fgene.2021.708162. eCollection 2021.
3
A novel essential protein identification method based on PPI networks and gene expression data.
一种基于 PPI 网络和基因表达数据的新型必需蛋白鉴定方法。
BMC Bioinformatics. 2021 May 13;22(1):248. doi: 10.1186/s12859-021-04175-8.
4
Method for Essential Protein Prediction Based on a Novel Weighted Protein-Domain Interaction Network.基于新型加权蛋白质-结构域相互作用网络的必需蛋白质预测方法
Front Genet. 2021 Mar 17;12:645932. doi: 10.3389/fgene.2021.645932. eCollection 2021.
5
CEGSO: Boosting Essential Proteins Prediction by Integrating Protein Complex, Gene Expression, Gene Ontology, Subcellular Localization and Orthology Information.CEGSO:通过整合蛋白质复合物、基因表达、基因本体论、亚细胞定位和同源信息来提高必需蛋白预测。
Interdiscip Sci. 2021 Sep;13(3):349-361. doi: 10.1007/s12539-021-00426-7. Epub 2021 Mar 27.
6
frDriver: A Functional Region Driver Identification for Protein Sequence.frDriver:蛋白质序列的功能区域驱动识别。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Sep-Oct;18(5):1773-1783. doi: 10.1109/TCBB.2020.3020096. Epub 2021 Oct 7.
7
A Novel Computational Approach for Identifying Essential Proteins From Multiplex Biological Networks.一种从多重生物网络中识别必需蛋白质的新型计算方法。
Front Genet. 2020 Apr 21;11:343. doi: 10.3389/fgene.2020.00343. eCollection 2020.
8
Predicting essential proteins from protein-protein interactions using order statistics.利用排序统计量从蛋白质-蛋白质相互作用预测必需蛋白质。
J Theor Biol. 2019 Nov 7;480:274-283. doi: 10.1016/j.jtbi.2019.06.022. Epub 2019 Jun 25.
9
Predicting Essential Proteins by Integrating Network Topology, Subcellular Localization Information, Gene Expression Profile and GO Annotation Data.通过整合网络拓扑结构、亚细胞定位信息、基因表达谱和 GO 注释数据预测必需蛋白。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Nov-Dec;17(6):2053-2061. doi: 10.1109/TCBB.2019.2916038. Epub 2020 Dec 8.
10
A Deep Learning Framework for Identifying Essential Proteins by Integrating Multiple Types of Biological Information.一种通过整合多种类型生物信息来识别必需蛋白质的深度学习框架。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb;18(1):296-305. doi: 10.1109/TCBB.2019.2897679. Epub 2021 Feb 3.