• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于非负矩阵三因子分解的关键蛋白质识别新方法。

A Novel Method for Identifying Essential Proteins Based on Non-negative Matrix Tri-Factorization.

作者信息

Zhang Zhihong, Jiang Meiping, Wu Dongjie, Zhang Wang, Yan Wei, Qu Xilong

机构信息

College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China.

School of Information Technology and Management, Hunan University of Finance and Economics, Changsha, China.

出版信息

Front Genet. 2021 Aug 6;12:709660. doi: 10.3389/fgene.2021.709660. eCollection 2021.

DOI:10.3389/fgene.2021.709660
PMID:34422014
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8378176/
Abstract

Identification of essential proteins is very important for understanding the basic requirements to sustain a living organism. In recent years, there has been an increasing interest in using computational methods to predict essential proteins based on protein-protein interaction (PPI) networks or fusing multiple biological information. However, it has been observed that existing PPI data have false-negative and false-positive data. The fusion of multiple biological information can reduce the influence of false data in PPI, but inevitably more noise data will be produced at the same time. In this article, we proposed a novel non-negative matrix tri-factorization (NMTF)-based model (NTMEP) to predict essential proteins. Firstly, a weighted PPI network is established only using the topology features of the network, so as to avoid more noise. To reduce the influence of false data (existing in PPI network) on performance of identify essential proteins, the NMTF technique, as a widely used recommendation algorithm, is performed to reconstruct a most optimized PPI network with more potential protein-protein interactions. Then, we use the PageRank algorithm to compute the final ranking score of each protein, in which subcellular localization and homologous information of proteins were used to calculate the initial scores. In addition, extensive experiments are performed on the publicly available datasets and the results indicate that our NTMEP model has better performance in predicting essential proteins against the start-of-the-art method. In this investigation, we demonstrated that the introduction of non-negative matrix tri-factorization technology can effectively improve the condition of the protein-protein interaction network, so as to reduce the negative impact of noise on the prediction. At the same time, this finding provides a more novel angle of view for other applications based on protein-protein interaction networks.

摘要

识别必需蛋白质对于理解维持生物体生存的基本要求非常重要。近年来,人们越来越有兴趣使用计算方法,基于蛋白质-蛋白质相互作用(PPI)网络或融合多种生物信息来预测必需蛋白质。然而,据观察,现有的PPI数据存在假阴性和假阳性数据。融合多种生物信息可以减少PPI中虚假数据的影响,但同时不可避免地会产生更多噪声数据。在本文中,我们提出了一种基于非负矩阵三因子分解(NMTF)的新型模型(NTMEP)来预测必需蛋白质。首先,仅利用网络的拓扑特征建立加权PPI网络,以避免更多噪声。为了减少虚假数据(存在于PPI网络中)对识别必需蛋白质性能的影响,作为一种广泛使用的推荐算法,采用NMTF技术来重建一个具有更多潜在蛋白质-蛋白质相互作用的最优化PPI网络。然后,我们使用PageRank算法计算每个蛋白质的最终排名分数,其中蛋白质的亚细胞定位和同源信息用于计算初始分数。此外,在公开可用的数据集上进行了广泛的实验,结果表明我们的NTMEP模型在预测必需蛋白质方面比现有最先进的方法具有更好的性能。在这项研究中,我们证明了引入非负矩阵三因子分解技术可以有效改善蛋白质-蛋白质相互作用网络的状况,从而减少噪声对预测的负面影响。同时,这一发现为基于蛋白质-蛋白质相互作用网络的其他应用提供了一个更新颖的视角。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f72c/8378176/a6c417e1865f/fgene-12-709660-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f72c/8378176/8d57fa865c8a/fgene-12-709660-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f72c/8378176/74de409412cf/fgene-12-709660-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f72c/8378176/a6c417e1865f/fgene-12-709660-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f72c/8378176/8d57fa865c8a/fgene-12-709660-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f72c/8378176/74de409412cf/fgene-12-709660-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f72c/8378176/a6c417e1865f/fgene-12-709660-g003.jpg

相似文献

1
A Novel Method for Identifying Essential Proteins Based on Non-negative Matrix Tri-Factorization.一种基于非负矩阵三因子分解的关键蛋白质识别新方法。
Front Genet. 2021 Aug 6;12:709660. doi: 10.3389/fgene.2021.709660. eCollection 2021.
2
An efficient strategy for identifying essential proteins based on homology, subcellular location and protein-protein interaction information.基于同源性、亚细胞定位和蛋白质-蛋白质相互作用信息鉴定必需蛋白质的有效策略。
Math Biosci Eng. 2022 Apr 20;19(6):6331-6343. doi: 10.3934/mbe.2022296.
3
An iteration method for identifying yeast essential proteins from heterogeneous network.从异质网络中鉴定酵母必需蛋白的迭代方法。
BMC Bioinformatics. 2019 Jun 24;20(1):355. doi: 10.1186/s12859-019-2930-2.
4
Protein functional properties prediction in sparsely-label PPI networks through regularized non-negative matrix factorization.通过正则化非负矩阵分解在稀疏标记的蛋白质-蛋白质相互作用网络中预测蛋白质功能特性
BMC Syst Biol. 2015;9 Suppl 1(Suppl 1):S9. doi: 10.1186/1752-0509-9-S1-S9. Epub 2015 Jan 21.
5
Fuse: multiple network alignment via data fusion.Fuse:通过数据融合进行多重网络比对。
Bioinformatics. 2016 Apr 15;32(8):1195-203. doi: 10.1093/bioinformatics/btv731. Epub 2015 Dec 14.
6
Identifying Protein Complexes With Clear Module Structure Using Pairwise Constraints in Protein Interaction Networks.利用蛋白质相互作用网络中的成对约束识别具有清晰模块结构的蛋白质复合物
Front Genet. 2021 Aug 27;12:664786. doi: 10.3389/fgene.2021.664786. eCollection 2021.
7
Identifying essential proteins based on sub-network partition and prioritization by integrating subcellular localization information.基于子网络划分和基于细胞内定位信息的优先级排序来识别必需蛋白质。
J Theor Biol. 2018 Jun 14;447:65-73. doi: 10.1016/j.jtbi.2018.03.029. Epub 2018 Mar 21.
8
Method for Essential Protein Prediction Based on a Novel Weighted Protein-Domain Interaction Network.基于新型加权蛋白质-结构域相互作用网络的必需蛋白质预测方法
Front Genet. 2021 Mar 17;12:645932. doi: 10.3389/fgene.2021.645932. eCollection 2021.
9
A novel essential protein identification method based on PPI networks and gene expression data.一种基于 PPI 网络和基因表达数据的新型必需蛋白鉴定方法。
BMC Bioinformatics. 2021 May 13;22(1):248. doi: 10.1186/s12859-021-04175-8.
10
Robust capped norm dual hyper-graph regularized non-negative matrix tri-factorization.鲁棒帽范数对偶超图正则化非负矩阵三因子分解
Math Biosci Eng. 2023 May 24;20(7):12486-12509. doi: 10.3934/mbe.2023556.

引用本文的文献

1
Essential proteins discovery based on dominance relationship and neighborhood similarity centrality.基于支配关系和邻域相似性中心性的必需蛋白质发现
Health Inf Sci Syst. 2023 Nov 16;11(1):55. doi: 10.1007/s13755-023-00252-9. eCollection 2023 Dec.
2
Network and pathway-based analysis of candidate genes associated with esophageal adenocarcinoma.基于网络和通路的食管腺癌相关候选基因分析
J Gastrointest Oncol. 2023 Feb 28;14(1):40-53. doi: 10.21037/jgo-22-1286. Epub 2023 Feb 15.

本文引用的文献

1
A novel essential protein identification method based on PPI networks and gene expression data.一种基于 PPI 网络和基因表达数据的新型必需蛋白鉴定方法。
BMC Bioinformatics. 2021 May 13;22(1):248. doi: 10.1186/s12859-021-04175-8.
2
A Novel Method for Identifying Essential Genes by Fusing Dynamic Protein⁻Protein Interactive Networks.一种融合动态蛋白质-蛋白质相互作用网络的识别必需基因的新方法。
Genes (Basel). 2019 Jan 8;10(1):31. doi: 10.3390/genes10010031.
3
United Neighborhood Closeness Centrality and Orthology for Predicting Essential Proteins.
基于邻近中心度和同源性的核心蛋白质预测
IEEE/ACM Trans Comput Biol Bioinform. 2020 Jul-Aug;17(4):1451-1458. doi: 10.1109/TCBB.2018.2889978. Epub 2018 Dec 27.
4
Improved flower pollination algorithm for identifying essential proteins.用于识别必需蛋白质的改进型花授粉算法
BMC Syst Biol. 2018 Apr 24;12(Suppl 4):46. doi: 10.1186/s12918-018-0573-y.
5
Detecting Essential Proteins Based on Network Topology, Gene Expression Data, and Gene Ontology Information.基于网络拓扑、基因表达数据和基因本体论信息检测必需蛋白质。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Jan-Feb;15(1):109-116. doi: 10.1109/TCBB.2016.2615931. Epub 2016 Oct 7.
6
Predicting essential proteins based on subcellular localization, orthology and PPI networks.基于亚细胞定位、直系同源性和蛋白质-蛋白质相互作用网络预测必需蛋白质
BMC Bioinformatics. 2016 Aug 31;17 Suppl 8(Suppl 8):279. doi: 10.1186/s12859-016-1115-5.
7
Essential protein discovery based on a combination of modularity and conservatism.基于模块性和保守性相结合的必需蛋白质发现。
Methods. 2016 Nov 1;110:54-63. doi: 10.1016/j.ymeth.2016.07.005. Epub 2016 Jul 9.
8
A Nonnegative Latent Factor Model for Large-Scale Sparse Matrices in Recommender Systems via Alternating Direction Method.通过交替方向法对推荐系统中大规模稀疏矩阵进行非负潜在因子建模。
IEEE Trans Neural Netw Learn Syst. 2016 Mar;27(3):579-92. doi: 10.1109/TNNLS.2015.2415257. Epub 2015 May 22.
9
Prediction of essential proteins based on overlapping essential modules.基于重叠必需模块的必需蛋白质预测
IEEE Trans Nanobioscience. 2014 Dec;13(4):415-24. doi: 10.1109/TNB.2014.2337912. Epub 2014 Aug 7.
10
COMPARTMENTS: unification and visualization of protein subcellular localization evidence.区室:蛋白质亚细胞定位证据的整合与可视化
Database (Oxford). 2014 Feb 25;2014:bau012. doi: 10.1093/database/bau012. Print 2014.