• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于新型协同过滤模型的关键蛋白质识别方法。

A Novel Collaborative Filtering Model-Based Method for Identifying Essential Proteins.

作者信息

Zhu Xianyou, He Xin, Kuang Linai, Chen Zhiping, Lancine Camara

机构信息

College of Computer Science and Technology, Hengyang Normal University, Hengyang, China.

Hunan Provincial Key Laboratory of Intelligent Information Processing and Application, Hengyang, China.

出版信息

Front Genet. 2021 Oct 21;12:763153. doi: 10.3389/fgene.2021.763153. eCollection 2021.

DOI:10.3389/fgene.2021.763153
PMID:34745230
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8566338/
Abstract

Considering that traditional biological experiments are expensive and time consuming, it is important to develop effective computational models to infer potential essential proteins. In this manuscript, a novel collaborative filtering model-based method called CFMM was proposed, in which, an updated protein-domain interaction (PDI) network was constructed first by applying collaborative filtering algorithm on the original PDI network, and then, through integrating topological features of PDI networks with biological features of proteins, a calculative method was designed to infer potential essential proteins based on an improved PageRank algorithm. The novelties of CFMM lie in construction of an updated PDI network, application of the commodity-customer-based collaborative filtering algorithm, and introduction of the calculation method based on an improved PageRank algorithm, which ensured that CFMM can be applied to predict essential proteins without relying entirely on known protein-domain associations. Simulation results showed that CFMM can achieve reliable prediction accuracies of 92.16, 83.14, 71.37, 63.87, 55.84, and 52.43% in the top 1, 5, 10, 15, 20, and 25% predicted candidate key proteins based on the DIP database, which are remarkably higher than 14 competitive state-of-the-art predictive models as a whole, and in addition, CFMM can achieve satisfactory predictive performances based on different databases with various evaluation measurements, which further indicated that CFMM may be a useful tool for the identification of essential proteins in the future.

摘要

考虑到传统生物学实验成本高昂且耗时,开发有效的计算模型以推断潜在的必需蛋白质非常重要。在本论文中,提出了一种基于协同过滤模型的新方法CFMM,其中,首先通过在原始蛋白质-结构域相互作用(PDI)网络上应用协同过滤算法构建更新的PDI网络,然后,通过将PDI网络的拓扑特征与蛋白质的生物学特征相结合,设计了一种基于改进的PageRank算法来推断潜在必需蛋白质的计算方法。CFMM的新颖之处在于构建更新的PDI网络、应用基于商品-客户的协同过滤算法以及引入基于改进的PageRank算法的计算方法,这确保了CFMM可以在不完全依赖已知蛋白质-结构域关联的情况下用于预测必需蛋白质。模拟结果表明,基于DIP数据库,CFMM在预测的前1%、5%、10%、15%、20%和25%的候选关键蛋白质中分别能达到92.16%、83.14%、71.37%、63.87%、55.84%和52.43%的可靠预测准确率,总体上显著高于14种具有竞争力的先进预测模型,此外,CFMM基于不同数据库并采用各种评估指标时都能取得令人满意的预测性能,这进一步表明CFMM未来可能是识别必需蛋白质的有用工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/992dec1097a0/fgene-12-763153-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/1886bf19b0aa/fgene-12-763153-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/1f7a8a1751c3/fgene-12-763153-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/775afb4e3265/fgene-12-763153-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/de0ef82724d1/fgene-12-763153-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/0c0d5e308434/fgene-12-763153-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/0b1683da1400/fgene-12-763153-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/06b49f22a1f4/fgene-12-763153-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/37d924ecf580/fgene-12-763153-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/b46b196fe3e1/fgene-12-763153-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/992dec1097a0/fgene-12-763153-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/1886bf19b0aa/fgene-12-763153-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/1f7a8a1751c3/fgene-12-763153-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/775afb4e3265/fgene-12-763153-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/de0ef82724d1/fgene-12-763153-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/0c0d5e308434/fgene-12-763153-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/0b1683da1400/fgene-12-763153-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/06b49f22a1f4/fgene-12-763153-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/37d924ecf580/fgene-12-763153-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/b46b196fe3e1/fgene-12-763153-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d713/8566338/992dec1097a0/fgene-12-763153-g010.jpg

相似文献

1
A Novel Collaborative Filtering Model-Based Method for Identifying Essential Proteins.一种基于新型协同过滤模型的关键蛋白质识别方法。
Front Genet. 2021 Oct 21;12:763153. doi: 10.3389/fgene.2021.763153. eCollection 2021.
2
A novel collaborative filtering model for LncRNA-disease association prediction based on the Naïve Bayesian classifier.基于朴素贝叶斯分类器的新型长链非编码 RNA 疾病关联预测协同过滤模型。
BMC Bioinformatics. 2019 Jul 17;20(1):396. doi: 10.1186/s12859-019-2985-0.
3
Method for Essential Protein Prediction Based on a Novel Weighted Protein-Domain Interaction Network.基于新型加权蛋白质-结构域相互作用网络的必需蛋白质预测方法
Front Genet. 2021 Mar 17;12:645932. doi: 10.3389/fgene.2021.645932. eCollection 2021.
4
Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network.基于新型蛋白质结构域网络中蛋白质关键特征识别必需蛋白质的方法
Front Genet. 2021 Jun 29;12:708162. doi: 10.3389/fgene.2021.708162. eCollection 2021.
5
A Novel Model for Identifying Essential Proteins Based on Key Target Convergence Sets.一种基于关键靶点收敛集识别必需蛋白质的新模型。
Front Genet. 2021 Jul 29;12:721486. doi: 10.3389/fgene.2021.721486. eCollection 2021.
6
An iteration model for identifying essential proteins by combining comprehensive PPI network with biological information.一种通过整合综合蛋白质-蛋白质相互作用网络和生物信息来识别必需蛋白质的迭代模型。
BMC Bioinformatics. 2021 Sep 8;22(1):430. doi: 10.1186/s12859-021-04300-7.
7
A disease-related essential protein prediction model based on the transfer neural network.一种基于迁移神经网络的疾病相关必需蛋白质预测模型。
Front Genet. 2023 Jan 4;13:1087294. doi: 10.3389/fgene.2022.1087294. eCollection 2022.
8
Identification of Essential Proteins Based on Local Random Walk and Adaptive Multi-View Multi-Label Learning.基于局部随机游走和自适应多视图多标签学习的关键蛋白质识别
IEEE/ACM Trans Comput Biol Bioinform. 2022 Nov-Dec;19(6):3507-3516. doi: 10.1109/TCBB.2021.3128638. Epub 2022 Dec 8.
9
A novel target convergence set based random walk with restart for prediction of potential LncRNA-disease associations.基于新型目标收敛集的重启动随机游走算法预测潜在的 lncRNA-疾病关联
BMC Bioinformatics. 2019 Dec 3;20(1):626. doi: 10.1186/s12859-019-3216-4.
10
A Novel Method for Identifying Essential Proteins Based on Non-negative Matrix Tri-Factorization.一种基于非负矩阵三因子分解的关键蛋白质识别新方法。
Front Genet. 2021 Aug 6;12:709660. doi: 10.3389/fgene.2021.709660. eCollection 2021.

本文引用的文献

1
Method for Essential Protein Prediction Based on a Novel Weighted Protein-Domain Interaction Network.基于新型加权蛋白质-结构域相互作用网络的必需蛋白质预测方法
Front Genet. 2021 Mar 17;12:645932. doi: 10.3389/fgene.2021.645932. eCollection 2021.
2
RNMFMDA: A Microbe-Disease Association Identification Method Based on Reliable Negative Sample Selection and Logistic Matrix Factorization With Neighborhood Regularization.RNMFMDA:一种基于可靠负样本选择和带邻域正则化的逻辑矩阵分解的微生物-疾病关联识别方法。
Front Microbiol. 2020 Oct 27;11:592430. doi: 10.3389/fmicb.2020.592430. eCollection 2020.
3
A novel scheme for essential protein discovery based on multi-source biological information.
一种基于多源生物信息的必需蛋白质发现新方案。
J Theor Biol. 2020 Nov 7;504:110414. doi: 10.1016/j.jtbi.2020.110414. Epub 2020 Jul 23.
4
An iteration method for identifying yeast essential proteins from heterogeneous network.从异质网络中鉴定酵母必需蛋白的迭代方法。
BMC Bioinformatics. 2019 Jun 24;20(1):355. doi: 10.1186/s12859-019-2930-2.
5
Revealing Drug-Target Interactions with Computational Models and Algorithms.揭示药物-靶标相互作用的计算模型和算法。
Molecules. 2019 May 2;24(9):1714. doi: 10.3390/molecules24091714.
6
Feature Selection via Swarm Intelligence for Determining Protein Essentiality.基于群体智能的蛋白质必需性预测特征选择
Molecules. 2018 Jun 28;23(7):1569. doi: 10.3390/molecules23071569.
7
Predicting essential proteins by integrating orthology, gene expressions, and PPI networks.通过整合直系同源、基因表达和 PPI 网络预测必需蛋白质。
PLoS One. 2018 Apr 10;13(4):e0195410. doi: 10.1371/journal.pone.0195410. eCollection 2018.
8
Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways.利用基因本体论和KEGG通路的富集对必需基因进行预测和分析。
PLoS One. 2017 Sep 5;12(9):e0184129. doi: 10.1371/journal.pone.0184129. eCollection 2017.
9
A new computational strategy for identifying essential proteins based on network topological properties and biological information.一种基于网络拓扑特性和生物信息识别必需蛋白质的新计算策略。
PLoS One. 2017 Jul 28;12(7):e0182031. doi: 10.1371/journal.pone.0182031. eCollection 2017.
10
Detecting Essential Proteins Based on Network Topology, Gene Expression Data, and Gene Ontology Information.基于网络拓扑、基因表达数据和基因本体论信息检测必需蛋白质。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Jan-Feb;15(1):109-116. doi: 10.1109/TCBB.2016.2615931. Epub 2016 Oct 7.