• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于核偏最小二乘回归的功能关联网络基因功能预测

Gene Function Prediction from Functional Association Networks Using Kernel Partial Least Squares Regression.

作者信息

Lehtinen Sonja, Lees Jon, Bähler Jürg, Shawe-Taylor John, Orengo Christine

机构信息

CoMPLEX, University College London, London, United Kingdom; Institute of Structural and Molecular Biology, University College London, London, United Kingdom.

Institute of Structural and Molecular Biology, University College London, London, United Kingdom.

出版信息

PLoS One. 2015 Aug 19;10(8):e0134668. doi: 10.1371/journal.pone.0134668. eCollection 2015.

DOI:10.1371/journal.pone.0134668
PMID:26288239
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4545790/
Abstract

With the growing availability of large-scale biological datasets, automated methods of extracting functionally meaningful information from this data are becoming increasingly important. Data relating to functional association between genes or proteins, such as co-expression or functional association, is often represented in terms of gene or protein networks. Several methods of predicting gene function from these networks have been proposed. However, evaluating the relative performance of these algorithms may not be trivial: concerns have been raised over biases in different benchmarking methods and datasets, particularly relating to non-independence of functional association data and test data. In this paper we propose a new network-based gene function prediction algorithm using a commute-time kernel and partial least squares regression (Compass). We compare Compass to GeneMANIA, a leading network-based prediction algorithm, using a number of different benchmarks, and find that Compass outperforms GeneMANIA on these benchmarks. We also explicitly explore problems associated with the non-independence of functional association data and test data. We find that a benchmark based on the Gene Ontology database, which, directly or indirectly, incorporates information from other databases, may considerably overestimate the performance of algorithms exploiting functional association data for prediction.

摘要

随着大规模生物数据集越来越容易获取,从这些数据中提取具有功能意义信息的自动化方法正变得日益重要。与基因或蛋白质之间功能关联相关的数据,如共表达或功能关联,通常以基因或蛋白质网络的形式呈现。已经提出了几种从这些网络预测基因功能的方法。然而,评估这些算法的相对性能并非易事:不同的基准测试方法和数据集存在偏差的问题已被提出,特别是与功能关联数据和测试数据的非独立性有关。在本文中,我们提出了一种使用通勤时间核和偏最小二乘回归的基于网络的新基因功能预测算法(Compass)。我们使用多种不同的基准测试将Compass与领先的基于网络的预测算法GeneMANIA进行比较,发现Compass在这些基准测试上优于GeneMANIA。我们还明确探讨了与功能关联数据和测试数据的非独立性相关的问题。我们发现基于基因本体数据库的基准测试,该数据库直接或间接地纳入了来自其他数据库的信息,可能会大大高估利用功能关联数据进行预测的算法的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76a6/4545790/ac86f56ce3c6/pone.0134668.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76a6/4545790/fe9534b22951/pone.0134668.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76a6/4545790/01baba1d48cb/pone.0134668.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76a6/4545790/a068f9455a26/pone.0134668.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76a6/4545790/0455d83d2a9f/pone.0134668.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76a6/4545790/ac86f56ce3c6/pone.0134668.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76a6/4545790/fe9534b22951/pone.0134668.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76a6/4545790/01baba1d48cb/pone.0134668.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76a6/4545790/a068f9455a26/pone.0134668.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76a6/4545790/0455d83d2a9f/pone.0134668.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76a6/4545790/ac86f56ce3c6/pone.0134668.g005.jpg

相似文献

1
Gene Function Prediction from Functional Association Networks Using Kernel Partial Least Squares Regression.基于核偏最小二乘回归的功能关联网络基因功能预测
PLoS One. 2015 Aug 19;10(8):e0134668. doi: 10.1371/journal.pone.0134668. eCollection 2015.
2
Methods for evaluating clustering algorithms for gene expression data using a reference set of functional classes.使用功能类别参考集评估基因表达数据聚类算法的方法。
BMC Bioinformatics. 2006 Aug 31;7:397. doi: 10.1186/1471-2105-7-397.
3
GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function.GeneMANIA:一种用于预测基因功能的实时多重关联网络整合算法。
Genome Biol. 2008;9 Suppl 1(Suppl 1):S4. doi: 10.1186/gb-2008-9-s1-s4. Epub 2008 Jun 27.
4
Exploiting ontology graph for predicting sparsely annotated gene function.利用本体图预测注释稀疏的基因功能。
Bioinformatics. 2015 Jun 15;31(12):i357-64. doi: 10.1093/bioinformatics/btv260.
5
Deviance residuals-based sparse PLS and sparse kernel PLS regression for censored data.基于偏差残差的稀疏偏最小二乘和稀疏核偏最小二乘回归用于删失数据。
Bioinformatics. 2015 Feb 1;31(3):397-404. doi: 10.1093/bioinformatics/btu660. Epub 2014 Oct 6.
6
Identifying potential association on gene-disease network via dual hypergraph regularized least squares.基于双超图正则化最小二乘法的基因-疾病网络潜在关联识别。
BMC Genomics. 2021 Aug 9;22(1):605. doi: 10.1186/s12864-021-07864-z.
7
bLARS: An Algorithm to Infer Gene Regulatory Networks.bLARS:一种推断基因调控网络的算法
IEEE/ACM Trans Comput Biol Bioinform. 2016 Mar-Apr;13(2):301-14. doi: 10.1109/TCBB.2015.2450740.
8
In silico gene function prediction using ontology-based pattern identification.使用基于本体的模式识别进行计算机基因功能预测。
Bioinformatics. 2005 Apr 1;21(7):1237-45. doi: 10.1093/bioinformatics/bti111. Epub 2004 Nov 5.
9
The ArrayExpress gene expression database: a software engineering and implementation perspective.ArrayExpress基因表达数据库:软件工程与实现视角
Bioinformatics. 2005 Apr 15;21(8):1495-501. doi: 10.1093/bioinformatics/bti157. Epub 2004 Nov 25.
10
SGCP: a spectral self-learning method for clustering genes in co-expression networks.SGCP:一种用于共表达网络中基因聚类的光谱自学习方法。
BMC Bioinformatics. 2024 Jul 2;25(1):230. doi: 10.1186/s12859-024-05848-w.

引用本文的文献

1
Refining modules to determine functionally significant clusters in molecular networks.精炼模块以确定分子网络中功能显著的簇。
BMC Genomics. 2019 Dec 24;20(Suppl 9):901. doi: 10.1186/s12864-019-6294-9.
2
Inferring Disease-Associated MicroRNAs Using Semi-supervised Multi-Label Graph Convolutional Networks.使用半监督多标签图卷积网络推断疾病相关的微小RNA
iScience. 2019 Oct 25;20:265-277. doi: 10.1016/j.isci.2019.09.013. Epub 2019 Sep 16.
3
Bacteria in Cancer Therapeutics: A Framework for Effective Therapeutic Bacterial Screening and Identification.

本文引用的文献

1
Parallel profiling of fission yeast deletion mutants for proliferation and for lifespan during long-term quiescence.在长期静止期对裂殖酵母缺失突变体进行增殖和寿命的平行分析。
G3 (Bethesda). 2014 Dec 1;5(1):145-55. doi: 10.1534/g3.114.014415.
2
Deviance residuals-based sparse PLS and sparse kernel PLS regression for censored data.基于偏差残差的稀疏偏最小二乘和稀疏核偏最小二乘回归用于删失数据。
Bioinformatics. 2015 Feb 1;31(3):397-404. doi: 10.1093/bioinformatics/btu660. Epub 2014 Oct 6.
3
Integration of biological data by kernels on graph nodes allows prediction of new genes involved in mitotic chromosome condensation.
癌症治疗中的细菌:有效治疗性细菌筛选与鉴定框架
J Cancer. 2019 Apr 21;10(8):1781-1793. doi: 10.7150/jca.31699. eCollection 2019.
4
Network propagation: a universal amplifier of genetic associations.网络传播:遗传关联的通用放大器。
Nat Rev Genet. 2017 Sep;18(9):551-562. doi: 10.1038/nrg.2017.38. Epub 2017 Jun 12.
5
Defining functional interactions during biogenesis of epithelial junctions.定义上皮连接生物发生过程中的功能相互作用。
Nat Commun. 2016 Dec 6;7:13542. doi: 10.1038/ncomms13542.
通过图节点上的核进行生物数据整合,能够预测参与有丝分裂染色体凝聚的新基因。
Mol Biol Cell. 2014 Aug 15;25(16):2522-36. doi: 10.1091/mbc.E13-04-0221. Epub 2014 Jun 18.
4
Protein-protein interaction detection: methods and analysis.蛋白质-蛋白质相互作用检测:方法与分析
Int J Proteomics. 2014;2014:147648. doi: 10.1155/2014/147648. Epub 2014 Feb 17.
5
CAFA and the open world of protein function predictions.计算机辅助功能注释(CAFA)与蛋白质功能预测的开放世界
Trends Genet. 2013 Nov;29(11):609-10. doi: 10.1016/j.tig.2013.09.005. Epub 2013 Oct 15.
6
Identification of a lifespan extending mutation in the Schizosaccharomyces pombe cyclin gene clg1+ by direct selection of long-lived mutants.通过直接筛选长寿命突变体,鉴定出裂殖酵母细胞周期蛋白基因 clg1+中的一个延长寿命的突变。
PLoS One. 2013 Jul 9;8(7):e69084. doi: 10.1371/journal.pone.0069084. Print 2013.
7
Characterizing the state of the art in the computational assignment of gene function: lessons from the first critical assessment of functional annotation (CAFA).描述计算基因功能分配方面的最新技术状态:从第一次功能注释(CAFA)的关键评估中吸取的教训。
BMC Bioinformatics. 2013;14 Suppl 3(Suppl 3):S15. doi: 10.1186/1471-2105-14-s3-s15.
8
A large-scale evaluation of computational protein function prediction.大规模计算蛋白质功能预测评估。
Nat Methods. 2013 Mar;10(3):221-7. doi: 10.1038/nmeth.2340. Epub 2013 Jan 27.
9
Assessing identity, redundancy and confounds in Gene Ontology annotations over time.评估基因本体论注释随时间的变化中的身份、冗余和混淆。
Bioinformatics. 2013 Feb 15;29(4):476-82. doi: 10.1093/bioinformatics/bts727. Epub 2013 Jan 6.
10
GenomeRNAi: a database for cell-based and in vivo RNAi phenotypes, 2013 update.GenomeRNAi:一个基于细胞和体内 RNAi 表型的数据库,2013 年更新。
Nucleic Acids Res. 2013 Jan;41(Database issue):D1021-6. doi: 10.1093/nar/gks1170. Epub 2012 Nov 27.