Suppr超能文献

通过网络引导的矩阵补全在上位性全基因组关联图谱中的数据插补

Data Imputation in Epistatic MAPs by Network-Guided Matrix Completion.

作者信息

Žitnik Marinka, Zupan Blaž

机构信息

1Faculty of Computer and Information Science, University of Ljubljana, Ljubljana, Slovenia.

2Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas.

出版信息

J Comput Biol. 2015 Jun;22(6):595-608. doi: 10.1089/cmb.2014.0158. Epub 2015 Feb 6.

Abstract

Epistatic miniarray profile (E-MAP) is a popular large-scale genetic interaction discovery platform. E-MAPs benefit from quantitative output, which makes it possible to detect subtle interactions with greater precision. However, due to the limits of biotechnology, E-MAP studies fail to measure genetic interactions for up to 40% of gene pairs in an assay. Missing measurements can be recovered by computational techniques for data imputation, in this way completing the interaction profiles and enabling downstream analysis algorithms that could otherwise be sensitive to missing data values. We introduce a new interaction data imputation method called network-guided matrix completion (NG-MC). The core part of NG-MC is low-rank probabilistic matrix completion that incorporates prior knowledge presented as a collection of gene networks. NG-MC assumes that interactions are transitive, such that latent gene interaction profiles inferred by NG-MC depend on the profiles of their direct neighbors in gene networks. As the NG-MC inference algorithm progresses, it propagates latent interaction profiles through each of the networks and updates gene network weights toward improved prediction. In a study with four different E-MAP data assays and considered protein-protein interaction and gene ontology similarity networks, NG-MC significantly surpassed existing alternative techniques. Inclusion of information from gene networks also allowed NG-MC to predict interactions for genes that were not included in original E-MAP assays, a task that could not be considered by current imputation approaches.

摘要

上位性微阵列谱(E-MAP)是一个广受欢迎的大规模遗传相互作用发现平台。E-MAP受益于定量输出,这使得以更高的精度检测微妙的相互作用成为可能。然而,由于生物技术的局限性,E-MAP研究在一次检测中无法测量高达40%的基因对之间的遗传相互作用。缺失的测量值可以通过数据插补的计算技术来恢复,通过这种方式完成相互作用谱,并启用否则可能对缺失数据值敏感的下游分析算法。我们引入了一种新的相互作用数据插补方法,称为网络引导矩阵补全(NG-MC)。NG-MC的核心部分是低秩概率矩阵补全,它结合了作为基因网络集合呈现的先验知识。NG-MC假设相互作用是可传递的,因此NG-MC推断出的潜在基因相互作用谱取决于其在基因网络中直接邻居的谱。随着NG-MC推理算法的推进,它通过每个网络传播潜在相互作用谱,并朝着改进预测的方向更新基因网络权重。在一项使用四种不同E-MAP数据检测并考虑蛋白质-蛋白质相互作用和基因本体相似性网络的研究中,NG-MC显著超越了现有的替代技术。纳入来自基因网络的信息还使NG-MC能够预测原始E-MAP检测中未包含的基因之间的相互作用,这是当前插补方法无法完成的任务。

相似文献

1
Data Imputation in Epistatic MAPs by Network-Guided Matrix Completion.
J Comput Biol. 2015 Jun;22(6):595-608. doi: 10.1089/cmb.2014.0158. Epub 2015 Feb 6.
2
Imputing missing values for genetic interaction data.
Methods. 2014 Jun 1;67(3):269-77. doi: 10.1016/j.ymeth.2014.03.032. Epub 2014 Apr 6.
3
Missing value imputation for epistatic MAPs.
BMC Bioinformatics. 2010 Apr 20;11:197. doi: 10.1186/1471-2105-11-197.
4
Imputing and predicting quantitative genetic interactions in epistatic MAPs.
Methods Mol Biol. 2011;781:353-61. doi: 10.1007/978-1-61779-276-2_17.
5
Exploitation of genetic interaction network topology for the prediction of epistatic behavior.
Genomics. 2013 Oct;102(4):202-8. doi: 10.1016/j.ygeno.2013.07.010. Epub 2013 Jul 25.
6
Extracting between-pathway models from E-MAP interactions using expected graph compression.
J Comput Biol. 2011 Mar;18(3):379-90. doi: 10.1089/cmb.2010.0268.
8
Gene expression complex networks: synthesis, identification, and analysis.
J Comput Biol. 2011 Oct;18(10):1353-67. doi: 10.1089/cmb.2010.0118. Epub 2011 May 6.

引用本文的文献

1
Application of network link prediction in drug discovery.
BMC Bioinformatics. 2021 Apr 12;22(1):187. doi: 10.1186/s12859-021-04082-y.
2
Exploring genetic interaction manifolds constructed from rich single-cell phenotypes.
Science. 2019 Aug 23;365(6455):786-793. doi: 10.1126/science.aax4438. Epub 2019 Aug 8.
3
Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities.
Inf Fusion. 2019 Oct;50:71-91. doi: 10.1016/j.inffus.2018.09.012. Epub 2018 Sep 21.
4
Prediction of Genetic Interactions Using Machine Learning and Network Properties.
Front Bioeng Biotechnol. 2015 Oct 26;3:172. doi: 10.3389/fbioe.2015.00172. eCollection 2015.

本文引用的文献

1
A lipid E-MAP identifies Ubx2 as a critical regulator of lipid saturation and lipid bilayer stress.
Mol Cell. 2013 Aug 22;51(4):519-30. doi: 10.1016/j.molcel.2013.06.014. Epub 2013 Jul 25.
2
Predicting protein-protein interactions from multimodal biological data sources via nonnegative matrix tri-factorization.
J Comput Biol. 2013 Apr;20(4):344-58. doi: 10.1089/cmb.2012.0273. Epub 2013 Mar 19.
3
Simultaneous non-negative matrix factorization for multiple large scale gene expression datasets in toxicology.
PLoS One. 2012;7(12):e48238. doi: 10.1371/journal.pone.0048238. Epub 2012 Dec 14.
4
Multiple genetic interaction experiments provide complementary information useful for gene function prediction.
PLoS Comput Biol. 2012;8(6):e1002559. doi: 10.1371/journal.pcbi.1002559. Epub 2012 Jun 21.
5
A classifier ensemble approach for the missing feature problem.
Artif Intell Med. 2012 May;55(1):37-50. doi: 10.1016/j.artmed.2011.11.006. Epub 2011 Dec 20.
6
Imputing and predicting quantitative genetic interactions in epistatic MAPs.
Methods Mol Biol. 2011;781:353-61. doi: 10.1007/978-1-61779-276-2_17.
7
Towards better accuracy for missing value estimation of epistatic miniarray profiling data by a novel ensemble approach.
Genomics. 2011 May;97(5):257-64. doi: 10.1016/j.ygeno.2011.03.001. Epub 2011 Mar 21.
10
An integrative multi-network and multi-classifier approach to predict genetic interactions.
PLoS Comput Biol. 2010 Sep 9;6(9):e1000928. doi: 10.1371/journal.pcbi.1000928.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验