GC[公式：见正文]NMF：一种用于基因-表型关联预测的新型矩阵分解框架。

GC[Formula: see text]NMF: A Novel Matrix Factorization Framework for Gene-Phenotype Association Prediction.

机构信息

College of Software, NanKai University, TianJin, 300071, China.

Computer Science and Information Engineering, Tianjin University of Science and Technology, TianJin, 300222, China.

出版信息

Interdiscip Sci. 2018 Sep;10(3):572-582. doi: 10.1007/s12539-018-0296-1. Epub 2018 Apr 24.

DOI:10.1007/s12539-018-0296-1

PMID:29691712

Abstract

Gene-phenotype association prediction can be applied to reveal the inherited basis of human diseases and facilitate drug development. Gene-phenotype associations are related to complex biological processes and influenced by various factors, such as relationship between phenotypes and that among genes. While due to sparseness of curated gene-phenotype associations and lack of integrated analysis of the joint effect of multiple factors, existing applications are limited to prediction accuracy and potential gene-phenotype association detection. In this paper, we propose a novel method by exploiting weighted graph constraint learned from hierarchical structures of phenotype data and group prior information among genes by inheriting advantages of Non-negative Matrix Factorization (NMF), called Weighted Graph Constraint and Group Centric Non-negative Matrix Factorization (GC[Formula: see text]NMF). Specifically, first we introduce the depth of parent-child relationships between two adjacent phenotypes in hierarchical phenotypic data as weighted graph constraint for a better phenotype understanding. Second, we utilize intra-group correlation among genes in a gene group as group constraint for gene understanding. Such information provides us with the intuition that genes in a group probably result in similar phenotypes. The model not only allows us to achieve a high-grade prediction performance, but also helps us to learn interpretable representation of genes and phenotypes simultaneously to facilitate future biological analysis. Experimental results on biological gene-phenotype association datasets of mouse and human demonstrate that GC[Formula: see text]NMF can obtain superior prediction accuracy and good understandability for biological explanation over other state-of-the-arts methods.

摘要

基因-表型关联预测可用于揭示人类疾病的遗传基础，促进药物研发。基因-表型关联与复杂的生物过程有关，并受到多种因素的影响，如表型之间和基因之间的关系。然而，由于已注释的基因-表型关联稀疏，以及缺乏对多个因素联合效应的综合分析，现有的应用仅限于预测准确性和潜在的基因-表型关联检测。在本文中，我们提出了一种新的方法，通过利用从层次结构数据和基因之间的组先验信息中学习到的加权图约束来继承非负矩阵分解（NMF）的优势，称为加权图约束和基于群组的非负矩阵分解（GC[Formula: see text]NMF）。具体来说，首先，我们在层次化的表型数据中引入两个相邻表型之间的父子关系深度作为加权图约束，以更好地理解表型。其次，我们利用基因组中基因之间的组内相关性作为组约束，以了解基因。这些信息使我们产生了一个直观的认识，即一个基因组中的基因可能导致相似的表型。该模型不仅可以实现高等级的预测性能，还可以帮助我们同时学习可解释的基因和表型表示，以便于未来的生物学分析。在小鼠和人类的生物基因-表型关联数据集上的实验结果表明，GC[Formula: see text]NMF 可以在其他现有方法之上获得卓越的预测准确性和良好的生物学解释可理解性。

相似文献

GC[Formula: see text]NMF: A Novel Matrix Factorization Framework for Gene-Phenotype Association Prediction.

Interdiscip Sci. 2018 Sep;10(3):572-582. doi: 10.1007/s12539-018-0296-1. Epub 2018 Apr 24.

Metrical Consistency NMF for Predicting Gene-Phenotype Associations.

Interdiscip Sci. 2018 Mar;10(1):189-194. doi: 10.1007/s12539-017-0224-9. Epub 2017 Apr 8.

Data representation using robust nonnegative matrix factorization for edge computing.

Math Biosci Eng. 2022 Jan;19(2):2147-2178. doi: 10.3934/mbe.2022100. Epub 2021 Dec 28.

Tumor clustering using nonnegative matrix factorization with gene selection.

IEEE Trans Inf Technol Biomed. 2009 Jul;13(4):599-607. doi: 10.1109/TITB.2009.2018115. Epub 2009 Apr 14.

Nonnegative Matrix Factorization with Rank Regularization and Hard Constraint.

Neural Comput. 2017 Sep;29(9):2553-2579. doi: 10.1162/neco_a_00995. Epub 2017 Aug 4.

Manifold regularized discriminative nonnegative matrix factorization with fast gradient descent.

IEEE Trans Image Process. 2011 Jul;20(7):2030-48. doi: 10.1109/TIP.2011.2105496. Epub 2011 Jan 13.

Probabilistic non-negative matrix factorization: theory and application to microarray data analysis.

J Bioinform Comput Biol. 2014 Feb;12(1):1450001. doi: 10.1142/S0219720014500012. Epub 2014 Jan 9.

THz spectral data analysis and components unmixing based on non-negative matrix factorization methods.

Spectrochim Acta A Mol Biomol Spectrosc. 2017 Apr 15;177:49-57. doi: 10.1016/j.saa.2017.01.009. Epub 2017 Jan 4.

Learning Microbial Community Structures with Supervised and Unsupervised Non-negative Matrix Factorization.

Microbiome. 2017 Aug 31;5(1):110. doi: 10.1186/s40168-017-0323-1.

Robust Bi-Stochastic Graph Regularized Matrix Factorization for Data Clustering.

IEEE Trans Pattern Anal Mach Intell. 2022 Jan;44(1):390-403. doi: 10.1109/TPAMI.2020.3007673. Epub 2021 Dec 7.

引用本文的文献

Mining functional gene modules by multi-view NMF of phenome-genome association.

BMC Genomics. 2025 Jan 9;23(Suppl 6):868. doi: 10.1186/s12864-024-11120-5.

STS-NLSP: A Network-Based Label Space Partition Method for Predicting the Specificity of Membrane Transporter Substrates Using a Hybrid Feature of Structural and Semantic Similarity.

Front Bioeng Biotechnol. 2019 Nov 6;7:306. doi: 10.3389/fbioe.2019.00306. eCollection 2019.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

GC[公式：见正文]NMF：一种用于基因-表型关联预测的新型矩阵分解框架。

GC[Formula: see text]NMF: A Novel Matrix Factorization Framework for Gene-Phenotype Association Prediction.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献