
Linked matrix factorization.

Authors

O'Connell Michael J, Lock Eric F

Affiliations

Department of Statistics, Miami University, Oxford, Ohio 45056.

Division of Biostatistics, University of Minnesota, Minneapolis, Minnesota 55455.

Publication information

Biometrics. 2019 Jun;75(2):582-592. doi: 10.1111/biom.13010. Epub 2019 Apr 2.

DOI: 10.1111/biom.13010
PMID: 30516272

Abstract

Several recent methods address the dimension reduction and decomposition of linked high-content data matrices. Typically, these methods consider one dimension, rows or columns, that is shared among the matrices. This shared dimension may represent common features measured for different sample sets (horizontal integration) or a common sample set with features from different platforms (vertical integration). We introduce an approach for simultaneous horizontal and vertical integration, Linked Matrix Factorization (LMF), for the general case where some matrices share rows (e.g., features) and some share columns (e.g., samples). Our motivating application is a cytotoxicity study with accompanying genomic and molecular chemical attribute data. The toxicity matrix (cell lines × chemicals) shares samples with a genotype matrix (cell lines × SNPs) and shares features with a molecular attribute matrix (chemicals × attributes). LMF gives a unified low-rank factorization of these three matrices, which allows for the decomposition of systematic variation that is shared and systematic variation that is specific to each matrix. This allows for efficient dimension reduction, exploratory visualization, and the imputation of missing data even when entire rows or columns are missing. We present theoretical results concerning the uniqueness, identifiability, and minimal parametrization of LMF, and evaluate it with extensive simulation studies.
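
The idea of a unified low-rank factorization across matrices that share rows or columns can be illustrated with a small sketch. The code below is not the authors' LMF algorithm: it assumes a simplified model with shared structure only (no matrix-specific components), in which X ≈ U Vᵀ, Y ≈ U Aᵀ, and Z ≈ V Bᵀ, and fits it by plain alternating least squares on fully observed data. The function name `fit_linked_factorization` and the symbols U, V, A, B are hypothetical choices made for this illustration.

```python
import numpy as np


def fit_linked_factorization(X, Y, Z, rank, n_iter=100, seed=0):
    """Alternating least squares for a shared-factor model (illustrative only).

    Assumed model (a simplification, not taken from the paper):
        X (cell lines x chemicals)  ~ U @ V.T
        Y (cell lines x SNPs)       ~ U @ A.T
        Z (chemicals x attributes)  ~ V @ B.T
    Returns the factors and the squared-error loss after each sweep.
    """
    rng = np.random.default_rng(seed)
    V = rng.normal(size=(X.shape[1], rank))
    A = rng.normal(size=(Y.shape[1], rank))
    B = rng.normal(size=(Z.shape[1], rank))
    losses = []
    for _ in range(n_iter):
        # U is shared by X and Y: fit [X Y] ~ U @ [V; A].T in one least-squares solve.
        U = np.linalg.lstsq(np.vstack([V, A]), np.hstack([X, Y]).T, rcond=None)[0].T
        # A appears only in the Y term.
        A = np.linalg.lstsq(U, Y, rcond=None)[0].T
        # V is shared by X (column factor) and Z (row factor).
        V = np.linalg.lstsq(np.vstack([U, B]), np.hstack([X.T, Z]).T, rcond=None)[0].T
        # B appears only in the Z term.
        B = np.linalg.lstsq(V, Z, rcond=None)[0].T
        loss = (
            np.sum((X - U @ V.T) ** 2)
            + np.sum((Y - U @ A.T) ** 2)
            + np.sum((Z - V @ B.T) ** 2)
        )
        losses.append(float(loss))
    return U, V, A, B, losses


# Simulated example: 8 cell lines, 6 chemicals, 10 SNPs, 5 attributes, rank 2.
rng = np.random.default_rng(1)
U0, V0 = rng.normal(size=(8, 2)), rng.normal(size=(6, 2))
A0, B0 = rng.normal(size=(10, 2)), rng.normal(size=(5, 2))
X, Y, Z = U0 @ V0.T, U0 @ A0.T, V0 @ B0.T
U, V, A, B, losses = fit_linked_factorization(X, Y, Z, rank=2, n_iter=50)
print("first and last loss:", losses[0], losses[-1])
```

Each update solves an exact least-squares problem for one factor with the others held fixed, so the total loss cannot increase from one sweep to the next. Handling missing entries, or entire missing rows and columns as the abstract describes, would require restricting each solve to the observed entries, which this sketch does not attempt.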

Similar articles

1. Linked matrix factorization.
   Biometrics. 2019 Jun;75(2):582-592. doi: 10.1111/biom.13010. Epub 2019 Apr 2.
2. BIDIMENSIONAL LINKED MATRIX FACTORIZATION FOR PAN-OMICS PAN-CANCER ANALYSIS.
   Ann Appl Stat. 2022 Mar;16(1):193-215. doi: 10.1214/21-AOAS1495. Epub 2022 Mar 28.
3. Integrative factorization of bidimensionally linked matrices.
   Biometrics. 2020 Mar;76(1):61-74. doi: 10.1111/biom.13141. Epub 2019 Nov 10.
4. TRANSPOSABLE REGULARIZED COVARIANCE MODELS WITH AN APPLICATION TO MISSING DATA IMPUTATION.
   Ann Appl Stat. 2010 Jun;4(2):764-790. doi: 10.1214/09-AOAS314.
5. Hierarchical nuclear norm penalization for multi-view data integration.
   Biometrics. 2023 Dec;79(4):2933-2946. doi: 10.1111/biom.13893. Epub 2023 Jun 22.
6. DeepMF: deciphering the latent patterns in omics profiles with a deep learning method.
   BMC Bioinformatics. 2019 Dec 27;20(Suppl 23):648. doi: 10.1186/s12859-019-3291-6.
7. Handling missing rows in multi-omics data integration: multiple imputation in multiple factor analysis framework.
   BMC Bioinformatics. 2016 Oct 3;17(1):402. doi: 10.1186/s12859-016-1273-5.
8. Multiple augmented reduced rank regression for pan-cancer analysis.
   Biometrics. 2024 Jan 29;80(1). doi: 10.1093/biomtc/ujad002.
9. HiPiler: Visual Exploration of Large Genome Interaction Matrices with Interactive Small Multiples.
   IEEE Trans Vis Comput Graph. 2018 Jan;24(1):522-531. doi: 10.1109/TVCG.2017.2745978. Epub 2017 Aug 29.
10. Kernelized Bayesian Matrix Factorization.
    IEEE Trans Pattern Anal Mach Intell. 2014 Oct;36(10):2047-60. doi: 10.1109/TPAMI.2014.2313125.

Cited by

1. Dimension-wise sparse low-rank approximation of a matrix with application to variable selection in high-dimensional integrative analyzes of association.
   J Appl Stat. 2021 Aug 19;49(15):3889-3907. doi: 10.1080/02664763.2021.1967892. eCollection 2022.
2. BAYESIAN JOINT MODELING OF CHEMICAL STRUCTURE AND DOSE RESPONSE CURVES.
   Ann Appl Stat. 2021 Sep;15(3):1405-1430. doi: 10.1214/21-aoas1461. Epub 2021 Sep 23.
3. BIDIMENSIONAL LINKED MATRIX FACTORIZATION FOR PAN-OMICS PAN-CANCER ANALYSIS.
   Ann Appl Stat. 2022 Mar;16(1):193-215. doi: 10.1214/21-AOAS1495. Epub 2022 Mar 28.
4. Integrative factorization of bidimensionally linked matrices.
   Biometrics. 2020 Mar;76(1):61-74. doi: 10.1111/biom.13141. Epub 2019 Nov 10.