Gaussian graphical model-based heterogeneity analysis via penalized fusion.

Affiliations

School of Mathematics Sciences, University of Chinese Academy of Sciences, Beijing, China.

Key Laboratory of Big Data Mining and Knowledge Management, Chinese Academy of Sciences, Beijing, China.

Publication information

Biometrics. 2022 Jun;78(2):524-535. doi: 10.1111/biom.13426. Epub 2021 Feb 5.

DOI:10.1111/biom.13426

PMID:33501648

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9003628/

Abstract

Heterogeneity is a hallmark of cancer, diabetes, cardiovascular diseases, and many other complex diseases. This study has been partly motivated by the unsupervised heterogeneity analysis for complex diseases based on molecular and imaging data, for which, network-based analysis, by accommodating the interconnections among variables, can be more informative than that limited to mean, variance, and other simple distributional properties. In the literature, there has been very limited research on network-based heterogeneity analysis, and a common limitation shared by the existing techniques is that the number of subgroups needs to be specified a priori or in an ad hoc manner. In this article, we develop a penalized fusion approach for heterogeneity analysis based on the Gaussian graphical model. It applies penalization to the mean and precision matrix parameters to generate regularized and interpretable estimates. More importantly, a fusion penalty is imposed to "automatedly" determine the number of subgroups and generate more concise, reliable, and interpretable estimation. Consistency properties are rigorously established, and an effective computational algorithm is developed. The heterogeneity analysis of non-small-cell lung cancer based on single-cell gene expression data of the Wnt pathway and that of lung adenocarcinoma based on histopathological imaging data not only demonstrate the practical applicability of the proposed approach but also lead to interesting new findings.
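As a rough illustration of how a fusion penalty can determine the number of subgroups, the sketch below starts from a deliberately large number of candidate subgroups, measures the distance between every pair of subgroup parameters (mean vector plus precision matrix), and counts as one subgroup any candidates whose parameters have been shrunk together. The abstract does not specify the penalty function or the distance, so the Euclidean/Frobenius distance, the lasso-type penalty, the tolerance `tol`, and all function names here are assumptions made for illustration, not the authors' implementation.

```python
import math
from itertools import combinations


def fused_distance(mu_a, theta_a, mu_b, theta_b):
    """Distance between two subgroups' parameters (assumed form):
    sqrt(||mu_a - mu_b||_2^2 + ||Theta_a - Theta_b||_F^2)."""
    d_mu = sum((a - b) ** 2 for a, b in zip(mu_a, mu_b))
    d_theta = sum(
        (a - b) ** 2
        for row_a, row_b in zip(theta_a, theta_b)
        for a, b in zip(row_a, row_b)
    )
    return math.sqrt(d_mu + d_theta)


def fusion_penalty(mus, thetas, lam):
    """Assumed lasso-type fusion penalty: lam times the sum of all
    pairwise subgroup distances. It is zero only if all subgroups coincide."""
    pairs = combinations(range(len(mus)), 2)
    return lam * sum(
        fused_distance(mus[k], thetas[k], mus[l], thetas[l]) for k, l in pairs
    )


def count_subgroups(mus, thetas, tol=1e-6):
    """Count distinct subgroups after fusion: candidates whose parameter
    distance is at most tol are merged (union-find over candidate indices)."""
    parent = list(range(len(mus)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for k, l in combinations(range(len(mus)), 2):
        if fused_distance(mus[k], thetas[k], mus[l], thetas[l]) <= tol:
            parent[find(k)] = find(l)
    return len({find(i) for i in range(len(mus))})


# Hypothetical toy estimates: three candidate subgroups, two of them fused.
mus = [[0.0, 0.0], [0.0, 0.0], [2.0, 0.0]]
thetas = [
    [[1.0, 0.3], [0.3, 1.0]],
    [[1.0, 0.3], [0.3, 1.0]],
    [[1.0, 0.0], [0.0, 1.0]],
]
n_subgroups = count_subgroups(mus, thetas)
penalty = fusion_penalty(mus, thetas, lam=1.0)
```

In this toy input the first two candidates share identical parameters, so they count as a single subgroup. In an actual fit, the penalty would be subtracted from a mixture log-likelihood and the fused estimates would come out of the optimization rather than being supplied by hand.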

Similar articles

Gaussian graphical model-based heterogeneity analysis via penalized fusion.

Biometrics. 2022 Jun;78(2):524-535. doi: 10.1111/biom.13426. Epub 2021 Feb 5.

HeteroGGM: an R package for Gaussian graphical model-based heterogeneity analysis.

Bioinformatics. 2021 Sep 29;37(18):3073-3074. doi: 10.1093/bioinformatics/btab134.

Histopathological imaging-based cancer heterogeneity analysis via penalized fusion with model averaging.

Biometrics. 2021 Dec;77(4):1397-1408. doi: 10.1111/biom.13357. Epub 2020 Aug 29.

HETEROGENEITY ANALYSIS VIA INTEGRATING MULTI-SOURCES HIGH-DIMENSIONAL DATA WITH APPLICATIONS TO CANCER STUDIES.

Stat Sin. 2023 Apr;33(2):729-758. doi: 10.5705/ss.202021.0002.

Robust Gaussian graphical modeling via l1 penalization.

Biometrics. 2012 Dec;68(4):1197-206. doi: 10.1111/j.1541-0420.2012.01785.x. Epub 2012 Sep 28.

Information-incorporated Gaussian graphical model for gene expression data.

Biometrics. 2022 Jun;78(2):512-523. doi: 10.1111/biom.13428. Epub 2021 Feb 12.

Estimation of multiple networks with common structures in heterogeneous subgroups.

J Multivar Anal. 2024 Jul;202. doi: 10.1016/j.jmva.2024.105298. Epub 2024 Feb 13.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Assisted estimation of gene expression graphical models.

Genet Epidemiol. 2021 Jun;45(4):372-385. doi: 10.1002/gepi.22377. Epub 2021 Feb 1.

A Statistical Test for Differential Network Analysis Based on Inference of Gaussian Graphical Model.

Sci Rep. 2019 Jul 26;9(1):10863. doi: 10.1038/s41598-019-47362-7.

Cited by

AJGM: joint learning of heterogeneous gene networks with adaptive graphical model.

Bioinformatics. 2025 Mar 4;41(3). doi: 10.1093/bioinformatics/btaf096.

A Selective Review of Network Analysis Methods for Gene Expression Data.

Methods Mol Biol. 2025;2880:293-307. doi: 10.1007/978-1-0716-4276-4_14.

Heterogeneous latent transfer learning in Gaussian graphical models.

Biometrics. 2024 Jul 1;80(3). doi: 10.1093/biomtc/ujae096.

Estimation of multiple networks with common structures in heterogeneous subgroups.

J Multivar Anal. 2024 Jul;202. doi: 10.1016/j.jmva.2024.105298. Epub 2024 Feb 13.

Network-based cancer heterogeneity analysis incorporating multi-view of prior information.

Bioinformatics. 2022 May 13;38(10):2855-2862. doi: 10.1093/bioinformatics/btac183.

References

Histopathological imaging features- versus molecular measurements-based cancer prognosis modeling.

Sci Rep. 2020 Sep 14;10(1):15030. doi: 10.1038/s41598-020-72201-5.

Modulation of regulatory T cell function and stability by co-inhibitory receptors.

Nat Rev Immunol. 2020 Nov;20(11):680-693. doi: 10.1038/s41577-020-0296-3. Epub 2020 Apr 8.

Immunological history governs human stem cell memory CD4 heterogeneity via the Wnt signaling pathway.

Nat Commun. 2020 Feb 10;11(1):821. doi: 10.1038/s41467-020-14442-6.

Simultaneous Clustering and Estimation of Heterogeneous Graphical Models.

J Mach Learn Res. 2018 Apr;18.

Global characterization of T cells in non-small-cell lung cancer by single-cell sequencing.

Nat Med. 2018 Jul;24(7):978-985. doi: 10.1038/s41591-018-0045-3. Epub 2018 Jun 25.

Regulatory T-cell heterogeneity and the cancer immune response.

Clin Transl Immunology. 2017 Sep 15;6(9):e154. doi: 10.1038/cti.2017.43. eCollection 2017 Sep.

Estimation of multiple networks in Gaussian mixture models.

Electron J Stat. 2016;10:1133-1154. doi: 10.1214/16-EJS1135. Epub 2016 May 2.

Human T cell immune surveillance: Phenotypic, functional and migratory heterogeneity for tailored immune responses.

Immunol Lett. 2017 Oct;190:125-129. doi: 10.1016/j.imlet.2017.08.001. Epub 2017 Aug 4.

Regulatory T cells in cancer immunotherapy.

Cell Res. 2017 Jan;27(1):109-118. doi: 10.1038/cr.2016.151. Epub 2016 Dec 20.

The joint graphical lasso for inverse covariance estimation across multiple classes.

J R Stat Soc Series B Stat Methodol. 2014 Mar;76(2):373-397. doi: 10.1111/rssb.12033.
