高斯图模型中的异质潜在转移学习。

Heterogeneous latent transfer learning in Gaussian graphical models.

机构信息

Perelman School of Medicine, The University of Pennsylvania, Philadelphia, PA, 19104, United States.

The Center for Health AI and Synthesis of Evidence (CHASE), The University of Pennsylvania, Philadelphia, PA, 19104, United States.

出版信息

Biometrics. 2024 Jul 1;80(3). doi: 10.1093/biomtc/ujae096.

DOI:10.1093/biomtc/ujae096

PMID:39302138

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11413907/

Abstract

Gaussian graphical models (GGMs) are useful for understanding the complex relationships between biological entities. Transfer learning can improve the estimation of GGMs in a target dataset by incorporating relevant information from related source studies. However, biomedical research often involves intrinsic and latent heterogeneity within a study, such as heterogeneous subpopulations. This heterogeneity can make it difficult to identify informative source studies or lead to negative transfer if the source study is improperly used. To address this challenge, we developed a heterogeneous latent transfer learning (Latent-TL) approach that accounts for both within-sample and between-sample heterogeneity. The idea behind this approach is to "learn from the alike" by leveraging the similarities between source and target GGMs within each subpopulation. The Latent-TL algorithm simultaneously identifies common subpopulation structures among samples and facilitates the learning of target GGMs using source samples from the same subpopulation. Through extensive simulations and real data application, we have shown that the proposed method outperforms single-site learning and standard transfer learning that ignores the latent structures. We have also demonstrated the applicability of the proposed algorithm in characterizing gene co-expression networks in breast cancer patients, where the inferred genetic networks identified many biologically meaningful gene-gene interactions.

摘要

高斯图形模型（GGMs）可用于理解生物实体之间的复杂关系。迁移学习可以通过整合相关来源研究的相关信息来提高目标数据集中 GGM 的估计。然而，生物医学研究通常涉及研究内部的固有和潜在异质性，例如异质亚群。这种异质性可能难以识别信息丰富的源研究，或者如果源研究使用不当，可能导致负迁移。为了解决这一挑战，我们开发了一种异构潜在迁移学习（Latent-TL）方法，该方法考虑了样本内和样本间的异质性。这种方法的思路是通过利用源和目标 GGM 之间的相似性，从相似的方面“学习”。Latent-TL 算法同时识别样本之间的常见亚群结构，并利用来自同一亚群的源样本促进目标 GGM 的学习。通过广泛的模拟和真实数据应用，我们表明所提出的方法优于单站点学习和忽略潜在结构的标准迁移学习。我们还证明了所提出的算法在表征乳腺癌患者基因共表达网络中的适用性，其中推断的遗传网络确定了许多具有生物学意义的基因-基因相互作用。

相似文献

Heterogeneous latent transfer learning in Gaussian graphical models.高斯图模型中的异质潜在转移学习。

Biometrics. 2024 Jul 1;80(3). doi: 10.1093/biomtc/ujae096.

Short-Term Memory Impairment短期记忆障碍

Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.评估慢性阻塞性肺疾病干预措施的比较效果：面向临床医生的网状Meta分析教程

Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x.

Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益

Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果：来自系统评价和意大利医院数据评估的证据]

Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.

Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.首次就诊时磁共振灌注成像用于鉴别低级别与高级别胶质瘤

Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2.

Variation within and between digital pathology and light microscopy for the diagnosis of histopathology slides: blinded crossover comparison study.数字病理学与光学显微镜检查在组织病理学切片诊断中的内部及相互间差异：双盲交叉对比研究

Health Technol Assess. 2025 Jul;29(30):1-75. doi: 10.3310/SPLK4325.

The effectiveness of using non-traditional teaching methods to prepare student health care professionals for the delivery of mental state examination: a systematic review.使用非传统教学方法培养学生医护专业人员进行精神状态检查的有效性：一项系统综述。

JBI Database System Rev Implement Rep. 2015 Aug 14;13(7):177-212. doi: 10.11124/jbisrir-2015-2263.

Can a Liquid Biopsy Detect Circulating Tumor DNA With Low-passage Whole-genome Sequencing in Patients With a Sarcoma? A Pilot Evaluation.液体活检能否通过低深度全基因组测序检测肉瘤患者的循环肿瘤DNA？一项初步评估。

Clin Orthop Relat Res. 2025 Jan 1;483(1):39-48. doi: 10.1097/CORR.0000000000003161. Epub 2024 Jun 21.

本文引用的文献

Transfer Learning under High-dimensional Generalized Linear Models.高维广义线性模型下的迁移学习

J Am Stat Assoc. 2023;118(544):2684-2697. doi: 10.1080/01621459.2022.2071278. Epub 2022 Jun 27.

Transfer Learning in Large-scale Gaussian Graphical Models with False Discovery Rate Control.具有错误发现率控制的大规模高斯图形模型中的迁移学习

J Am Stat Assoc. 2023;118(543):2171-2183. doi: 10.1080/01621459.2022.2044333. Epub 2022 Mar 18.

Transfer Learning for High-Dimensional Linear Regression: Prediction, Estimation and Minimax Optimality.高维线性回归的迁移学习：预测、估计与极小极大最优性

J R Stat Soc Series B Stat Methodol. 2022 Feb;84(1):149-173. doi: 10.1111/rssb.12479. Epub 2021 Nov 16.

FDA-Approved and Emerging Next Generation Predictive Biomarkers for Immune Checkpoint Inhibitors in Cancer Patients.美国食品药品监督管理局批准的以及正在兴起的用于癌症患者免疫检查点抑制剂的下一代预测生物标志物。

Front Oncol. 2021 Jun 7;11:683419. doi: 10.3389/fonc.2021.683419. eCollection 2021.

Luminal A Breast Cancer Co-expression Network: Structural and Functional Alterations.管腔A型乳腺癌共表达网络：结构与功能改变

Front Genet. 2021 Apr 20;12:629475. doi: 10.3389/fgene.2021.629475. eCollection 2021.

Gaussian graphical model-based heterogeneity analysis via penalized fusion.基于高斯图模型的惩罚融合异质性分析。

Biometrics. 2022 Jun;78(2):524-535. doi: 10.1111/biom.13426. Epub 2021 Feb 5.

The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets.2021 年的 STRING 数据库：可定制的蛋白质-蛋白质网络，以及用户上传的基因/测量集的功能特征分析。

Nucleic Acids Res. 2021 Jan 8;49(D1):D605-D612. doi: 10.1093/nar/gkaa1074.

Proteogenomic Landscape of Breast Cancer Tumorigenesis and Targeted Therapy.乳腺癌发生和靶向治疗的蛋白质基因组全景分析

Cell. 2020 Nov 25;183(5):1436-1456.e31. doi: 10.1016/j.cell.2020.10.036. Epub 2020 Nov 18.

Heterogeneity of response to immune checkpoint blockade in hypermutated experimental gliomas.免疫检查点阻断在高突变实验性脑胶质瘤中的反应异质性。

Nat Commun. 2020 Feb 18;11(1):931. doi: 10.1038/s41467-020-14642-0.

Identification of prognostic significance of BIRC5 in breast cancer using integrative bioinformatics analysis.利用综合生物信息学分析鉴定乳腺癌中 BIRC5 的预后意义。

Biosci Rep. 2020 Feb 28;40(2). doi: 10.1042/BSR20193678.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验