• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

本质回归:一种从多组学数据集中推断因果潜在因素的可推广框架。

Essential Regression: A generalizable framework for inferring causal latent factors from multi-omic datasets.

作者信息

Bing Xin, Lovelace Tyler, Bunea Florentina, Wegkamp Marten, Kasturi Sudhir Pai, Singh Harinder, Benos Panayiotis V, Das Jishnu

机构信息

Department of Statistics and Data Science, Cornell University, Ithaca, NY, USA.

Department of Computational & Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA.

出版信息

Patterns (N Y). 2022 Mar 24;3(5):100473. doi: 10.1016/j.patter.2022.100473. eCollection 2022 May 13.

DOI:10.1016/j.patter.2022.100473
PMID:35607614
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9122954/
Abstract

High-dimensional cellular and molecular profiling of biological samples highlights the need for analytical approaches that can integrate multi-omic datasets to generate prioritized causal inferences. Current methods are limited by high dimensionality of the combined datasets, the differences in their data distributions, and their integration to infer causal relationships. Here, we present Essential Regression (ER), a novel latent-factor-regression-based interpretable machine-learning approach that addresses these problems by identifying latent factors and their likely cause-effect relationships with system-wide outcomes/properties of interest. ER can integrate many multi-omic datasets without structural or distributional assumptions regarding the data. It outperforms a range of state-of-the-art methods in terms of prediction. ER can be coupled with probabilistic graphical modeling, thereby strengthening the causal inferences. The utility of ER is demonstrated using multi-omic system immunology datasets to generate and validate novel cellular and molecular inferences in a wide range of contexts including immunosenescence and immune dysregulation.

摘要

生物样本的高维细胞和分子分析凸显了对能够整合多组学数据集以生成优先因果推断的分析方法的需求。当前方法受到组合数据集的高维度、数据分布差异以及用于推断因果关系的整合的限制。在此,我们提出了基本回归(ER),这是一种基于潜在因素回归的新型可解释机器学习方法,它通过识别潜在因素及其与感兴趣的全系统结果/属性可能的因果关系来解决这些问题。ER可以整合许多多组学数据集,而无需对数据进行结构或分布假设。在预测方面,它优于一系列最先进的方法。ER可以与概率图形建模相结合,从而加强因果推断。使用多组学系统免疫学数据集在包括免疫衰老和免疫失调在内的广泛背景下生成并验证新的细胞和分子推断,证明了ER的实用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/9122954/1b45d0f1b308/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/9122954/0a6d0a1aa089/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/9122954/b57708397259/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/9122954/01aa981570dc/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/9122954/08aa409ead13/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/9122954/375d064b70c5/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/9122954/1b45d0f1b308/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/9122954/0a6d0a1aa089/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/9122954/b57708397259/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/9122954/01aa981570dc/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/9122954/08aa409ead13/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/9122954/375d064b70c5/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/9122954/1b45d0f1b308/gr5.jpg

相似文献

1
Essential Regression: A generalizable framework for inferring causal latent factors from multi-omic datasets.本质回归:一种从多组学数据集中推断因果潜在因素的可推广框架。
Patterns (N Y). 2022 Mar 24;3(5):100473. doi: 10.1016/j.patter.2022.100473. eCollection 2022 May 13.
2
SLIDE: Significant Latent Factor Interaction Discovery and Exploration across biological domains.幻灯片:跨生物领域的显著潜在因子交互发现与探索。
Nat Methods. 2024 May;21(5):835-845. doi: 10.1038/s41592-024-02175-z. Epub 2024 Feb 19.
3
Unfolding and de-confounding: biologically meaningful causal inference from longitudinal multi-omic networks using METALICA.展开与去混淆:利用 METALICA 从纵向多组学网络中进行具有生物学意义的因果推断
mSystems. 2024 Oct 22;9(10):e0130323. doi: 10.1128/msystems.01303-23. Epub 2024 Sep 6.
4
Unfolding and De-confounding: Biologically meaningful causal inference from longitudinal multi-omic networks using METALICA.展开与去混淆:使用METALICA从纵向多组学网络进行具有生物学意义的因果推断
bioRxiv. 2023 Dec 13:2023.12.12.571384. doi: 10.1101/2023.12.12.571384.
5
Bayesian Multi-View Clustering given complex inter-view structure.贝叶斯多视图聚类,给定复杂的视图间结构。
F1000Res. 2024 Feb 29;11:1460. doi: 10.12688/f1000research.126215.2. eCollection 2022.
6
Causal Discovery in High-dimensional, Multicollinear Datasets.高维、多重共线性数据集中的因果发现
Front Epidemiol. 2022;2. doi: 10.3389/fepid.2022.899655. Epub 2022 Sep 13.
7
MPAC: a computational framework for inferring pathway activities from multi-omic data.MPAC:一个用于从多组学数据推断通路活性的计算框架。
bioRxiv. 2025 Mar 31:2024.06.15.599113. doi: 10.1101/2024.06.15.599113.
8
Computational approaches leveraging integrated connections of multi-omic data toward clinical applications.利用多组学数据的整合连接实现临床应用的计算方法。
Mol Omics. 2022 Jan 17;18(1):7-18. doi: 10.1039/d1mo00158b.
9
Multi-project and Multi-profile joint Non-negative Matrix Factorization for cancer omic datasets.多项目多谱联合非负矩阵分解在癌症组学数据集上的应用。
Bioinformatics. 2021 Dec 11;37(24):4801-4809. doi: 10.1093/bioinformatics/btab579.
10
Interpretable multi-omics integration with UMAP embeddings and density-based clustering.基于UMAP嵌入和密度聚类的可解释多组学整合
bioRxiv. 2024 Oct 11:2024.10.07.617035. doi: 10.1101/2024.10.07.617035.

引用本文的文献

1
Deep humoral profiling coupled to interpretable machine learning unveils diagnostic markers and pathophysiology of schistosomiasis.深度体液分析结合可解释的机器学习揭示了血吸虫病的诊断标志物和病理生理学。
Sci Transl Med. 2024 Sep 18;16(765):eadk7832. doi: 10.1126/scitranslmed.adk7832.
2
Data-Driven Approaches to Predict Dendrimer Cytotoxicity.预测树枝状大分子细胞毒性的数据驱动方法。
ACS Omega. 2024 May 27;9(23):24899-24906. doi: 10.1021/acsomega.4c01775. eCollection 2024 Jun 11.
3
SLIDE: Significant Latent Factor Interaction Discovery and Exploration across biological domains.

本文引用的文献

1
A yeast expressed RBD-based SARS-CoV-2 vaccine formulated with 3M-052-alum adjuvant promotes protective efficacy in non-human primates.一种使用 3M-052 佐剂配制的基于酵母表达 RBD 的 SARS-CoV-2 疫苗可在非人类灵长类动物中促进保护效力。
Sci Immunol. 2021 Jul 15;6(61). doi: 10.1126/sciimmunol.abh3634.
2
Causal integration of multi-omics data with prior knowledge to generate mechanistic hypotheses.基于先验知识整合多组学数据以生成机制假设。
Mol Syst Biol. 2021 Jan;17(1):e9730. doi: 10.15252/msb.20209730.
3
Mining for humoral correlates of HIV control and latent reservoir size.
幻灯片:跨生物领域的显著潜在因子交互发现与探索。
Nat Methods. 2024 May;21(5):835-845. doi: 10.1038/s41592-024-02175-z. Epub 2024 Feb 19.
4
From bench to bedside via bytes: Multi-omic immunoprofiling and integration using machine learning and network approaches.从实验室到病床:利用机器学习和网络方法进行多组学免疫分析和整合。
Hum Vaccin Immunother. 2023 Dec 15;19(3):2282803. doi: 10.1080/21645515.2023.2282803.
5
Causal Discovery in High-dimensional, Multicollinear Datasets.高维、多重共线性数据集中的因果发现
Front Epidemiol. 2022;2. doi: 10.3389/fepid.2022.899655. Epub 2022 Sep 13.
6
Determining asthma endotypes and outcomes: Complementing existing clinical practice with modern machine learning.确定哮喘表型和结局:用现代机器学习方法补充现有临床实践。
Cell Rep Med. 2022 Dec 20;3(12):100857. doi: 10.1016/j.xcrm.2022.100857.
7
High Dimensional Multiomics Reveals Unique Characteristics of Early Plasma Administration in Polytrauma Patients With TBI.高维多组学揭示了 TBI 多发伤患者早期血浆治疗的独特特征。
Ann Surg. 2022 Oct 1;276(4):673-683. doi: 10.1097/SLA.0000000000005610. Epub 2022 Jul 19.
挖掘与 HIV 控制和潜伏储存大小相关的体液标志物。
PLoS Pathog. 2020 Oct 13;16(10):e1008868. doi: 10.1371/journal.ppat.1008868. eCollection 2020 Oct.
4
Mapping functional humoral correlates of protection against malaria challenge following RTS,S/AS01 vaccination.绘制 RTS,S/AS01 疫苗接种后针对疟疾挑战的功能性体液保护相关性图谱。
Sci Transl Med. 2020 Jul 22;12(553). doi: 10.1126/scitranslmed.abb4757.
5
MOFA+: a statistical framework for comprehensive integration of multi-modal single-cell data.MOFA+:一种全面整合多模态单细胞数据的统计框架。
Genome Biol. 2020 May 11;21(1):111. doi: 10.1186/s13059-020-02015-1.
6
CausalMGM: an interactive web-based causal discovery tool.因果推断工具 CausalMGM:一款交互式网页工具
Nucleic Acids Res. 2020 Jul 2;48(W1):W597-W602. doi: 10.1093/nar/gkaa350.
7
Antibody Fc Glycosylation Discriminates Between Latent and Active Tuberculosis.抗体 Fc 糖基化可区分潜伏性和活动性结核病。
J Infect Dis. 2020 Nov 13;222(12):2093-2102. doi: 10.1093/infdis/jiz643.
8
Evaluation of Causal Structure Learning Methods on Mixed Data Types.混合数据类型下因果结构学习方法的评估
Proc Mach Learn Res. 2018 Aug;92:48-65.
9
Feasibility of lung cancer prediction from low-dose CT scan and smoking factors using causal models.利用因果模型从低剂量 CT 扫描和吸烟因素预测肺癌的可行性。
Thorax. 2019 Jul;74(7):643-649. doi: 10.1136/thoraxjnl-2018-212638. Epub 2019 Mar 12.
10
PARP1 rs1805407 Increases Sensitivity to PARP1 Inhibitors in Cancer Cells Suggesting an Improved Therapeutic Strategy.PARP1 rs1805407 增加癌细胞对 PARP1 抑制剂的敏感性,提示改善治疗策略。
Sci Rep. 2019 Mar 1;9(1):3309. doi: 10.1038/s41598-019-39542-2.