Suppr超能文献

幻灯片:跨生物领域的显著潜在因子交互发现与探索。

SLIDE: Significant Latent Factor Interaction Discovery and Exploration across biological domains.

机构信息

Center for Systems Immunology, Departments of Immunology and Computational & Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA.

Joint CMU-Pitt PhD Program in Computational Biology, Pittsburgh, PA, USA.

出版信息

Nat Methods. 2024 May;21(5):835-845. doi: 10.1038/s41592-024-02175-z. Epub 2024 Feb 19.

Abstract

Modern multiomic technologies can generate deep multiscale profiles. However, differences in data modalities, multicollinearity of the data, and large numbers of irrelevant features make analyses and integration of high-dimensional omic datasets challenging. Here we present Significant Latent Factor Interaction Discovery and Exploration (SLIDE), a first-in-class interpretable machine learning technique for identifying significant interacting latent factors underlying outcomes of interest from high-dimensional omic datasets. SLIDE makes no assumptions regarding data-generating mechanisms, comes with theoretical guarantees regarding identifiability of the latent factors/corresponding inference, and has rigorous false discovery rate control. Using SLIDE on single-cell and spatial omic datasets, we uncovered significant interacting latent factors underlying a range of molecular, cellular and organismal phenotypes. SLIDE outperforms/performs at least as well as a wide range of state-of-the-art approaches, including other latent factor approaches. More importantly, it provides biological inference beyond prediction that other methods do not afford. Thus, SLIDE is a versatile engine for biological discovery from modern multiomic datasets.

摘要

现代多组学技术可以生成深度多尺度图谱。然而,数据模态的差异、数据的多重共线性以及大量不相关的特征使得分析和整合高维组学数据集具有挑战性。在这里,我们提出了 Significant Latent Factor Interaction Discovery and Exploration (SLIDE),这是一种首创的可解释机器学习技术,用于从高维组学数据集中识别与感兴趣结果相关的显著相互作用的潜在因素。SLIDE 对数据生成机制没有任何假设,对潜在因素/相应推断的可识别性有理论保证,并且具有严格的错误发现率控制。在单细胞和空间组学数据集上使用 SLIDE,我们发现了一系列分子、细胞和机体表型背后存在显著相互作用的潜在因素。SLIDE 的性能优于/至少与包括其他潜在因素方法在内的广泛的最先进方法相当。更重要的是,它提供了其他方法无法提供的超越预测的生物学推断。因此,SLIDE 是从现代多组学数据中进行生物发现的通用引擎。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c0ae/11588359/f079ab03c62f/nihms-2034692-f0001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验