• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于可解释的贪心决策森林的多组学疾病模块检测。

Multi-omics disease module detection with an explainable Greedy Decision Forest.

机构信息

Institute for Medical Informatics Statistics and Documentation, Medical University Graz, Graz, Austria.

MI2DataLab, Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland.

出版信息

Sci Rep. 2022 Oct 7;12(1):16857. doi: 10.1038/s41598-022-21417-8.

DOI:10.1038/s41598-022-21417-8
PMID:36207536
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9546860/
Abstract

Machine learning methods can detect complex relationships between variables, but usually do not exploit domain knowledge. This is a limitation because in many scientific disciplines, such as systems biology, domain knowledge is available in the form of graphs or networks, and its use can improve model performance. We need network-based algorithms that are versatile and applicable in many research areas. In this work, we demonstrate subnetwork detection based on multi-modal node features using a novel Greedy Decision Forest (GDF) with inherent interpretability. The latter will be a crucial factor to retain experts and gain their trust in such algorithms. To demonstrate a concrete application example, we focus on bioinformatics, systems biology and particularly biomedicine, but the presented methodology is applicable in many other domains as well. Systems biology is a good example of a field in which statistical data-driven machine learning enables the analysis of large amounts of multi-modal biomedical data. This is important to reach the future goal of precision medicine, where the complexity of patients is modeled on a system level to best tailor medical decisions, health practices and therapies to the individual patient. Our proposed explainable approach can help to uncover disease-causing network modules from multi-omics data to better understand complex diseases such as cancer.

摘要

机器学习方法可以检测变量之间的复杂关系,但通常不会利用领域知识。这是一个局限性,因为在许多科学学科中,如系统生物学,领域知识以图或网络的形式存在,并且其使用可以提高模型性能。我们需要具有多功能性和适用于许多研究领域的基于网络的算法。在这项工作中,我们使用具有内在可解释性的新型贪婪决策森林 (GDF) 基于多模态节点特征来演示子网络检测。后者将是保留专家并赢得他们对这些算法信任的关键因素。为了展示一个具体的应用示例,我们专注于生物信息学、系统生物学,特别是生物医学,但所提出的方法也适用于许多其他领域。系统生物学是一个很好的例子,在这个领域中,统计数据驱动的机器学习使我们能够分析大量多模态的生物医学数据。这对于实现精准医学的未来目标很重要,在该目标中,将患者的复杂性建模到系统级别,以便根据个体患者的情况来最好地定制医疗决策、健康实践和治疗。我们提出的可解释方法可以帮助从多组学数据中发现导致疾病的网络模块,以更好地理解癌症等复杂疾病。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffef/9546860/97b7875130fa/41598_2022_21417_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffef/9546860/3546bd1d37eb/41598_2022_21417_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffef/9546860/34703d892a41/41598_2022_21417_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffef/9546860/cbdee8f22789/41598_2022_21417_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffef/9546860/93d1743cb1f1/41598_2022_21417_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffef/9546860/cecaa11304ef/41598_2022_21417_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffef/9546860/b8d1c61b7ef3/41598_2022_21417_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffef/9546860/97b7875130fa/41598_2022_21417_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffef/9546860/3546bd1d37eb/41598_2022_21417_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffef/9546860/34703d892a41/41598_2022_21417_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffef/9546860/cbdee8f22789/41598_2022_21417_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffef/9546860/93d1743cb1f1/41598_2022_21417_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffef/9546860/cecaa11304ef/41598_2022_21417_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffef/9546860/b8d1c61b7ef3/41598_2022_21417_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffef/9546860/97b7875130fa/41598_2022_21417_Fig7_HTML.jpg

相似文献

1
Multi-omics disease module detection with an explainable Greedy Decision Forest.基于可解释的贪心决策森林的多组学疾病模块检测。
Sci Rep. 2022 Oct 7;12(1):16857. doi: 10.1038/s41598-022-21417-8.
2
The Trifecta of Single-Cell, Systems-Biology, and Machine-Learning Approaches.单细胞、系统生物学和机器学习方法的三重奏。
Genes (Basel). 2021 Jul 20;12(7):1098. doi: 10.3390/genes12071098.
3
Advancing drug-response prediction using multi-modal and -omics machine learning integration (MOMLIN): a case study on breast cancer clinical data.利用多模态和组学机器学习集成(MOMLIN)推进药物反应预测:乳腺癌临床数据案例研究。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae300.
4
Using machine learning approaches for multi-omics data analysis: A review.使用机器学习方法进行多组学数据分析:综述
Biotechnol Adv. 2021 Jul-Aug;49:107739. doi: 10.1016/j.biotechadv.2021.107739. Epub 2021 Mar 29.
5
Precision oncology: a review to assess interpretability in several explainable methods.精准肿瘤学:评估几种可解释方法的可解释性的综述。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad200.
6
Are innovation and new technologies in precision medicine paving a new era in patients centric care?精准医学中的创新和新技术是否正在开创以患者为中心的护理新时代?
J Transl Med. 2019 Apr 5;17(1):114. doi: 10.1186/s12967-019-1864-9.
7
From imaging a single cell to implementing precision medicine: an exciting new era.从对单个细胞进行成像到实施精准医学:激动人心的新时代。
Emerg Top Life Sci. 2021 Dec 21;5(6):837-847. doi: 10.1042/ETLS20210219.
8
Harnessing Deep Learning for Omics in an Era of COVID-19.利用深度学习应对 COVID-19 时代的组学研究。
OMICS. 2023 Apr;27(4):141-152. doi: 10.1089/omi.2022.0155.
9
Machine learning meets omics: applications and perspectives.机器学习与组学的融合:应用与展望。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab460.
10
Multi-Omics Model Applied to Cancer Genetics.多组学模型在癌症遗传学中的应用。
Int J Mol Sci. 2021 May 27;22(11):5751. doi: 10.3390/ijms22115751.

引用本文的文献

1
Feature graphs for interpretable unsupervised tree ensembles: centrality, interaction, and application in disease subtyping.用于可解释无监督树集成的特征图:中心性、相互作用及其在疾病亚型分类中的应用
BioData Min. 2025 Feb 15;18(1):15. doi: 10.1186/s13040-025-00430-3.
2
LAVASET: Latent Variable Stochastic Ensemble of Trees. An ensemble method for correlated datasets with spatial, spectral, and temporal dependencies.LAVASET:潜在变量随机树集成。一种用于具有空间、光谱和时间依赖性的相关数据集的集成方法。
Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae101.
3
A machine learning and deep learning-based integrated multi-omics technique for leukemia prediction.

本文引用的文献

1
GNN-SubNet: disease subnetwork detection with explainable graph neural networks.GNN-SubNet:基于可解释图神经网络的疾病子网络检测。
Bioinformatics. 2022 Sep 16;38(Suppl_2):ii120-ii126. doi: 10.1093/bioinformatics/btac478.
2
A Signature of 14 Long Non-Coding RNAs (lncRNAs) as a Step towards Precision Diagnosis for NSCLC.14种长链非编码RNA(lncRNA)作为迈向非小细胞肺癌精准诊断的标志
Cancers (Basel). 2022 Jan 16;14(2):439. doi: 10.3390/cancers14020439.
3
Higher-Order Explanations of Graph Neural Networks via Relevant Walks.通过相关游走对图神经网络进行高阶解释。
一种基于机器学习和深度学习的用于白血病预测的集成多组学技术。
Heliyon. 2024 Feb 1;10(3):e25369. doi: 10.1016/j.heliyon.2024.e25369. eCollection 2024 Feb 15.
4
[Explainable artificial intelligence in pathology].[病理学中的可解释人工智能]
Pathologie (Heidelb). 2024 Mar;45(2):133-139. doi: 10.1007/s00292-024-01308-7. Epub 2024 Feb 5.
5
FHBF: Federated hybrid boosted forests with dropout rates for supervised learning tasks across highly imbalanced clinical datasets.FHBF:用于高度不平衡临床数据集上监督学习任务的具有辍学率的联邦混合增强森林
Patterns (N Y). 2024 Jan 12;5(1):100893. doi: 10.1016/j.patter.2023.100893.
6
DM-MOGA: a multi-objective optimization genetic algorithm for identifying disease modules of non-small cell lung cancer.DM-MOGA:一种用于识别非小细胞肺癌疾病模块的多目标优化遗传算法。
BMC Bioinformatics. 2023 Jan 9;24(1):13. doi: 10.1186/s12859-023-05136-z.
IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):7581-7596. doi: 10.1109/TPAMI.2021.3115452. Epub 2022 Oct 4.
4
DeepOmix: A scalable and interpretable multi-omics deep learning framework and application in cancer survival analysis.深度混合模型(DeepOmix):一种可扩展且可解释的多组学深度学习框架及其在癌症生存分析中的应用。
Comput Struct Biotechnol J. 2021 May 1;19:2719-2725. doi: 10.1016/j.csbj.2021.04.067. eCollection 2021.
5
Explaining decisions of graph convolutional neural networks: patient-specific molecular subnetworks responsible for metastasis prediction in breast cancer.解释图卷积神经网络决策:乳腺癌转移预测中与患者特异性相关的分子子网络。
Genome Med. 2021 Mar 11;13(1):42. doi: 10.1186/s13073-021-00845-7.
6
BIRC5 is a prognostic biomarker associated with tumor immune cell infiltration.BIRC5 是与肿瘤免疫细胞浸润相关的预后生物标志物。
Sci Rep. 2021 Jan 11;11(1):390. doi: 10.1038/s41598-020-79736-7.
7
Differential Expression of ADP/ATP Carriers as a Biomarker of Metabolic Remodeling and Survival in Kidney Cancers.ADP/ATP载体的差异表达作为肾癌代谢重塑和生存的生物标志物
Biomolecules. 2020 Dec 30;11(1):38. doi: 10.3390/biom11010038.
8
A hierarchical clustering and data fusion approach for disease subtype discovery.一种用于疾病亚型发现的层次聚类和数据融合方法。
J Biomed Inform. 2021 Jan;113:103636. doi: 10.1016/j.jbi.2020.103636. Epub 2020 Nov 30.
9
From Local Explanations to Global Understanding with Explainable AI for Trees.利用可解释人工智能实现从局部解释到树木的全局理解
Nat Mach Intell. 2020 Jan;2(1):56-67. doi: 10.1038/s42256-019-0138-9. Epub 2020 Jan 17.
10
GNNExplainer: Generating Explanations for Graph Neural Networks.GNNExplainer:为图神经网络生成解释
Adv Neural Inf Process Syst. 2019 Dec;32:9240-9251.