Suppr超能文献

Netboost:支持提升的网络分析可改善急性髓系白血病和亨廷顿病的高维组学预测。

Netboost: Boosting-Supported Network Analysis Improves High-Dimensional Omics Prediction in Acute Myeloid Leukemia and Huntington's Disease.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2635-2648. doi: 10.1109/TCBB.2020.2983010. Epub 2021 Dec 8.

Abstract

State-of-the art selection methods fail to identify weak but cumulative effects of features found in many high-dimensional omics datasets. Nevertheless, these features play an important role in certain diseases. We present Netboost, a three-step dimension reduction technique. First, a boosting-based filter is combined with the topological overlap measure to identify the essential edges of the network. Second, sparse hierarchical clustering is applied on the selected edges to identify modules and finally module information is aggregated by the first principal components. We demonstrate the application of the newly developed Netboost in combination with CoxBoost for survival prediction of DNA methylation and gene expression data from 180 acute myeloid leukemia (AML) patients and show, based on cross-validated prediction error curve estimates, its prediction superiority over variable selection on the full dataset as well as over an alternative clustering approach. The identified signature related to chromatin modifying enzymes was replicated in an independent dataset, the phase II AMLSG 12-09 study. In a second application we combine Netboost with Random Forest classification and improve the disease classification error in RNA-sequencing data of Huntington's disease mice. Netboost is a freely available Bioconductor R package for dimension reduction and hypothesis generation in high-dimensional omics applications.

摘要

目前的选择方法无法识别在许多高维组学数据集中发现的特征的微弱但累积的影响。然而,这些特征在某些疾病中起着重要作用。我们提出了 Netboost,这是一种三步降维技术。首先,基于提升的过滤器与拓扑重叠度量相结合,以识别网络的基本边缘。其次,稀疏层次聚类应用于选定的边缘,以识别模块,最后通过第一主成分聚合模块信息。我们展示了新开发的 Netboost 与 CoxBoost 的联合应用,用于对来自 180 名急性髓系白血病 (AML)患者的 DNA 甲基化和基因表达数据进行生存预测,并基于交叉验证预测误差曲线估计,证明了其在全数据集上进行变量选择以及替代聚类方法的预测优势。与染色质修饰酶相关的鉴定特征在独立数据集,即 AMLSG 12-09 研究的第二阶段中得到了复制。在第二个应用中,我们将 Netboost 与随机森林分类相结合,提高了亨廷顿病小鼠 RNA 测序数据的疾病分类错误率。Netboost 是一个免费的 Bioconductor R 包,用于在高维组学应用中进行降维和假设生成。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验