Suppr超能文献

一种在相依情况下选择高维数据协变量的统计方法。在肿瘤学中基因图谱分类的应用。

A statistical methodology to select covariates in high-dimensional data under dependence. Application to the classification of genetic profiles in oncology.

作者信息

Bastien B, Boukhobza T, Dumond H, Gégout-Petit A, Muller-Gueudin A, Thiébaut C

机构信息

Transgene S.A., Illkirch-Graffenstaden Cedex, France.

Université de Lorraine, CNRS, CRAN, Nancy, France.

出版信息

J Appl Stat. 2020 Oct 27;49(3):764-781. doi: 10.1080/02664763.2020.1837083. eCollection 2022.

Abstract

We propose a new methodology for selecting and ranking covariates associated with a variable of interest in a context of high-dimensional data under dependence but few observations. The methodology successively intertwines the clustering of covariates, decorrelation of covariates using Factor Latent Analysis, selection using aggregation of adapted methods and finally ranking. A simulation study shows the interest of the decorrelation inside the different clusters of covariates. We first apply our method to transcriptomic data of 37 patients with advanced non-small-cell lung cancer who have received chemotherapy, to select the transcriptomic covariates that explain the survival outcome of the treatment. Secondly, we apply our method to 79 breast tumor samples to define patient profiles for a new metastatic biomarker and associated gene network in order to personalize the treatments.

摘要

我们提出了一种新方法,用于在数据依赖但观测值较少的高维数据环境中,选择与感兴趣变量相关的协变量并对其进行排序。该方法依次将协变量聚类、使用因子潜在分析对协变量进行去相关、通过适配方法的聚合进行选择并最终进行排序。一项模拟研究表明了在不同协变量簇内进行去相关的意义。我们首先将我们的方法应用于37例接受化疗的晚期非小细胞肺癌患者的转录组数据,以选择解释治疗生存结果的转录组协变量。其次,我们将我们的方法应用于79个乳腺肿瘤样本,以定义一种新的转移生物标志物和相关基因网络的患者特征,从而实现个性化治疗。

相似文献

8
Model-free screening for variables with treatment interaction.无模型的治疗交互作用变量筛选。
Stat Methods Med Res. 2022 Oct;31(10):1845-1859. doi: 10.1177/09622802221102624. Epub 2022 May 29.

本文引用的文献

5
Micro-RNAs and breast cancer.微小 RNA 与乳腺癌。
Mol Oncol. 2010 Jun;4(3):230-41. doi: 10.1016/j.molonc.2010.04.009. Epub 2010 Apr 28.
10
Statistical significance for genomewide studies.全基因组研究的统计学显著性
Proc Natl Acad Sci U S A. 2003 Aug 5;100(16):9440-5. doi: 10.1073/pnas.1530509100. Epub 2003 Jul 25.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验