Suppr超能文献

贝叶斯半参数回归模型评估通路对连续和二分类临床结局的影响。

Bayesian semiparametric regression models for evaluating pathway effects on continuous and binary clinical outcomes.

机构信息

Department of Statistics, Virginia Polytechnic Institute and State University, Blacksburg, VA, U.S.A.

出版信息

Stat Med. 2012 Jul 10;31(15):1633-51. doi: 10.1002/sim.4493. Epub 2012 Mar 22.

Abstract

Many statistical methods for microarray data analysis consider one gene at a time, and they may miss subtle changes at the single gene level. This limitation may be overcome by considering a set of genes simultaneously where the gene sets are derived from prior biological knowledge. Limited work has been carried out in the regression setting to study the effects of clinical covariates and expression levels of genes in a pathway either on a continuous or on a binary clinical outcome. Hence, we propose a Bayesian approach for identifying pathways related to both types of outcomes. We compare our Bayesian approaches with a likelihood-based approach that was developed by relating a least squares kernel machine for nonparametric pathway effect with a restricted maximum likelihood for variance components. Unlike the likelihood-based approach, the Bayesian approach allows us to directly estimate all parameters and pathway effects. It can incorporate prior knowledge into Bayesian hierarchical model formulation and makes inference by using the posterior samples without asymptotic theory. We consider several kernels (Gaussian, polynomial, and neural network kernels) to characterize gene expression effects in a pathway on clinical outcomes. Our simulation results suggest that the Bayesian approach has more accurate coverage probability than the likelihood-based approach, and this is especially so when the sample size is small compared with the number of genes being studied in a pathway. We demonstrate the usefulness of our approaches through its applications to a type II diabetes mellitus data set. Our approaches can also be applied to other settings where a large number of strongly correlated predictors are present.

摘要

许多微阵列数据分析的统计方法一次只考虑一个基因,它们可能会错过单个基因水平上的细微变化。通过同时考虑一组基因,可以克服这一局限性,其中基因集来自先前的生物学知识。在回归设置中,已经开展了有限的工作来研究临床协变量和途径中基因的表达水平对连续或二进制临床结果的影响。因此,我们提出了一种贝叶斯方法来识别与这两种结果都相关的途径。我们将我们的贝叶斯方法与基于似然的方法进行了比较,该方法通过将非参数途径效应的最小二乘核机器与方差分量的最大限制似然相关联来开发。与基于似然的方法不同,贝叶斯方法允许我们直接估计所有参数和途径效应。它可以将先验知识纳入贝叶斯层次模型的公式化中,并通过使用后验样本而无需渐近理论进行推断。我们考虑了几种核(高斯核、多项式核和神经网络核)来描述途径中基因表达对临床结果的影响。我们的模拟结果表明,贝叶斯方法的覆盖率概率比基于似然的方法更准确,尤其是当样本量与途径中研究的基因数量相比较小时。我们通过将其应用于 II 型糖尿病数据集来证明我们方法的有效性。我们的方法还可以应用于存在大量强相关预测因子的其他环境中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dfa/3763871/880bbf097ef9/nihms455579f1.jpg

相似文献

4
Semiparametric Bayesian kernel survival model for evaluating pathway effects.半参数贝叶斯核生存模型用于评估途径效应。
Stat Methods Med Res. 2019 Oct-Nov;28(10-11):3301-3317. doi: 10.1177/0962280218797360. Epub 2018 Oct 5.

引用本文的文献

6
Kernel machine SNP-set testing under multiple candidate kernels.基于多个候选核的核机器 SNP 集检验。
Genet Epidemiol. 2013 Apr;37(3):267-75. doi: 10.1002/gepi.21715. Epub 2013 Mar 7.

本文引用的文献

3
Association of SGK1 gene polymorphisms with type 2 diabetes.SGK1基因多态性与2型糖尿病的关联。
Cell Physiol Biochem. 2008;21(1-3):151-60. doi: 10.1159/000113757. Epub 2008 Jan 16.
5
Pathway analysis using random forests classification and regression.使用随机森林分类和回归的通路分析
Bioinformatics. 2006 Aug 15;22(16):2028-36. doi: 10.1093/bioinformatics/btl344. Epub 2006 Jun 29.
10
BagBoosting for tumor classification with gene expression data.用于基于基因表达数据的肿瘤分类的BagBoosting算法
Bioinformatics. 2004 Dec 12;20(18):3583-93. doi: 10.1093/bioinformatics/bth447. Epub 2004 Oct 5.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验