病例对照基因关联研究中具有多个次要结局的SNP集分析的加权伪似然法

Weighted pseudolikelihood for SNP set analysis with multiple secondary outcomes in case-control genetic association studies.

作者信息

Sofer Tamar, Schifano Elizabeth D, Christiani David C, Lin Xihong

机构信息

Department of Biostatistics, University of Washington, Seattle, Washington 98105, U.S.A.

Department of Statistics, University of Connecticut, Storrs, Connecticut 06269, U.S.A.

出版信息

Biometrics. 2017 Dec;73(4):1210-1220. doi: 10.1111/biom.12680. Epub 2017 Mar 27.

DOI:10.1111/biom.12680

PMID:28346824

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5617769/

Abstract

We propose a weighted pseudolikelihood method for analyzing the association of a SNP set, example, SNPs in a gene or a genetic pathway or network, with multiple secondary phenotypes in case-control genetic association studies. To boost analysis power, we assume that the SNP-specific effects are shared across all secondary phenotypes using a scaled mean model. We estimate regression parameters using Inverse Probability Weighted (IPW) estimating equations obtained from the weighted pseudolikelihood, which accounts for case-control sampling to prevent potential ascertainment bias. To test the effect of a SNP set, we propose a weighted variance component pseudo-score test. We also propose a penalized IPW pseudolikelihood method for selecting a subset of SNPs that are associated with the multiple secondary phenotypes. We show that the proposed variable selection procedure has the oracle properties and is robust to misspecification of the correlation structure among secondary phenotypes. We select the tuning parameter using a weighted Bayesian Information-like Criterion (wBIC). We evaluate the finite sample performance of the proposed methods via simulations, and illustrate the methods by the analysis of the multiple secondary smoking behavior outcomes in a lung cancer case-control genetic association study.

摘要

我们提出了一种加权伪似然方法，用于在病例对照基因关联研究中分析单核苷酸多态性（SNP）集（例如，一个基因、一条遗传通路或网络中的SNP）与多个次要表型之间的关联。为了提高分析效能，我们使用缩放均值模型假设SNP特异性效应在所有次要表型中是共享的。我们使用从加权伪似然中获得的逆概率加权（IPW）估计方程来估计回归参数，该方程考虑了病例对照抽样以防止潜在的确定偏倚。为了检验SNP集的效应，我们提出了一种加权方差分量伪得分检验。我们还提出了一种惩罚IPW伪似然方法，用于选择与多个次要表型相关的SNP子集。我们表明，所提出的变量选择程序具有神谕属性，并且对次要表型之间相关结构的错误设定具有鲁棒性。我们使用加权贝叶斯信息准则（wBIC）选择调整参数。我们通过模拟评估所提出方法的有限样本性能，并通过对肺癌病例对照基因关联研究中多个次要吸烟行为结果的分析来说明这些方法。

相似文献

Weighted pseudolikelihood for SNP set analysis with multiple secondary outcomes in case-control genetic association studies.

Biometrics. 2017 Dec;73(4):1210-1220. doi: 10.1111/biom.12680. Epub 2017 Mar 27.

Genome-wide association analysis for multiple continuous secondary phenotypes.

Am J Hum Genet. 2013 May 2;92(5):744-59. doi: 10.1016/j.ajhg.2013.04.004.

Robust analysis of secondary phenotypes in case-control genetic association studies.

Stat Med. 2016 Oct 15;35(23):4226-37. doi: 10.1002/sim.6976. Epub 2016 May 30.

Retrospective likelihood-based methods for analyzing case-cohort genetic association studies.

Biometrics. 2015 Dec;71(4):960-8. doi: 10.1111/biom.12342. Epub 2015 Jul 14.

Estimation of odds ratios of genetic variants for the secondary phenotypes associated with primary diseases.

Genet Epidemiol. 2011 Apr;35(3):190-200. doi: 10.1002/gepi.20568. Epub 2011 Feb 9.

An efficient weighted tag SNP-set analytical method in genome-wide association studies.

BMC Genet. 2015 Mar 13;16:25. doi: 10.1186/s12863-015-0182-3.

Secondary phenotype analysis in ascertained family designs: application to the Leiden longevity study.

Stat Med. 2017 Jun 30;36(14):2288-2301. doi: 10.1002/sim.7281. Epub 2017 Mar 16.

Validity of using ad hoc methods to analyze secondary traits in case-control association studies.

Genet Epidemiol. 2016 Dec;40(8):732-743. doi: 10.1002/gepi.21994. Epub 2016 Sep 26.

Likelihood ratio tests in rare variant detection for continuous phenotypes.

Ann Hum Genet. 2014 Sep;78(5):320-32. doi: 10.1111/ahg.12071.

Analysis of secondary phenotypes in multigroup association studies.

Biometrics. 2020 Jun;76(2):606-618. doi: 10.1111/biom.13157. Epub 2019 Nov 11.

引用本文的文献

Estimation of total mediation effect for a binary trait in a case-control study for high-dimensional omics mediators.

bioRxiv. 2025 Feb 2:2025.01.28.635396. doi: 10.1101/2025.01.28.635396.

本文引用的文献

VARIABLE SELECTION FOR HIGH DIMENSIONAL MULTIVARIATE OUTCOMES.

Stat Sin. 2014 Oct;24(4):1633-1654. doi: 10.5705/ss.2013.019.

Penalized multimarker vs. single-marker regression methods for genome-wide association studies of quantitative traits.

Genetics. 2015 Jan;199(1):205-22. doi: 10.1534/genetics.114.167817. Epub 2014 Oct 28.

Efficient multivariate linear mixed model algorithms for genome-wide association studies.

Nat Methods. 2014 Apr;11(4):407-9. doi: 10.1038/nmeth.2848. Epub 2014 Feb 16.

A general regression framework for a secondary outcome in case-control studies.

Biostatistics. 2014 Jan;15(1):117-28. doi: 10.1093/biostatistics/kxt041. Epub 2013 Oct 22.

Quantitative trait analysis in sequencing studies under trait-dependent sampling.

Proc Natl Acad Sci U S A. 2013 Jul 23;110(30):12247-52. doi: 10.1073/pnas.1221713110. Epub 2013 Jul 11.

Genome-wide association analysis for multiple continuous secondary phenotypes.

Am J Hum Genet. 2013 May 2;92(5):744-59. doi: 10.1016/j.ajhg.2013.04.004.

Polygenic modeling with bayesian sparse linear mixed models.

PLoS Genet. 2013;9(2):e1003264. doi: 10.1371/journal.pgen.1003264. Epub 2013 Feb 7.

Optimal tests for rare variant effects in sequencing association studies.

Biostatistics. 2012 Sep;13(4):762-75. doi: 10.1093/biostatistics/kxs014. Epub 2012 Jun 14.

A Gaussian copula approach for the analysis of secondary phenotypes in case-control genetic association studies.

Biostatistics. 2012 Jul;13(3):497-508. doi: 10.1093/biostatistics/kxr025. Epub 2011 Sep 19.

Rare-variant association testing for sequencing data with the sequence kernel association test.

Am J Hum Genet. 2011 Jul 15;89(1):82-93. doi: 10.1016/j.ajhg.2011.05.029. Epub 2011 Jul 7.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

病例对照基因关联研究中具有多个次要结局的SNP集分析的加权伪似然法

Weighted pseudolikelihood for SNP set analysis with multiple secondary outcomes in case-control genetic association studies.

作者信息

Sofer Tamar, Schifano Elizabeth D, Christiani David C, Lin Xihong

机构信息

Department of Biostatistics, University of Washington, Seattle, Washington 98105, U.S.A.

Department of Statistics, University of Connecticut, Storrs, Connecticut 06269, U.S.A.

出版信息

Biometrics. 2017 Dec;73(4):1210-1220. doi: 10.1111/biom.12680. Epub 2017 Mar 27.

DOI:10.1111/biom.12680

PMID:28346824

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5617769/

Abstract

摘要

病例对照基因关联研究中具有多个次要结局的SNP集分析的加权伪似然法

Weighted pseudolikelihood for SNP set analysis with multiple secondary outcomes in case-control genetic association studies.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

病例对照基因关联研究中具有多个次要结局的SNP集分析的加权伪似然法

Weighted pseudolikelihood for SNP set analysis with multiple secondary outcomes in case-control genetic association studies.

作者信息

机构信息

出版信息