• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在病例对照关联研究中使用临时方法分析次要性状的有效性。

Validity of using ad hoc methods to analyze secondary traits in case-control association studies.

作者信息

Yung Godwin, Lin Xihong

机构信息

Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America.

出版信息

Genet Epidemiol. 2016 Dec;40(8):732-743. doi: 10.1002/gepi.21994. Epub 2016 Sep 26.

DOI:10.1002/gepi.21994
PMID:27670932
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5170877/
Abstract

Case-control association studies often collect from their subjects information on secondary phenotypes. Reusing the data and studying the association between genes and secondary phenotypes provide an attractive and cost-effective approach that can lead to discovery of new genetic associations. A number of approaches have been proposed, including simple and computationally efficient ad hoc methods that ignore ascertainment or stratify on case-control status. Justification for these approaches relies on the assumption of no covariates and the correct specification of the primary disease model as a logistic model. Both might not be true in practice, for example, in the presence of population stratification or the primary disease model following a probit model. In this paper, we investigate the validity of ad hoc methods in the presence of covariates and possible disease model misspecification. We show that in taking an ad hoc approach, it may be desirable to include covariates that affect the primary disease in the secondary phenotype model, even though these covariates are not necessarily associated with the secondary phenotype. We also show that when the disease is rare, ad hoc methods can lead to severely biased estimation and inference if the true disease model follows a probit model instead of a logistic model. Our results are justified theoretically and via simulations. Applied to real data analysis of genetic associations with cigarette smoking, ad hoc methods collectively identified as highly significant (P<10-5) single nucleotide polymorphisms from over 10 genes, genes that were identified in previous studies of smoking cessation.

摘要

病例对照关联研究通常会收集其研究对象的次要表型信息。重新利用这些数据并研究基因与次要表型之间的关联,提供了一种有吸引力且具有成本效益的方法,可能会带来新的基因关联的发现。已经提出了许多方法,包括简单且计算效率高的临时方法,这些方法忽略了样本确定过程或按病例对照状态进行分层。这些方法的合理性依赖于无协变量的假设以及将原发性疾病模型正确设定为逻辑模型。在实际中这两者可能都不成立,例如,存在群体分层或原发性疾病模型遵循概率单位模型的情况。在本文中,我们研究了在存在协变量和可能的疾病模型错误设定的情况下临时方法的有效性。我们表明,采用临时方法时,在次要表型模型中纳入影响原发性疾病的协变量可能是可取的,即使这些协变量不一定与次要表型相关。我们还表明,当疾病罕见时,如果真实的疾病模型遵循概率单位模型而非逻辑模型,临时方法可能会导致严重有偏的估计和推断。我们的结果在理论上和通过模拟得到了验证。应用于与吸烟的基因关联的实际数据分析时,临时方法共同从超过10个基因中识别出了被确定为高度显著(P<10-5)的单核苷酸多态性,这些基因是在先前戒烟研究中被识别出的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c41b/5170877/8c31d70d7b97/nihms802111f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c41b/5170877/8e89d656a38d/nihms802111f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c41b/5170877/df3508b96a0a/nihms802111f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c41b/5170877/46aab6c53e0e/nihms802111f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c41b/5170877/91e682d46d29/nihms802111f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c41b/5170877/b08846935ee7/nihms802111f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c41b/5170877/8c31d70d7b97/nihms802111f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c41b/5170877/8e89d656a38d/nihms802111f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c41b/5170877/df3508b96a0a/nihms802111f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c41b/5170877/46aab6c53e0e/nihms802111f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c41b/5170877/91e682d46d29/nihms802111f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c41b/5170877/b08846935ee7/nihms802111f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c41b/5170877/8c31d70d7b97/nihms802111f6.jpg

相似文献

1
Validity of using ad hoc methods to analyze secondary traits in case-control association studies.在病例对照关联研究中使用临时方法分析次要性状的有效性。
Genet Epidemiol. 2016 Dec;40(8):732-743. doi: 10.1002/gepi.21994. Epub 2016 Sep 26.
2
A novel association test for multiple secondary phenotypes from a case-control GWAS.一种针对病例对照全基因组关联研究中多个次要表型的新型关联测试。
Genet Epidemiol. 2017 Jul;41(5):413-426. doi: 10.1002/gepi.22045. Epub 2017 Apr 10.
3
Testing Hardy-Weinberg proportions in a frequency-matched case-control genetic association study.在频数匹配的病例对照遗传关联研究中检验 Hardy-Weinberg 平衡。
PLoS One. 2011;6(11):e27642. doi: 10.1371/journal.pone.0027642. Epub 2011 Nov 14.
4
A general approach to testing for pleiotropy with rare and common variants.一种用于检测罕见和常见变异的多效性的通用方法。
Genet Epidemiol. 2017 Feb;41(2):163-170. doi: 10.1002/gepi.22011. Epub 2016 Nov 30.
5
Estimation of odds ratios of genetic variants for the secondary phenotypes associated with primary diseases.遗传变异与主要疾病相关次要表型的优势比估计。
Genet Epidemiol. 2011 Apr;35(3):190-200. doi: 10.1002/gepi.20568. Epub 2011 Feb 9.
6
Using cases to strengthen inference on the association between single nucleotide polymorphisms and a secondary phenotype in genome-wide association studies.使用案例加强全基因组关联研究中单个核苷酸多态性与次要表型之间关联的推断。
Genet Epidemiol. 2010 Jul;34(5):427-33. doi: 10.1002/gepi.20495.
7
A multi-locus genetic association test for a dichotomous trait and its secondary phenotype.多基因关联检验用于二分类性状及其次要表型。
Stat Methods Med Res. 2018 May;27(5):1464-1475. doi: 10.1177/0962280216662071. Epub 2016 Aug 8.
8
Genome-wide association analysis for multiple continuous secondary phenotypes.全基因组关联分析多个连续次要表型。
Am J Hum Genet. 2013 May 2;92(5):744-59. doi: 10.1016/j.ajhg.2013.04.004.
9
Robust analysis of secondary phenotypes in case-control genetic association studies.病例对照基因关联研究中次要表型的稳健分析。
Stat Med. 2016 Oct 15;35(23):4226-37. doi: 10.1002/sim.6976. Epub 2016 May 30.
10
Methods for testing association between uncertain genotypes and quantitative traits.用于检验不确定基因型与数量性状之间关联的方法。
Biostatistics. 2011 Jan;12(1):1-17. doi: 10.1093/biostatistics/kxq039. Epub 2010 Jun 11.

引用本文的文献

1
Feasibility and Accuracy of a Computer-Assisted Self-Interviewing Instrument to Ascertain Prior Immunization With Human Papillomavirus Vaccine by Self-Report: Cross-Sectional Analysis.通过自我报告确定既往人乳头瘤病毒疫苗免疫接种情况的计算机辅助自我访谈工具的可行性和准确性:横断面分析
JMIR Med Inform. 2020 Jan 22;8(1):e16487. doi: 10.2196/16487.
2
Genome-wide analyses of psychological resilience in U.S. Army soldiers.美国陆军士兵心理韧性的全基因组分析。
Am J Med Genet B Neuropsychiatr Genet. 2019 Jul;180(5):310-319. doi: 10.1002/ajmg.b.32730. Epub 2019 May 13.
3
Genome-wide analysis of insomnia disorder.

本文引用的文献

1
Axonal guidance signaling pathway interacting with smoking in modifying the risk of pancreatic cancer: a gene- and pathway-based interaction analysis of GWAS data.轴突导向信号通路与吸烟相互作用,改变胰腺癌风险:基于 GWAS 数据的基因和通路交互分析。
Carcinogenesis. 2014 May;35(5):1039-45. doi: 10.1093/carcin/bgu010. Epub 2014 Jan 13.
2
A general regression framework for a secondary outcome in case-control studies.病例对照研究中二级结局的一般回归框架。
Biostatistics. 2014 Jan;15(1):117-28. doi: 10.1093/biostatistics/kxt041. Epub 2013 Oct 22.
3
Genome-wide association analysis for multiple continuous secondary phenotypes.
失眠症的全基因组分析。
Mol Psychiatry. 2018 Nov;23(11):2238-2250. doi: 10.1038/s41380-018-0033-5. Epub 2018 Mar 8.
4
Multiple phenotype association tests using summary statistics in genome-wide association studies.在全基因组关联研究中使用汇总统计量进行多表型关联测试。
Biometrics. 2018 Mar;74(1):165-175. doi: 10.1111/biom.12735. Epub 2017 Jun 26.
全基因组关联分析多个连续次要表型。
Am J Hum Genet. 2013 May 2;92(5):744-59. doi: 10.1016/j.ajhg.2013.04.004.
4
A meta-analysis identifies new loci associated with body mass index in individuals of African ancestry.一项荟萃分析确定了与非裔个体体重指数相关的新位点。
Nat Genet. 2013 Jun;45(6):690-6. doi: 10.1038/ng.2608. Epub 2013 Apr 14.
5
Informed conditioning on clinical covariates increases power in case-control association studies.在病例对照关联研究中,基于临床协变量进行有信息的条件分析可以提高功效。
PLoS Genet. 2012;8(11):e1003032. doi: 10.1371/journal.pgen.1003032. Epub 2012 Nov 8.
6
Analysis of secondary phenotype involving the interactive effect of the secondary phenotype and genetic variants on the primary disease.对涉及次要表型与遗传变异对原发性疾病的交互作用的次要表型进行分析。
Ann Hum Genet. 2012 Nov;76(6):484-99. doi: 10.1111/j.1469-1809.2012.00725.x. Epub 2012 Aug 10.
7
Including known covariates can reduce power to detect genetic effects in case-control studies.在病例对照研究中,纳入已知协变量会降低检测遗传效应的功效。
Nat Genet. 2012 Jul 22;44(8):848-51. doi: 10.1038/ng.2346.
8
A genome-wide scan of Ashkenazi Jewish Crohn's disease suggests novel susceptibility loci.全基因组扫描分析阿什肯纳兹犹太人群的克罗恩病提示新的易感性基因座。
PLoS Genet. 2012;8(3):e1002559. doi: 10.1371/journal.pgen.1002559. Epub 2012 Mar 8.
9
Meta-analysis identifies common variants associated with body mass index in east Asians.荟萃分析鉴定出与东亚人体重指数相关的常见变异。
Nat Genet. 2012 Feb 19;44(3):307-11. doi: 10.1038/ng.1087.
10
A Gaussian copula approach for the analysis of secondary phenotypes in case-control genetic association studies.基于高斯 Copula 的病例对照遗传关联研究中二级表型分析方法。
Biostatistics. 2012 Jul;13(3):497-508. doi: 10.1093/biostatistics/kxr025. Epub 2011 Sep 19.