家族数据中共享对照的罕见变异和二分类性状的关联评分检验。

Association score testing for rare variants and binary traits in family data with shared controls.

机构信息

Department of Biostatistics, University of Washington, Seattle, USA.

Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, USA.

出版信息

Brief Bioinform. 2019 Jan 18;20(1):245-253. doi: 10.1093/bib/bbx107.

DOI:10.1093/bib/bbx107

PMID:28968627

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6357552/

Abstract

Genome-wide association studies have been an important approach used to localize trait loci, with primary focus on common variants. The multiple rare variant-common disease hypothesis may explain the missing heritability remaining after accounting for identified common variants. Advances of sequencing technologies with their decreasing costs, coupled with methodological advances in the context of association studies in large samples, now make the study of rare variants at a genome-wide scale feasible. The resurgence of family-based association designs because of their advantage in studying rare variants has also stimulated more methods development, mainly based on linear mixed models (LMMs). Other tests such as score tests can have advantages over the LMMs, but to date have mainly been proposed for single-marker association tests. In this article, we extend several score tests (χcorrected2, WQLS, and SKAT) to the multiple variant association framework. We evaluate and compare their statistical performances relative with the LMM. Moreover, we show that three tests can be cast as the difference between marker allele frequencies (AFs) estimated in each of the group of affected and unaffected subjects. We show that these tests are flexible, as they can be based on related, unrelated or both related and unrelated subjects. They also make feasible an increasingly common design that only sequences a subset of affected subjects (related or unrelated) and uses for comparison publicly available AFs estimated in a group of healthy subjects. Finally, we show the great impact of linkage disequilibrium on the performance of all these tests.

摘要

全基因组关联研究一直是定位性状基因座的重要方法，主要关注常见变体。多种罕见变异-常见疾病假说可以解释在考虑到已鉴定的常见变体后仍然存在的遗传缺失。测序技术的进步及其成本的降低，加上在大样本关联研究背景下方法的进步，现在使得在全基因组范围内研究罕见变体成为可能。由于其在研究罕见变体方面的优势，基于家系的关联设计的复兴也刺激了更多方法的发展，主要基于线性混合模型（LMM）。其他测试，如评分测试，相对于 LMM 可能具有优势，但迄今为止主要是针对单标记关联测试提出的。在本文中，我们将几种评分测试（χ校正 2、WQLS 和 SKAT）扩展到多变体关联框架中。我们评估并比较了它们与 LMM 的统计性能。此外，我们表明，这三个测试可以表示在受影响和未受影响的对象的每个组中估计的标记等位基因频率（AF）之间的差异。我们表明这些测试具有灵活性，因为它们可以基于相关、不相关或相关和不相关的主体。它们还使得一种越来越常见的设计成为可能，即仅对部分受影响的主体（相关或不相关）进行测序，并使用一组健康主体中估计的公共 AF 进行比较。最后，我们表明连锁不平衡对所有这些测试的性能都有很大影响。

相似文献

Association score testing for rare variants and binary traits in family data with shared controls.家族数据中共享对照的罕见变异和二分类性状的关联评分检验。

Brief Bioinform. 2019 Jan 18;20(1):245-253. doi: 10.1093/bib/bbx107.

Joint association analysis of a binary and a quantitative trait in family samples.家系样本中二元性状和数量性状的联合关联分析。

Eur J Hum Genet. 2016 Jan;25(1):130-136. doi: 10.1038/ejhg.2016.134. Epub 2016 Oct 26.

A Comparison Study of Fixed and Mixed Effect Models for Gene Level Association Studies of Complex Traits.复杂性状基因水平关联研究中固定效应模型与混合效应模型的比较研究

Genet Epidemiol. 2016 Dec;40(8):702-721. doi: 10.1002/gepi.21984. Epub 2016 Jul 4.

Functional linear models for association analysis of quantitative traits.功能线性模型在数量性状关联分析中的应用。

Genet Epidemiol. 2013 Nov;37(7):726-42. doi: 10.1002/gepi.21757.

Combining family- and population-based imputation data for association analysis of rare and common variants in large pedigrees.结合基于家系和群体的插补数据，用于大型家系中罕见和常见变异的关联分析。

Genet Epidemiol. 2014 Nov;38(7):579-90. doi: 10.1002/gepi.21844. Epub 2014 Aug 1.

Rare variant association test with multiple phenotypes.针对多种表型的罕见变异关联测试。

Genet Epidemiol. 2017 Apr;41(3):198-209. doi: 10.1002/gepi.22021. Epub 2016 Dec 31.

The power comparison of the haplotype-based collapsing tests and the variant-based collapsing tests for detecting rare variants in pedigrees.基于单倍型的合并检验与基于变异的合并检验在系谱中检测罕见变异的效能比较。

BMC Genomics. 2014 Jul 28;15(1):632. doi: 10.1186/1471-2164-15-632.

A comparison study of multivariate fixed models and Gene Association with Multiple Traits (GAMuT) for next-generation sequencing.下一代测序中多变量固定模型与多性状基因关联分析（GAMuT）的比较研究

Genet Epidemiol. 2017 Jan;41(1):18-34. doi: 10.1002/gepi.22014. Epub 2016 Dec 5.

Weighted pedigree-based statistics for testing the association of rare variants.基于加权家系的统计方法用于检验罕见变异的关联。

BMC Genomics. 2012 Nov 24;13:667. doi: 10.1186/1471-2164-13-667.

Generalized functional linear models for gene-based case-control association studies.用于基于基因的病例对照关联研究的广义功能线性模型。

Genet Epidemiol. 2014 Nov;38(7):622-637. doi: 10.1002/gepi.21840. Epub 2014 Sep 9.

引用本文的文献

A novel rare variants association test for binary traits in family-based designs via copulas.基于 Copula 的家系设计中二元性状的新型罕见变异关联检验

Stat Methods Med Res. 2023 Nov;32(11):2096-2122. doi: 10.1177/09622802231197977. Epub 2023 Oct 13.

An Efficient Bayesian Method for Estimating the Degree of the Skewness of X Chromosome Inactivation Based on the Mixture of General Pedigrees and Unrelated Females.基于混合广义家系和无关女性的 X 染色体失活偏度程度的有效贝叶斯估计方法。

Biomolecules. 2023 Mar 16;13(3):543. doi: 10.3390/biom13030543.

本文引用的文献

Across-cohort QC analyses of GWAS summary statistics from complex traits.复杂性状全基因组关联研究汇总统计数据的跨队列质量控制分析。

Eur J Hum Genet. 2016 Jan;25(1):137-146. doi: 10.1038/ejhg.2016.106. Epub 2016 Aug 24.

Control for Population Structure and Relatedness for Binary Traits in Genetic Association Studies via Logistic Mixed Models.通过逻辑混合模型在遗传关联研究中对二元性状的群体结构和相关性进行控制。

Am J Hum Genet. 2016 Apr 7;98(4):653-66. doi: 10.1016/j.ajhg.2016.02.012. Epub 2016 Mar 24.

Family-based genome scan for age at onset of late-onset Alzheimer's disease in whole exome sequencing data.基于家系的全外显子组测序数据中晚发性阿尔茨海默病发病年龄的基因组扫描。

Genes Brain Behav. 2015 Nov;14(8):607-17. doi: 10.1111/gbb.12250. Epub 2015 Sep 23.

Further improvements to linear mixed models for genome-wide association studies.全基因组关联研究线性混合模型的进一步改进。

Sci Rep. 2014 Nov 12;4:6874. doi: 10.1038/srep06874.

Case-only exome sequencing and complex disease susceptibility gene discovery: study design considerations.仅病例外显子组测序与复杂疾病易感性基因发现：研究设计考量

J Med Genet. 2015 Jan;52(1):10-6. doi: 10.1136/jmedgenet-2014-102697. Epub 2014 Nov 4.

Genet Epidemiol. 2014 Nov;38(7):579-90. doi: 10.1002/gepi.21844. Epub 2014 Aug 1.

Rare-variant association analysis: study designs and statistical tests.罕见变异关联分析：研究设计与统计检验。

Am J Hum Genet. 2014 Jul 3;95(1):5-23. doi: 10.1016/j.ajhg.2014.06.009.

Association analysis using next-generation sequence data from publicly available control groups: the robust variance score statistic.利用公共可用对照组的下一代测序数据进行关联分析：稳健方差得分统计。

Bioinformatics. 2014 Aug 1;30(15):2179-88. doi: 10.1093/bioinformatics/btu196. Epub 2014 Apr 14.

Meta-analysis methods for genome-wide association studies and beyond.全基因组关联研究的荟萃分析方法及其他。

Nat Rev Genet. 2013 Jun;14(6):379-89. doi: 10.1038/nrg3472. Epub 2013 May 9.

Multiple genetic variant association testing by collapsing and kernel methods with pedigree or population structured data.基于家系或群体结构数据的连锁和核方法进行多重遗传变异关联测试。

Genet Epidemiol. 2013 Jul;37(5):409-18. doi: 10.1002/gepi.21727. Epub 2013 May 5.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验