Rare-variant association testing for sequencing data with the sequence kernel association test.

Affiliation

Department of Biostatistics, The University of North Carolina at Chapel Hill, 27599, USA.

Publication information

Am J Hum Genet. 2011 Jul 15;89(1):82-93. doi: 10.1016/j.ajhg.2011.05.029. Epub 2011 Jul 7.

DOI: 10.1016/j.ajhg.2011.05.029

PMID: 21737059

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3135811/

Abstract

Sequencing studies are increasingly being conducted to identify rare variants associated with complex traits. The limited power of classical single-marker association analysis for rare variants poses a central challenge in such studies. We propose the sequence kernel association test (SKAT), a supervised, flexible, computationally efficient regression method to test for association between genetic variants (common and rare) in a region and a continuous or dichotomous trait while easily adjusting for covariates. As a score-based variance-component test, SKAT can quickly calculate p values analytically by fitting the null model containing only the covariates, and so can easily be applied to genome-wide data. Using SKAT to analyze a genome-wide sequencing study of 1000 individuals, by segmenting the whole genome into 30 kb regions, requires only 7 hr on a laptop. Through analysis of simulated data across a wide range of practical scenarios and triglyceride data from the Dallas Heart Study, we show that SKAT can substantially outperform several alternative rare-variant association tests. We also provide analytic power and sample-size calculations to help design candidate-gene, whole-exome, and whole-genome sequence association studies.
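
The score-based variance-component idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation, and it rests on several assumptions that the abstract does not state: a continuous trait, a null model with an intercept and a single covariate, a weighted linear kernel with Beta(1, 25) density weights on minor-allele frequency, and a two-moment (Satterthwaite) chi-square approximation standing in for an exact mixture-of-chi-squares p-value calculation. The function names `residualize`, `chi2_sf`, and `skat_sketch` are hypothetical.

```python
import math
import random

def residualize(v, x):
    """Residuals of v after least-squares regression on an intercept and one covariate x."""
    n = len(v)
    mv, mx = sum(v) / n, sum(x) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    slope = sum((xi - mx) * (vi - mv) for xi, vi in zip(x, v)) / sxx
    return [vi - mv - slope * (xi - mx) for xi, vi in zip(x, v)]

def chi2_sf(x, df):
    """Upper tail of a chi-square distribution via the incomplete-gamma series.
    Adequate for moderate x; very small p-values lose precision."""
    if x <= 0:
        return 1.0
    a, z = df / 2.0, x / 2.0
    term = total = 1.0 / a
    for k in range(1, 10000):
        term *= z / (a + k)
        total += term
        if term < total * 1e-15:
            break
    lower = total * math.exp(a * math.log(z) - z - math.lgamma(a))
    return max(0.0, 1.0 - lower)

def skat_sketch(y, x, genotypes, weights):
    """genotypes: one list per variant holding 0/1/2 minor-allele counts per subject.
    Returns the score statistic Q and an approximate p-value."""
    n = len(y)
    # Fit the null model (covariates only) once; genotypes are never regressed on y.
    r = residualize(y, x)
    sigma2 = sum(ri ** 2 for ri in r) / (n - 2)
    # Q = sum_j w_j * (sum_i G_ij * r_i)^2, the quadratic form r' G W G' r.
    q = sum(w * sum(g * ri for g, ri in zip(gj, r)) ** 2
            for gj, w in zip(genotypes, weights))
    # Weighted genotype columns with the covariates projected out.
    z = [residualize([math.sqrt(w) * g for g in gj], x)
         for gj, w in zip(genotypes, weights)]
    a = [[sigma2 * sum(u * v for u, v in zip(zj, zk)) for zk in z] for zj in z]
    tr1 = sum(a[j][j] for j in range(len(z)))   # sum of null eigenvalues
    tr2 = sum(v * v for row in a for v in row)  # sum of squared eigenvalues
    # Satterthwaite approximation (assumed here): Q ~ scale * chi-square(df).
    scale, df = tr2 / tr1, tr1 ** 2 / tr2
    return q, chi2_sf(q / scale, df)

# Toy data simulated under the null: the trait depends only on the covariate.
random.seed(1)
n = 200
x = [random.gauss(0.0, 1.0) for _ in range(n)]
y = [0.5 * xi + random.gauss(0.0, 1.0) for xi in x]
genotypes = [[(random.random() < 0.05) + (random.random() < 0.05) for _ in range(n)]
             for _ in range(5)]
mafs = [sum(g) / (2 * n) for g in genotypes]
# Assumed weighting: w_j is the squared Beta(1, 25) density at the variant's MAF.
weights = [(25.0 * (1.0 - m) ** 24) ** 2 for m in mafs]
q_stat, p_value = skat_sketch(y, x, genotypes, weights)
print(q_stat, p_value)
```

Because the residuals come from a single null fit, scanning many regions only repeats the cheap quadratic-form step for each region, which is consistent with the abstract's point about genome-wide feasibility.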
