基于家系的下一代测序关联研究。

Family-based association studies for next-generation sequencing.

机构信息

Human Genetics Center and Division of Biostatistics, The University of Texas School of Public Health, Houston, 77030, USA.

出版信息

Am J Hum Genet. 2012 Jun 8;90(6):1028-45. doi: 10.1016/j.ajhg.2012.04.022.

DOI:10.1016/j.ajhg.2012.04.022

PMID:22682329

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3370281/

Abstract

An individual's disease risk is determined by the compounded action of both common variants, inherited from remote ancestors, that segregated within the population and rare variants, inherited from recent ancestors, that segregated mainly within pedigrees. Next-generation sequencing (NGS) technologies generate high-dimensional data that allow a nearly complete evaluation of genetic variation. Despite their promise, NGS technologies also suffer from remarkable limitations: high error rates, enrichment of rare variants, and a large proportion of missing values, as well as the fact that most current analytical methods are designed for population-based association studies. To meet the analytical challenges raised by NGS, we propose a general framework for sequence-based association studies that can use various types of family and unrelated-individual data sampled from any population structure and a universal procedure that can transform any population-based association test statistic for use in family-based association tests. We develop family-based functional principal-component analysis (FPCA) with or without smoothing, a generalized T(2), combined multivariate and collapsing (CMC) method, and single-marker association test statistics. Through intensive simulations, we demonstrate that the family-based smoothed FPCA (SFPCA) has the correct type I error rates and much more power to detect association of (1) common variants, (2) rare variants, (3) both common and rare variants, and (4) variants with opposite directions of effect from other population-based or family-based association analysis methods. The proposed statistics are applied to two data sets with pedigree structures. The results show that the smoothed FPCA has a much smaller p value than other statistics.

摘要

个体的疾病风险是由共同变体和罕见变体共同作用决定的，这些变体既有来自远古祖先的常见遗传变体，也有来自近代祖先的罕见遗传变体。这些变体在人群中分离，或者主要在系谱中分离。下一代测序（NGS）技术产生的高维数据几乎可以完全评估遗传变异。尽管 NGS 技术有很大的潜力，但它们也有显著的局限性：高错误率、稀有变体的富集、大量缺失值，以及大多数当前分析方法是为基于人群的关联研究而设计的。为了应对 NGS 带来的分析挑战，我们提出了一个基于序列的关联研究的通用框架，该框架可以使用来自任何人群结构的各种类型的家族和无关个体数据，以及一种通用的程序，可以将任何基于人群的关联测试统计量转换为用于家族关联测试的统计量。我们开发了基于家族的功能主成分分析（FPCA），包括平滑和非平滑的 FPCA、广义 T(2)、组合多变量和合并（CMC）方法以及单标记关联测试统计量。通过密集的模拟，我们证明了基于家族的平滑 FPCA（SFPCA）具有正确的 I 型错误率，并且在检测（1）常见变体、（2）罕见变体、（3）常见和罕见变体以及（4）与其他基于人群或家族的关联分析方法具有相反作用方向的变体的关联方面具有更高的功效。所提出的统计方法应用于具有系谱结构的两个数据集。结果表明，平滑 FPCA 的 p 值比其他统计方法小得多。

相似文献

Family-based association studies for next-generation sequencing.基于家系的下一代测序关联研究。

Am J Hum Genet. 2012 Jun 8;90(6):1028-45. doi: 10.1016/j.ajhg.2012.04.022.

Association studies for next-generation sequencing.下一代测序的关联研究。

Genome Res. 2011 Jul;21(7):1099-108. doi: 10.1101/gr.115998.110. Epub 2011 Apr 26.

Weighted pedigree-based statistics for testing the association of rare variants.基于加权家系的统计方法用于检验罕见变异的关联。

BMC Genomics. 2012 Nov 24;13:667. doi: 10.1186/1471-2164-13-667.

Smoothed functional principal component analysis for testing association of the entire allelic spectrum of genetic variation.平滑功能主成分分析检验全等位基因谱遗传变异的关联。

Eur J Hum Genet. 2013 Feb;21(2):217-24. doi: 10.1038/ejhg.2012.141. Epub 2012 Jul 11.

A novel genome-information content-based statistic for genome-wide association analysis designed for next-generation sequencing data.一种基于基因组信息含量的新型统计方法，用于针对下一代测序数据的全基因组关联分析。

J Comput Biol. 2012 Jun;19(6):731-44. doi: 10.1089/cmb.2012.0035. Epub 2012 May 31.

Pathway analysis with next-generation sequencing data.利用下一代测序数据进行通路分析。

Eur J Hum Genet. 2015 Apr;23(4):507-15. doi: 10.1038/ejhg.2014.121. Epub 2014 Jul 2.

Power of family-based association designs to detect rare variants in large pedigrees using imputed genotypes.基于家系的关联设计在使用基因型推断的大型家系中检测罕见变异的能力。

Genet Epidemiol. 2014 Jan;38(1):1-9. doi: 10.1002/gepi.21776. Epub 2013 Nov 15.

Single-variant and multi-variant trend tests for genetic association with next-generation sequencing that are robust to sequencing error.对下一代测序基因关联进行单变量和多变量趋势检验，对测序错误具有稳健性。

Hum Hered. 2012;74(3-4):172-83. doi: 10.1159/000346824. Epub 2013 Apr 11.

Design of association studies with pooled or un-pooled next-generation sequencing data.基于汇集或未汇集下一代测序数据的关联研究设计。

Genet Epidemiol. 2010 Jul;34(5):479-91. doi: 10.1002/gepi.20501.

Resequencing of pooled DNA for detecting disease associations with rare variants.对 pooled DNA 进行重测序以检测与罕见变异相关的疾病关联。

Genet Epidemiol. 2010 Jul;34(5):492-501. doi: 10.1002/gepi.20502.

引用本文的文献

Personalized Nutrition: Tailoring Dietary Recommendations through Genetic Insights.个性化营养：通过基因洞察定制饮食建议。

Nutrients. 2024 Aug 13;16(16):2673. doi: 10.3390/nu16162673.

Identification of New Rare Variants Associated With Familial Autoimmune Thyroid Diseases by Deep Sequencing of Linked Loci.通过连锁区域的深度测序鉴定与家族性自身免疫性甲状腺疾病相关的新罕见变异。

J Clin Endocrinol Metab. 2021 Oct 21;106(11):e4680-e4687. doi: 10.1210/clinem/dgab440.

What is the right sequencing approach? Solo VS extended family analysis in consanguineous populations.正确的测序方法是什么？在血缘人群中，独奏与扩展家庭分析。

BMC Med Genomics. 2020 Jul 17;13(1):103. doi: 10.1186/s12920-020-00743-8.

Gene-based association analysis of survival traits via functional regression-based mixed effect cox models for related samples.基于功能回归的混合效应 Cox 模型对相关样本进行生存性状的基因关联分析。

Genet Epidemiol. 2019 Dec;43(8):952-965. doi: 10.1002/gepi.22254. Epub 2019 Sep 10.

Data-adaptive multi-locus association testing in subjects with arbitrary genealogical relationships.对具有任意谱系关系的受试者进行数据自适应多位点关联测试。

Stat Appl Genet Mol Biol. 2019 Apr 8;18(3):/j/sagmb.2019.18.issue-3/sagmb-2018-0030/sagmb-2018-0030.xml. doi: 10.1515/sagmb-2018-0030.

WISARD: workbench for integrated superfast association studies for related datasets.WISARD：用于相关数据集的集成超快速关联研究的工作台。

BMC Med Genomics. 2018 Apr 20;11(Suppl 2):39. doi: 10.1186/s12920-018-0345-y.

Detecting Multiethnic Rare Variants.检测多民族罕见变异体。

Methods Mol Biol. 2017;1666:527-538. doi: 10.1007/978-1-4939-7274-6_26.

A multistep approach to single nucleotide polymorphism-set analysis: an evaluation of power and type I error of gene-based tests of association after pathway-based association tests.一种用于单核苷酸多态性集分析的多步骤方法：基于通路的关联测试后基于基因的关联测试的效能和I型错误评估。

BMC Proc. 2016 Oct 18;10(Suppl 7):349-355. doi: 10.1186/s12919-016-0055-4. eCollection 2016.

Prioritization of family member sequencing for the detection of rare variants.为检测罕见变异对家庭成员测序进行优先级排序。

BMC Proc. 2016 Oct 18;10(Suppl 7):227-231. doi: 10.1186/s12919-016-0035-8. eCollection 2016.

A novel statistical method for rare-variant association studies in general pedigrees.一种用于一般家系中罕见变异关联研究的新型统计方法。

BMC Proc. 2016 Oct 18;10(Suppl 7):193-196. doi: 10.1186/s12919-016-0029-6. eCollection 2016.

本文引用的文献

Genomics is not enough.基因组学是不够的。

Science. 2011 Oct 7;334(6052):15. doi: 10.1126/science.1214458.

Clan genomics and the complex architecture of human disease.族基因组学与人类疾病的复杂结构。

Cell. 2011 Sep 30;147(1):32-43. doi: 10.1016/j.cell.2011.09.008.

Deep sequencing reveals 50 novel genes for recessive cognitive disorders.深度测序揭示 50 个隐性认知障碍的新基因。

Nature. 2011 Sep 21;478(7367):57-63. doi: 10.1038/nature10423.

Short interfering RNA against STAT1 attenuates cisplatin-induced ototoxicity in the rat by suppressing inflammation.短干扰 RNA 靶向 STAT1 抑制炎症减轻顺铂诱导的大鼠耳毒性。

Cell Death Dis. 2011 Jul 21;2(7):e180. doi: 10.1038/cddis.2011.63.

Epigenome-wide association studies for common human diseases.全基因组关联研究常见人类疾病。

Nat Rev Genet. 2011 Jul 12;12(8):529-41. doi: 10.1038/nrg3000.

Family-based designs for genome-wide association studies.基于家系的全基因组关联研究设计。

Nat Rev Genet. 2011 Jun 1;12(7):465-74. doi: 10.1038/nrg2989.

Association studies for next-generation sequencing.下一代测序的关联研究。

Genome Res. 2011 Jul;21(7):1099-108. doi: 10.1101/gr.115998.110. Epub 2011 Apr 26.

Testing for an unusual distribution of rare variants.检测罕见变异的异常分布。

PLoS Genet. 2011 Mar;7(3):e1001322. doi: 10.1371/journal.pgen.1001322. Epub 2011 Mar 3.

To identify associations with rare variants, just WHaIT: Weighted haplotype and imputation-based tests.为了鉴定罕见变异的关联，只需 WHaIT：加权单体型和基于推断的检验。

Am J Hum Genet. 2010 Nov 12;87(5):728-35. doi: 10.1016/j.ajhg.2010.10.014. Epub 2010 Nov 4.

The role of polymorphisms in circadian pathway genes in breast tumorigenesis.昼夜节律通路基因多态性在乳腺癌发生中的作用。

Breast Cancer Res Treat. 2011 Jun;127(2):531-40. doi: 10.1007/s10549-010-1231-2. Epub 2010 Oct 27.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验